Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.
Requirements:
- TensorFlow (https://www.tensorflow.org/versions/r0.10/get_started/os_setup.html)
- Colorama (
pip install colorama
)
References:
- Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. Machine Learning, 11:157–163, 1994.
- W. T. Uther and M. Veloso. Adversarial reinforcement learning, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, 1997.
- R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.