Skip to content

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Notifications You must be signed in to change notification settings

3cky/tensorflow-rl-tictactoe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Requirements:

References:

  • Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. Machine Learning, 11:157–163, 1994.
  • W. T. Uther and M. Veloso. Adversarial reinforcement learning, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, 1997.
  • R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.

About

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages