chemicals in bottles
> AI Notakto

Can a “computer” win a game without being programmed with a strategy? Notakto is a variation of tic-tac-toe where both players play “X” and try to AVOID making three-in-a-row. We’ll use cups to stand in for our computer, tell it a list of valid moves, and reward it for winning. Will our computer find the optimal winning strategy?

Videos:

Popular Works:

Technical Works:

  • Reinforcement Learning in Practice by Yuxi Li: A very broad discussion of reinforcement learning that cites papers where the reader may go deeper into problems that they find interesting.