Tic-Tac-Toe the Hard Way

David delves into questions about data and training for his model, including: What does a tic-tac-toe board “look” like to ML? Plus, an intro to reinforcement learning, the approach Yannick will be taking.

Show Notes

How should David represent the data needed to train his machine learning system? What does a tic-tac-toe board “look” like to ML? Should he train it on games or on individual boards? How does this decision affect how and how well the machine will learn to play? Plus, an intro to reinforcement learning, the approach Yannick will be taking.

For more information about the show, check out pair.withgoogle.com/thehardway.

You can reach out to the hosts on Twitter: @dweinberger and @tafsiri

What is Tic-Tac-Toe the Hard Way?

A writer and a software engineer from Google's People + AI Research team explore the human choices that shape machine learning systems by building competing tic-tac-toe agents.