license: mit

AlphaZero trained to play Othello using JAX and Pgx. I used a TPU v4-8 provided by the TensorFlow Research Cloud to train it. Currently, I only have checkpoints for steps 13270 and 15154, but better models are coming soon. Model evaluations against the Pgx baseline:

| Step | Win % vs Pgx baseline | Draw % vs baseline | Loss % vs baseline |
|-------|--------|--------|---------|
| 13270 | ~46.8% | 6.25% | ~46.8% |
| 15154 | 62.5% | 0% | 37.5% |
| 17039 | 81.25% | 3.125% | 15.625% |
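
For readers who want to reproduce this kind of summary, here is a minimal sketch of turning per-game results into win/draw/loss percentages. It is an illustration and not the evaluation script used for this model: the function name `summarize_outcomes` is hypothetical, and the results are assumed to be encoded from the agent's point of view as `+1` (win), `0` (draw), and `-1` (loss). The README does not state how many games were played per evaluation. The batch of 32 games below is an assumption, chosen only because the percentages in the table are consistent with it.

```python
from collections import Counter


def summarize_outcomes(rewards):
    """Return win/draw/loss percentages for per-game results.

    `rewards` is a list with one entry per game, from the evaluated
    agent's point of view: +1 for a win, 0 for a draw, -1 for a loss
    (an assumed encoding, not taken from the original README).
    """
    if not rewards:
        raise ValueError("need at least one game result")
    counts = Counter(rewards)
    n = len(rewards)
    return {
        "win_pct": 100.0 * counts[1] / n,
        "draw_pct": 100.0 * counts[0] / n,
        "loss_pct": 100.0 * counts[-1] / n,
    }


# Hypothetical batch of 32 games: 26 wins, 1 draw, 5 losses.
rewards = [1] * 26 + [0] * 1 + [-1] * 5
print(summarize_outcomes(rewards))
# {'win_pct': 81.25, 'draw_pct': 3.125, 'loss_pct': 15.625}
```

With a batch this small, one game moves a percentage by about 3 points, so differences between checkpoints are best read as rough indicators.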