sr5434/alphazero-othello

---
license: mit
---
AlphaZero trained to play Othello using JAX and Pgx. I trained it on a TPU v4-8 provided by the TensorFlow Research Cloud. Checkpoints are currently available only for steps 13270 and 15154, but better models are coming soon.
Model evaluations:

| Step | Win % vs Pgx baseline | Draw % vs Pgx baseline | Loss % vs Pgx baseline |
|------|-----------------------|------------------------|------------------------|
| 13270 | ~46.8% | 6.25% | ~46.8% |
| 15154 | 62.5% | 0% | 37.5% |
| 17039 | 81.25% | 3.125% | 15.625% |
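
The percentages above are all multiples of 3.125%, which would be consistent with 32 evaluation games per checkpoint. That game count is an inference from the numbers, not something stated in this card. The sketch below converts the reported rates into game counts under that assumption; `ASSUMED_GAMES`, `RESULTS`, and `to_counts` are hypothetical names used only for illustration, and the "~46.8%" entries are taken to be 46.875% because that is the value that makes the row sum to 100%.

```python
from fractions import Fraction

# Reported (win %, draw %, loss %) per checkpoint step, copied from the table.
# Assumption: the "~46.8%" entries for step 13270 stand for 46.875%,
# the value that makes that row sum to 100%.
RESULTS = {
    13270: (46.875, 6.25, 46.875),
    15154: (62.5, 0.0, 37.5),
    17039: (81.25, 3.125, 15.625),
}

# Assumption: 32 evaluation games per checkpoint (not stated in the model card).
ASSUMED_GAMES = 32


def to_counts(percentages, games=ASSUMED_GAMES):
    """Convert percentages to whole game counts; raise if any is fractional."""
    counts = []
    for pct in percentages:
        # Fraction(str(...)) keeps the decimal exact, avoiding float rounding.
        n = Fraction(str(pct)) * games / 100
        if n.denominator != 1:
            raise ValueError(f"{pct}% is not a whole number of {games} games")
        counts.append(int(n))
    return tuple(counts)


for step, pcts in RESULTS.items():
    wins, draws, losses = to_counts(pcts)
    print(f"step {step}: {wins} wins, {draws} draws, {losses} losses")
```

If the real evaluation used a different number of games (for example 64), the same check still passes with `games=64`, so the counts here are only one consistent reading of the table.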