cfh::blog

Connect-Zero

1. Connect 4
April 20, 2025
2. Connect-Zero: Reinforcement Learning from Scratch
April 20, 2025
3. Basic Setup and Play
April 20, 2025
4. The REINFORCE Algorithm
April 20, 2025
5. A First Training Run and Policy Collapse
April 21, 2025
6. On Entropy
April 23, 2025
7. Entropy Regularization
April 24, 2025
8. Introducing a Benchmark Opponent
April 26, 2025
9. Model Design for Connect 4
April 28, 2025
10. REINFORCE with Baseline
April 29, 2025
11. Implementing and Evaluating REINFORCE with Baseline
May 1, 2025
12. Actor-Critic Algorithms
May 8, 2025
13. Implementing A2C
May 10, 2025
14. Evaluating A2C versus REINFORCE with baseline
May 11, 2025
15. Multi-Step Bootstrapping
May 11, 2025
16. Proximal Policy Optimization (PPO)
May 25, 2025
© 2025 cfh::blog