Master Snake Game AI with PPO: Step-by-Step Guide (Part I)
This is first of two part tutorial. Here we learn to build snake game. In part two, we will learn to build a PPO agent to play with it.
This is first of two part tutorial. Here we learn to build snake game. In part two, we will learn to build a PPO agent to play with it.
In this blog post, we will explore the Proximal Policy Optimization (PPO) algorithm. We’ll compare it to other deep reinforcement learning algorithms like Double Deep Q-learning and TRPO. Additionally, we’ll learn how to implement PPO using PyTorch.
Introduction of Prioritized Experience Replay and its implementation with PyTorch.
This is an implementation of Policy Gradient algorithm using PyTorch.
Implementation of Gaussian Double Deep Q network with PyTorch