Understanding Proximal Policy Optimization (PPO) in Reinforcement Learning

Today's Progress

Gathered some research papers on standard machine learning algorithms, such as PPO and A2C.