Skip to content

Latest commit

 

History

History
16 lines (13 loc) · 595 Bytes

README.md

File metadata and controls

16 lines (13 loc) · 595 Bytes

RL agent that learns to play Ping Pong

Implemented the system using policy gradient by creating two layer neural network and applied RMSProp.
alt text

Analysis

Rewards before training:
alt text
Rewards after training:
alt text

Requirements

Open Gym: loads game environment
Pickle: stores the parameter values and rewards