├─ LAB4_slide [Slide for quick overview]
├─ LAB4_spec [Specification for Lab 4]
└─ Q-learning
├─ Q_Learning_Demo.pptx [Detials for DQN]
└─ q_learning_demo.cpp [Q learning toy demo]
- AnyDesk for me
- xrdp for yours
- In this training, you will get 64.725 point
- Test plot like this:
You may try anthor training (if your algorithm is right but ) or modify your code.
- In this training, you will get 100 point
- Test plot like this:
That is what you want to see.
MONITOR_PATH = './vedio' # video path
RECORD_VIDEO_FLAG = True # record video or not