ppo-LunarLander-v2 / replay.mp4

Commit History

PPO lunar landing trained model (1M timesteps, 5 epochs)
dc226b5
verified

ErinDelft commited on

PPO lunar landing trained model (1M timesteps, 5 epochs)
1d40d05
verified

ErinDelft commited on