alexbalandi
/

ppo-LunarLander-v2-4milsteps-200-envs

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

ppo-LunarLander-v2-4milsteps-200-envs

1 contributor

History: 3 commits

alexbalandi's picture

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter.

3120398 over 1 year ago

FinetunedPPO_5mil_steps_total
Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago
ppo-LunarLander-v2
Upload PPO LunarLander-v2 trained agent, first step over 1 year ago
.gitattributes

1.48 kB

initial commit over 1 year ago
FinetunedPPO_5mil_steps_total.zip

157 kB
LFS

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago
README.md

784 Bytes

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago
config.json

23.7 kB

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago
ppo-LunarLander-v2.zip

157 kB
LFS

Upload PPO LunarLander-v2 trained agent, first step over 1 year ago
replay.mp4

189 kB

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago
results.json

163 Bytes

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 1 year ago