alexbalandi/ppo-LunarLander-v2-4milsteps-200-envs
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Branch: main
1 contributor
History: 3 commits
Latest commit: 3120398 by alexbalandi, over 1 year ago
"Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter."
| Name | Scan | Size | Storage | Last commit | Updated |
|---|---|---|---|---|---|
| FinetunedPPO_5mil_steps_total | - | - | - | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |
| ppo-LunarLander-v2 | - | - | - | Upload PPO LunarLander-v2 trained agent, first step | over 1 year ago |
| .gitattributes | Safe | 1.48 kB | - | initial commit | over 1 year ago |
| FinetunedPPO_5mil_steps_total.zip | Safe | 157 kB | LFS | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |
| README.md | Safe | 784 Bytes | - | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |
| config.json | Safe | 23.7 kB | - | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |
| ppo-LunarLander-v2.zip | Safe | 157 kB | LFS | Upload PPO LunarLander-v2 trained agent, first step | over 1 year ago |
| replay.mp4 | Safe | 189 kB | - | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |
| results.json | Safe | 163 Bytes | - | Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. | over 1 year ago |