alexbalandi
/

ppo-LunarLander-v2-4milsteps-200-envs

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

ppo-LunarLander-v2-4milsteps-200-envs / replay.mp4

alexbalandi's picture

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter.

3120398 over 1 year ago

history contribute delete

189 kB

This file contains binary data. It cannot be displayed, but you can still download it.