mkahari/RL_testing

---
tags:
- Taxi-v3
- q-learning
- reinforcement-learning
- custom-implementation
model-index:
- name: RL_testing
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: Taxi-v3
      type: Taxi-v3
    metrics:
    - type: mean_reward
      value: 7.50 +/- 2.70
      name: mean_reward
      verified: false
---

# PPO Agent playing LunarLander-v2
This is a trained model of a PPO agent playing LunarLander-v2, built with the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).

## Usage (with Stable-baselines3)
TODO: Add your code


```python
from stable_baselines3 import ...
from huggingface_sb3 import load_from_hub

...
```
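
The `mean_reward` value in the metadata has the form `mean +/- std`. As a minimal sketch of how a figure in that form can be computed from per-episode returns, the snippet below assumes a population standard deviation; this card does not say how its value was actually computed. The helper name `format_mean_reward` and the sample returns are hypothetical and are not measurements from this model.

```python
from statistics import mean, pstdev


def format_mean_reward(episode_returns):
    """Format per-episode returns as 'mean +/- std' (population std, an assumption)."""
    if not episode_returns:
        raise ValueError("need at least one episode return")
    avg = mean(episode_returns)
    std = pstdev(episode_returns)  # population standard deviation
    return f"{avg:.2f} +/- {std:.2f}"


# Hypothetical returns for illustration only, not results from this model.
sample_returns = [5, 10, 8, 7]
print(format_mean_reward(sample_returns))
```

In practice the list of returns would come from running the trained agent for a fixed number of evaluation episodes.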