Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
8
Alex Havrilla
Dahoas
Follow
ChengeHarrison's profile picture
Pwlot's profile picture
huang1994's profile picture
64 followers
·
0 following
https://dahoas.github.io/
dahoas
AI & ML interests
NLP, RL
Recent Activity
updated
a dataset
18 days ago
optimal-sampling/qwen-1.5-14B-awq-K-100-test-0.5M
updated
a dataset
18 days ago
optimal-sampling/qwen-1.5-4B-awq-K-100-test-1M
updated
a dataset
20 days ago
Dahoas/qwen-1.5-4B-K-100-test
View all activity
Articles
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022
•
101
Organizations
Dahoas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
updated
2 datasets
18 days ago
optimal-sampling/qwen-1.5-14B-awq-K-100-test-0.5M
Viewer
•
Updated
18 days ago
•
495k
•
36
optimal-sampling/qwen-1.5-4B-awq-K-100-test-1M
Viewer
•
Updated
18 days ago
•
9.9k
•
34
updated
a dataset
20 days ago
Dahoas/qwen-1.5-4B-K-100-test
Viewer
•
Updated
20 days ago
•
500k
•
37
updated
a dataset
about 1 month ago
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs
Viewer
•
Updated
Oct 22
•
750k
•
48
updated
4 datasets
about 2 months ago
optimal-sampling/qwen-1.5-7B-k-100-train
Viewer
•
Updated
Oct 5
•
750k
•
40
optimal-sampling/qwen-1.5-14B-K-100-test-extra
Viewer
•
Updated
Oct 4
•
500k
•
32
optimal-sampling/qwen-1.5-4B-K-100-test-2M
Viewer
•
Updated
Oct 3
•
2M
•
64
optimal-sampling/qwen-1.5-32B-K-100-test
Viewer
•
Updated
Sep 29
•
500k
•
35
updated
4 datasets
2 months ago
optimal-sampling/qwen-1.5-4B-K-100-test
Viewer
•
Updated
Sep 12
•
500k
•
38
optimal-sampling/MATH-K-100-test-inputs
Viewer
•
Updated
Sep 12
•
500k
•
36
Dahoas/MATH-K-100-train
Viewer
•
Updated
Sep 12
•
750k
•
8.8k
•
2
optimal-sampling/MATH-K-100-train
Viewer
•
Updated
Sep 11
•
750k
•
50
updated
2 datasets
3 months ago
optimal-sampling/qwen-1.5-4B-K-100-train
Viewer
•
Updated
Sep 5
•
750k
•
62
Dahoas/gsm8k_reformatted
Viewer
•
Updated
Aug 13
•
8.79k
•
45
New activity in
monology/pile-uncopyrighted
6 months ago
Streaming broken for Pile
4
#5 opened 6 months ago by
Dahoas
liked
a model
7 months ago
casperhansen/llama-3-70b-instruct-awq
Text Generation
•
Updated
Apr 19
•
16.2k
•
66
liked
a model
8 months ago
reciprocate/mistral-7b-gsm8k-code-rm
Text Classification
•
Updated
Mar 24
•
11
•
3
updated
2 datasets
8 months ago
Dahoas/MATH_full_chat_format
Viewer
•
Updated
Mar 27
•
12.5k
•
45
•
1
Dahoas/MATH_chat_format
Viewer
•
Updated
Mar 26
•
7.91k
•
58
liked
a dataset
8 months ago
reciprocate/tinygsm_mixtral_12M
Viewer
•
Updated
Mar 24
•
12M
•
106
•
1
Load more