Alex Havrilla
Dahoas
64 followers · 0 following
https://dahoas.github.io/
dahoas
AI & ML interests
NLP, RL
Recent Activity
Updated a dataset 15 days ago: optimal-sampling/qwen-1.5-14B-awq-K-100-test-0.5M
Updated a dataset 15 days ago: optimal-sampling/qwen-1.5-4B-awq-K-100-test-1M
Updated a dataset 17 days ago: Dahoas/qwen-1.5-4B-K-100-test
Articles
Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 99
Organizations
Papers (2)
arxiv:2403.04642
arxiv:2402.10963
Models (33)
Dahoas/gptj-rm-IHP • Updated Mar 8, 2023 • 2
Dahoas/gptneox-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 14 • 1
Dahoas/pythia-1B-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 25 • 1
Dahoas/pythia-125M-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 16 • 1
Dahoas/synthetic-pythia-6B-rm-sft-response • Text Generation • Updated Mar 2, 2023 • 20
Dahoas/pythia-6B-sft-response-full-static • Text Generation • Updated Feb 27, 2023 • 31 • 1
Dahoas/gptj-6B-response-full-static-sft • Text Generation • Updated Feb 15, 2023 • 15 • 1
Dahoas/pythia-6B-rm-response-full-hh • Updated Feb 15, 2023
Dahoas/gptj-response-full-sft • Text Generation • Updated Feb 15, 2023 • 17 • 1
Dahoas/pythia-6b-rm-response-only-full-hh • Text Generation • Updated Feb 14, 2023 • 36
Datasets (141)
Dahoas/qwen-1.5-4B-K-100-test • Viewer • Updated 17 days ago • 500k • 34
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs • Viewer • Updated about 1 month ago • 750k • 53
Dahoas/MATH-K-100-train • Viewer • Updated Sep 12 • 750k • 9.67k • 2
Dahoas/gsm8k_reformatted • Viewer • Updated Aug 13 • 8.79k • 50
Dahoas/MATH_full_chat_format • Viewer • Updated Mar 27 • 12.5k • 44 • 1
Dahoas/MATH_chat_format • Viewer • Updated Mar 26 • 7.91k • 61
Dahoas/openwebtext_val • Viewer • Updated Feb 23 • 4.01k • 45
Dahoas/prompted_svamp • Viewer • Updated Oct 16, 2023 • 1k • 48 • 1
Dahoas/svamp • Viewer • Updated Oct 16, 2023 • 1k • 69
Dahoas/prompted_hf_cot_gsm8k • Viewer • Updated Oct 16, 2023 • 8.79k • 74 • 6