admarcosai's Collections
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper • 2312.06585 • Published • 28
TinyGSM: achieving >80% on GSM8k with small language models
Paper • 2312.09241 • Published • 37
Viewer • Updated • 70k • 2.68k • 86
Paper • 2309.17425 • Published • 6
jondurbin/gutenberg-dpo-v0.1
Viewer • Updated • 918 • 1.64k • 126
garage-bAInd/Open-Platypus
Viewer • Updated • 24.9k • 2.63k • 370
Viewer • Updated • 243k • 355 • 200
Viewer • Updated • 58.7k • 93 • 43
Viewer • Updated • 1.49M • 456 • 137
Viewer • Updated • 166k • 185 • 106
Viewer • Updated • 198k • 159 • 110
Viewer • Updated • 2.75M • 9.99k • 330
Viewer • Updated • 6.2M • 521 • 77
open-web-math/open-web-math
Viewer • Updated • 6.32M • 5.16k • 274
Viewer • Updated • 4.04k • 6.79k • 117
Viewer • Updated • 14.3k • 180 • 45
Viewer • Updated • 44.8k • 109 • 50
Viewer • Updated • 6.14k • 5.53k • 112
Viewer • Updated • 262k • 2.17k • 249
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer • Updated • 60.9k • 8.54k • 125
WhiteRabbitNeo/Code-Functions-Level-Cyber
Viewer • Updated • 8.44k • 82 • 18
WhiteRabbitNeo/Code-Functions-Level-General
Viewer • Updated • 8.69k • 57 • 14
Viewer • Updated • 317k • 2.41k • 28
Updated • 1.21k • 71
Viewer • Updated • 183k • 520 • 277
selfrag/selfrag_train_data
Viewer • Updated • 146k • 147 • 67
Viewer • Updated • 463k • 32 • 17
Locutusque/UltraTextbooks
Viewer • Updated • 5.52M • 1.35k • 194
Undi95/ConversationChronicles-sharegpt-SHARDED
Viewer • Updated • 787k • 85 • 10
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Paper • 2402.10176 • Published • 36
Viewer • Updated • 31.1M • 5.35k • 566
togethercomputer/RedPajama-Data-1T
Viewer • Updated • 1.73M • 1.6k • 1.06k
Viewer • Updated • 968M • 25.7k • 814
Viewer • Updated • 276M • 13.6k • 141