koalazf99 commited on
Commit
22d0ce7
1 Parent(s): cd0d83f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -3
README.md CHANGED
@@ -1,3 +1,35 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - gair-prox/FineWeb-pro
5
+ language:
6
+ - en
7
+ tags:
8
+ - llama
9
+ pipeline_tag: text-generation
10
+ ---
11
+
12
+ # FW-ProX-1.7B
13
+
14
+ <p align="center">
15
+ <img src="prox-teaser.png">
16
+ </p>
17
+
18
+ [ArXiv](http://arxiv.org/abs/xxxx) | [Models](https://huggingface.co/gair-prox/FW-ProX-1.7B) | [Data](https://huggingface.co/datasets/gair-prox/FineWeb-pro) | [Code](https://github.com/GAIR-NLP/program-every-example)
19
+
20
+ **FW-ProX-1.7B** is a small language model. It was and trained on the [FineWeb-pro](https://huggingface.co/datasets/gair-prox/FineWeb-pro) for 50B tokens.
21
+
22
+ ## Evaluations
23
+
24
+ ProX models are evaluated over 10 language model benchmarks in zero-shot setting.
25
+
26
+ | | ArC-c | ARC-e | CSQA | HellaS | MMLU | OBQA | PiQA | SIQA | WinoG | SciQ | AVG |
27
+ |-----------------------|-------|-------|-------|-----------|-------|-------|-------|-------|-------|-------|------|
28
+ | raw | 28.5 | 52.6 | 33.9 | 53.2 | 29.8 | 32.6 | 72.9 | 40.2 | 53.0 | 77.1 | 47.4 |
29
+ | ours | 34.4 | 63.9 | 32.6 | 53.0 | 33.1 | 34.4 | 73.1 | 39.3 | 52.7 | 81.5 | 49.8 |
30
+
31
+ ### Citation
32
+ ```
33
+ @misc{TBD
34
+ }
35
+ ```