Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: text-generation
   <img src="prox-teaser.png">
 </p>

-[ArXiv](
+[ArXiv](https://arxiv.org/abs/2409.17115) | [Models](https://huggingface.co/gair-prox/FW-ProX-1.7B) | [Data](https://huggingface.co/datasets/gair-prox/FineWeb-pro) | [Code](https://github.com/GAIR-NLP/program-every-example)

 **FW-ProX-1.7B** is a small language model. It was trained on the [FineWeb-pro](https://huggingface.co/datasets/gair-prox/FineWeb-pro) dataset for 50B tokens.
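A minimal usage sketch follows; it assumes the checkpoint at `gair-prox/FW-ProX-1.7B` loads through the standard `transformers` AutoClasses, which this card does not itself confirm:

```python
# Illustrative sketch (assumption: the FW-ProX-1.7B checkpoint is compatible
# with the standard transformers AutoClasses; verify against the repo files).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gair-prox/FW-ProX-1.7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "The key idea of refining pre-training data is"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```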
@@ -30,6 +30,10 @@ ProX models are evaluated over 10 language model benchmarks in zero-shot setting

 ### Citation
 ```
-@
+@article{zhou2024programming,
+  title={Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale},
+  author={Zhou, Fan and Wang, Zengzhi and Liu, Qian and Li, Junlong and Liu, Pengfei},
+  journal={arXiv preprint arXiv:2409.17115},
+  year={2024}
 }
 ```