KnutJaegersberg committed
Commit 1043c8a
Parent(s): 7f92913
Update README.md

README.md CHANGED
@@ -13,6 +13,12 @@ inference: false
 tags:
 - text-generation-inference
 ---
+**Important Note:**
+I did not create this MIT-licensed model; I discovered and downloaded it. It was taken down by its creators, so I am re-uploading it. More info:
+
+https://github.com/huggingface/transformers/issues/25723
+
+
 ### **Model Description**
 ***GPT-JX*** is a **3 billion parameter** autoregressive foundational large language model pre-trained on **1.1 trillion tokens** of *high-quality*, *cleaned*, and *deduplicated* English text and code. ***GPT-JX*** uses the base architecture of the traditional *Transformer decoder* with **slight changes**, which are discussed later. ***GPT-JX*** was pre-trained on tokens spanning **English text** and **20 programming languages**. ***GPT-JX*** shows impressive performance when compared to **7-billion-parameter large language models** such as **Llama-2-7B, Falcon-7B & MPT-7B**.
 
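For context on what using such a re-uploaded checkpoint typically looks like, here is a minimal sketch with the Hugging Face `transformers` API. The repository id is a placeholder chosen for illustration (the actual id of this re-upload is not stated on this page), and `trust_remote_code=True` is an assumption in case the checkpoint ships custom modeling code.

```python
# Minimal sketch: loading a causal LM checkpoint via transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repository id; substitute the actual id of this re-upload.
repo_id = "KnutJaegersberg/gpt-jx-3b"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    trust_remote_code=True,  # assumption: may be needed for custom model code
)

# Sample generation on a code prompt, since the model was trained on code too.
inputs = tokenizer("def fibonacci(n):", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```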