Spaces:
Sleeping
Sleeping
Update app.py
Browse files
app.py
CHANGED
@@ -172,18 +172,24 @@ with gr.Blocks(analytics_enabled=False, css=custom_css) as demo:
|
|
172 |
"""
|
173 |
## Demo platform for π₯ CroissantLLMChat
|
174 |
|
175 |
-
The model is of small size (1.3B), about 130 times smaller than GPT3.
|
176 |
-
As such, it's generalist Chat version logically exhibits reduced understanding, reasoning and knowledge capacities.
|
177 |
-
For industrial uses, we recommend finetuning the model, but trained this Chat version to allow for experimenting and to showcase the capabilities for it's size.
|
178 |
|
179 |
-
|
|
|
180 |
We recommend testing the chat model for open-ended writing tasks, tips, translations, etc...
|
181 |
We find direct instructions to work best, and performance to drop after the first round of interactions.
|
182 |
-
We limit the length of the conversation so clear the Chat between tests !
|
|
|
|
|
183 |
|
184 |
-
|
185 |
-
|
186 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
187 |
"""
|
188 |
)
|
189 |
|
|
|
172 |
"""
|
173 |
## Demo platform for π₯ CroissantLLMChat
|
174 |
|
|
|
|
|
|
|
175 |
|
176 |
+
### Usage recommendations
|
177 |
+
|
178 |
We recommend testing the chat model for open-ended writing tasks, tips, translations, etc...
|
179 |
We find direct instructions to work best, and performance to drop after the first round of interactions.
|
180 |
+
We limit the length of each message to 256 tokens by default (can be changed in the settings below), and of the entire conversation so clear the Chat between tests !
|
181 |
+
|
182 |
+
### Errors
|
183 |
|
184 |
+
The model is very small in size (1.3B), about 130 times smaller than GPT3. As such, it's generalist Chat version logically exhibits reduced understanding, reasoning and knowledge capacities, and may still exhibit undesired behavior such as hallucinations, or toxicity (rarely)...
|
185 |
+
For industrial applications, we recommend finetuning the model, but trained this Chat version to allow for experimenting and to showcase the capabilities for it's size.
|
186 |
+
|
187 |
+
### More info
|
188 |
+
ποΈ The blogpost: https://huggingface.co/blog/manu/croissant-llm-blog
|
189 |
+
π The 45 page report with lots of gems: https://arxiv.org/abs/2402.00786
|
190 |
+
π€ Models, Data, Demo: https://huggingface.co/croissantllm
|
191 |
+
###
|
192 |
+
|
193 |
"""
|
194 |
)
|
195 |
|