Update README.md
Browse files
README.md
CHANGED
@@ -13,32 +13,26 @@ This is a finetuned model of Cerebras 111M model. using DataBricksLabs Dolly Fra
|
|
13 |
|
14 |
### Model Description
|
15 |
|
16 |
-
|
17 |
|
|
|
18 |
|
19 |
-
|
20 |
-
- **Developed by:** [More Information Needed]
|
21 |
- **Shared by [optional]:** [More Information Needed]
|
22 |
- **Model type:** [More Information Needed]
|
23 |
-
- **Language(s) (NLP):**
|
24 |
-
- **License:**
|
25 |
-
- **Finetuned from model
|
26 |
-
|
27 |
-
### Model Sources [optional]
|
28 |
-
|
29 |
-
<!-- Provide the basic links for the model. -->
|
30 |
-
|
31 |
-
- **Repository:** [More Information Needed]
|
32 |
-
- **Paper [optional]:** [More Information Needed]
|
33 |
-
- **Demo [optional]:** [More Information Needed]
|
34 |
|
35 |
## Uses
|
36 |
|
37 |
-
|
|
|
38 |
|
39 |
### Direct Use
|
40 |
|
41 |
-
|
42 |
|
43 |
[More Information Needed]
|
44 |
|
@@ -50,22 +44,18 @@ This is a finetuned model of Cerebras 111M model. using DataBricksLabs Dolly Fra
|
|
50 |
|
51 |
### Out-of-Scope Use
|
52 |
|
53 |
-
|
|
|
|
|
54 |
|
55 |
[More Information Needed]
|
56 |
|
57 |
## Bias, Risks, and Limitations
|
58 |
|
59 |
-
|
60 |
|
61 |
[More Information Needed]
|
62 |
|
63 |
-
### Recommendations
|
64 |
-
|
65 |
-
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
66 |
-
|
67 |
-
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
68 |
-
|
69 |
## How to Get Started with the Model
|
70 |
|
71 |
Use the code below to get started with the model.
|
@@ -143,10 +133,10 @@ Use the code below to get started with the model.
|
|
143 |
|
144 |
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
145 |
|
146 |
-
- **Hardware Type:**
|
147 |
-
- **
|
148 |
-
- **Cloud Provider:**
|
149 |
-
- **Compute Region:**
|
150 |
- **Carbon Emitted:** [More Information Needed]
|
151 |
|
152 |
## Technical Specifications [optional]
|
|
|
13 |
|
14 |
### Model Description
|
15 |
|
16 |
+
This is a finetuned version of cerebras' 111million paramater model that has been trained to follow instructions.
|
17 |
|
18 |
+
It was accomplished using DataBricks Dolly training tools and the alpaca dataset, and was trained for 2 epochs.
|
19 |
|
20 |
+
- **Developed by:** Finetuned by Corianas (me) using open source tools
|
|
|
21 |
- **Shared by [optional]:** [More Information Needed]
|
22 |
- **Model type:** [More Information Needed]
|
23 |
+
- **Language(s) (NLP):** EN
|
24 |
+
- **License:** cc-by-nc-4.0
|
25 |
+
- **Finetuned from model:** https://huggingface.co/cerebras/Cerebras-GPT-111m
|
26 |
+
- **Finetuned using:** https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
|
28 |
## Uses
|
29 |
|
30 |
+
This is a simple GPT chatbot that has been finetuned to understand instructions.
|
31 |
+
Its knowledge about facts about the world is should be considered suspect at best.
|
32 |
|
33 |
### Direct Use
|
34 |
|
35 |
+
If you have a use you put it to, Please let me know.
|
36 |
|
37 |
[More Information Needed]
|
38 |
|
|
|
44 |
|
45 |
### Out-of-Scope Use
|
46 |
|
47 |
+
Any form of use where any form of accuracy is needed.
|
48 |
+
FOR THE LOVE OF GOD DO NOT FOLLOW MEDICAL ADVICE FROM THIS.
|
49 |
+
or financial advice.
|
50 |
|
51 |
[More Information Needed]
|
52 |
|
53 |
## Bias, Risks, and Limitations
|
54 |
|
55 |
+
Limitations... Yes, I am sure there are so so many.
|
56 |
|
57 |
[More Information Needed]
|
58 |
|
|
|
|
|
|
|
|
|
|
|
|
|
59 |
## How to Get Started with the Model
|
60 |
|
61 |
Use the code below to get started with the model.
|
|
|
133 |
|
134 |
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
135 |
|
136 |
+
- **Hardware Type:** 8xA100s (accomplished while I was downloading the model I was actually training.)
|
137 |
+
- **Minutes used:** 7.5
|
138 |
+
- **Cloud Provider:** LambdaGPU
|
139 |
+
- **Compute Region:** USA
|
140 |
- **Carbon Emitted:** [More Information Needed]
|
141 |
|
142 |
## Technical Specifications [optional]
|