Update README.md
README.md
CHANGED
@@ -19,7 +19,7 @@ It uses RLHF and DPO to mimic a playful, human-like, and creative conversational
 
 BabyLlama v0.6 is built on the Llama2 architecture and draws specifically from TinyLlama 1.1B. This version sets itself apart by not strictly adhering to user instructions. Instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on playfulness and humor.
 
-It
+It involved 5 epochs of 200 steps each, applied to 0.5M conversations at a low learning rate. Further details will be added once the initial tests are complete.
 
 ## Technical Specifications
 
@@ -33,7 +33,7 @@ It used RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in
 RMS Norm Epsilon: 1e-06, 1e-05 later
 
 ## Use Cases
-This model
+This model can be used in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.
 
 
 ```python