kevin009 commited on
Commit
f92b8ee
1 Parent(s): 2b45530

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -19,7 +19,7 @@ It uses RLHF and DOP to mimic a playful, human-like, and creative conversational
19
 
20
  BabyLlama v0.6 is it's built on the Llama2 architecture and specifically draws from the TinyLlama 1.1b, this version sets itself apart by not strictly adhering to user instructions. Instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on playful and humor.
21
 
22
- It used RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in each epoch, applied to over half a million conversations in low learrning rate. Further details will be updated when the initial tests are completed.
23
 
24
  ## Technical Specifications
25
 
@@ -33,7 +33,7 @@ It used RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in
33
  RMS Norm Epsilon: 1e-06, 1e-05 later
34
 
35
  ## Use Cases
36
- This model excels in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.
37
 
38
 
39
  ```python
 
19
 
20
  BabyLlama v0.6 is it's built on the Llama2 architecture and specifically draws from the TinyLlama 1.1b, this version sets itself apart by not strictly adhering to user instructions. Instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on playful and humor.
21
 
22
+ It involved 5 different epochs, with 200 steps in each epoch, applied to 0.5m conversations in a low learrning rate. Further details will be updated when the initial tests are completed.
23
 
24
  ## Technical Specifications
25
 
 
33
  RMS Norm Epsilon: 1e-06, 1e-05 later
34
 
35
  ## Use Cases
36
+ This model can be used in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.
37
 
38
 
39
  ```python