kevin009/babyllama-v0.6


kevin009 committed on Feb 13

Commit f92b8ee • 1 Parent(s): 2b45530

Update README.md

Files changed (1)

README.md +2 -2


@@ -19,7 +19,7 @@ It uses RLHF and DOP to mimic a playful, human-like, and creative conversational
 BabyLlama v0.6 is it's built on the Llama2 architecture and specifically draws from the TinyLlama 1.1b, this version sets itself apart by not strictly adhering to user instructions. Instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on playful and humor.
-It used RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in each epoch, applied to over half a million conversations in low learrning rate. Further details will be updated when the initial tests are completed.
+It involved 5 different epochs, with 200 steps in each epoch, applied to 0.5m conversations in a low learrning rate. Further details will be updated when the initial tests are completed.
 ## Technical Specifications
@@ -33,7 +33,7 @@ It used RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in
     RMS Norm Epsilon: 1e-06, 1e-05 later
 ## Use Cases
-This model excels in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.
+This model can be used in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.
 ```python