audreyt
/

Taiwan-LLaMa-v1.0-GGML

audreyt commited on Aug 12, 2023

Commit

968e14b

•

1 Parent(s): 12181c0

Sync upstream README changes 7342cd5

Files changed (1) hide show

README.md CHANGED Viewed

@@ -42,6 +42,8 @@ They are known to work with:
 <!-- footer end -->
 # Original model card: Yen-Ting Lin's Language Models for Taiwanese Culture v1.0
 <p align="center">
 ✍️ <a href="https://huggingface.co/spaces/yentinglin/Taiwan-LLaMa2" target="_blank">Online Demo</a>
@@ -62,7 +64,7 @@ They are known to work with:
 ## Overview
-Taiwan-LLaMa is a full parameter fine-tuned model based on LLaMa 2 for traditional chinese applications.
 **Taiwan-LLaMa v1.0** pretrained on over 5 billion tokens and instruction-tuned on over 490k conversations both in traditional chinese.
@@ -81,8 +83,8 @@ A live demonstration of the model can be accessed at [Hugging Face Spaces](https
 ## Work in progress
-- [ ] **Improved Pretraining**: A refined version of the existing pretraining approach is under development, aiming to enhance model performance.
-- [ ] **Extended Model Length**: Utilizing the Rope mechanism as described in [the paper](https://arxiv.org/abs/2104.09864), the model's length will be extended from 4k to 8k.
 ## Taiwanese Culture Examples

 <!-- footer end -->
 # Original model card: Yen-Ting Lin's Language Models for Taiwanese Culture v1.0
+# Language Models for Taiwanese Culture
 <p align="center">
 ✍️ <a href="https://huggingface.co/spaces/yentinglin/Taiwan-LLaMa2" target="_blank">Online Demo</a>
 ## Overview
+Taiwan-LLaMa is a full parameter fine-tuned model based on LLaMa 2 for Traditional Chinese applications.
 **Taiwan-LLaMa v1.0** pretrained on over 5 billion tokens and instruction-tuned on over 490k conversations both in traditional chinese.
 ## Work in progress
+- [ ] **Improved pretraining**: A refined pretraining process (e.g. more data from Taiwan, training strategies) is under development, aiming to enhance model performance for better Taiwanese culture.
+- [ ] **Extend max length**: Utilizing the Rope mechanism as described in [the paper](https://arxiv.org/abs/2104.09864), the model's length will be extended from 4k to 8k.
 ## Taiwanese Culture Examples