Text Generation
Chinese
audreyt commited on
Commit
968e14b
1 Parent(s): 12181c0

Sync upstream README changes 7342cd5

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -42,6 +42,8 @@ They are known to work with:
42
  <!-- footer end -->
43
 
44
  # Original model card: Yen-Ting Lin's Language Models for Taiwanese Culture v1.0
 
 
45
 
46
  <p align="center">
47
  ✍️ <a href="https://huggingface.co/spaces/yentinglin/Taiwan-LLaMa2" target="_blank">Online Demo</a>
@@ -62,7 +64,7 @@ They are known to work with:
62
 
63
 
64
  ## Overview
65
- Taiwan-LLaMa is a full parameter fine-tuned model based on LLaMa 2 for traditional chinese applications.
66
 
67
  **Taiwan-LLaMa v1.0** pretrained on over 5 billion tokens and instruction-tuned on over 490k conversations both in traditional chinese.
68
 
@@ -81,8 +83,8 @@ A live demonstration of the model can be accessed at [Hugging Face Spaces](https
81
 
82
 
83
  ## Work in progress
84
- - [ ] **Improved Pretraining**: A refined version of the existing pretraining approach is under development, aiming to enhance model performance.
85
- - [ ] **Extended Model Length**: Utilizing the Rope mechanism as described in [the paper](https://arxiv.org/abs/2104.09864), the model's length will be extended from 4k to 8k.
86
 
87
 
88
  ## Taiwanese Culture Examples
 
42
  <!-- footer end -->
43
 
44
  # Original model card: Yen-Ting Lin's Language Models for Taiwanese Culture v1.0
45
+ # Language Models for Taiwanese Culture
46
+
47
 
48
  <p align="center">
49
  ✍️ <a href="https://huggingface.co/spaces/yentinglin/Taiwan-LLaMa2" target="_blank">Online Demo</a>
 
64
 
65
 
66
  ## Overview
67
+ Taiwan-LLaMa is a full parameter fine-tuned model based on LLaMa 2 for Traditional Chinese applications.
68
 
69
  **Taiwan-LLaMa v1.0** pretrained on over 5 billion tokens and instruction-tuned on over 490k conversations both in traditional chinese.
70
 
 
83
 
84
 
85
  ## Work in progress
86
+ - [ ] **Improved pretraining**: A refined pretraining process (e.g. more data from Taiwan, training strategies) is under development, aiming to enhance model performance for better Taiwanese culture.
87
+ - [ ] **Extend max length**: Utilizing the Rope mechanism as described in [the paper](https://arxiv.org/abs/2104.09864), the model's length will be extended from 4k to 8k.
88
 
89
 
90
  ## Taiwanese Culture Examples