dahara1
/

imatrix-jpn-test

GGUF

Inference Endpoints

imatrix

conversational

Model card Files Files and versions Community

dahara1 commited on Sep 23

Commit

1cb00ac

•

1 Parent(s): e18c4c4

Update README.md

Browse files

Files changed (1) hide show

README.md +9 -4

README.md CHANGED Viewed

@@ -78,7 +78,7 @@ Example:
 - Please note that the imatrix-jpn-test model uses 5 times as much text for the imatrix as the bartowski model. There is a possibility that the performance may be slightly increased simply because there is more text.
 - In reality, it is better to measure performance with real tasks rather than perplexity. However, there are many different benchmarks for real tasks, so I will leave it up to you to verify this.
-- モデルによってこの結果は異なってくる可能性があります。あらゆるモデルに通用する結果とはまだ思わない方がよいです。特にgemmaはL/f16クォンツで性能が向上すると言われています
 - ほぼ同等の条件でも微妙にスコアが増減する事があります。わずかな差に注目するのではなく傾向に注目する事が望ましいです
 - imatrix-jpn-testモデルはbartowskiモデルに比べてimatrixに5倍のテキストを使用している事に留意してください。単純にテキストが多いため性能が微妙に増えている可能性があります
 - 本来はperplexityではなく実タスクで性能を測定する事が望ましいです。しかし、実タスクのベンチマークも多様なのでその検証は皆さんにお任せします
@@ -104,8 +104,11 @@ The following information may be helpful in your further exploration.
 ### 謝辞 Acknowledgements
 Thanks to the llama.cpp community.
 llama.cppのコミュニティの皆さんに感謝します。
 Thanks to u/noneabove1182 for the advice and motivation.
 アドバイスとモチベーションをくれたu/noneabove1182に感謝します
@@ -114,15 +117,17 @@ I do not know all the inventors of each method, so please point out any that I h
 - **Developed by:** [dahara1@webbigdata]
 - **Language(s) (NLP):** [English, Japanese]
-- **Finetuned from model [optional]:** [gemma-2-9b-it]
 **BibTeX:**
 @misc{dahara2024imatrix,
-  author       = {Dahara1},
   title        = {IMatrix JPN Test: A Multilingual Model for Improved Performance},
   year         = {2024},
   howpublished = {\url{https://huggingface.co/dahara1/imatrix-jpn-test}},
   note         = {Accessed: 2024-09-23},
   abstract     = {This model demonstrates the effectiveness of using a multilingual imatrix for model quantization, especially for improving performance in Japanese and other non-English languages.},
-}

 - Please note that the imatrix-jpn-test model uses 5 times as much text for the imatrix as the bartowski model. There is a possibility that the performance may be slightly increased simply because there is more text.
 - In reality, it is better to measure performance with real tasks rather than perplexity. However, there are many different benchmarks for real tasks, so I will leave it up to you to verify this.
+- モデルによってこの結果は異なってくる可能性があります。あらゆるモデルに通用する結果とはまだ思わない方がよいです。特にgemmaはL/fp16クォンツで性能が向上すると言われています
 - ほぼ同等の条件でも微妙にスコアが増減する事があります。わずかな差に注目するのではなく傾向に注目する事が望ましいです
 - imatrix-jpn-testモデルはbartowskiモデルに比べてimatrixに5倍のテキストを使用している事に留意してください。単純にテキストが多いため性能が微妙に増えている可能性があります
 - 本来はperplexityではなく実タスクで性能を測定する事が望ましいです。しかし、実タスクのベンチマークも多様なのでその検証は皆さんにお任せします
 ### 謝辞 Acknowledgements
 Thanks to the llama.cpp community.
 llama.cppのコミュニティの皆さんに感謝します。
+Thanks to the Google Gemma-2.
+google gemma-2に感謝します
 Thanks to u/noneabove1182 for the advice and motivation.
 アドバイスとモチベーションをくれたu/noneabove1182に感謝します
 - **Developed by:** [dahara1@webbigdata]
 - **Language(s) (NLP):** [English, Japanese]
+- **base model [optional]:** [gemma-2-9b-it]
 **BibTeX:**
+```
 @misc{dahara2024imatrix,
+  author       = {dahara1@webbigdata},
   title        = {IMatrix JPN Test: A Multilingual Model for Improved Performance},
   year         = {2024},
   howpublished = {\url{https://huggingface.co/dahara1/imatrix-jpn-test}},
   note         = {Accessed: 2024-09-23},
   abstract     = {This model demonstrates the effectiveness of using a multilingual imatrix for model quantization, especially for improving performance in Japanese and other non-English languages.},
+}
+```