Update README.md
README.md
CHANGED
@@ -16,7 +16,7 @@ Our Swallow-MX-8x7b-NVE-v0.1 model has undergone continuous pre-training from th
 ## Model Details

-* **Model type**: Please refer to Mixtral technical report for details on the model architecture.
+* **Model type**: Please refer to [Mixtral technical report](https://arxiv.org/abs/2401.04088) for details on the model architecture.
 * **Language(s)**: Japanese English
 * **Tokenizer**: This model utilizes the same tokenizer as employed by Mixtral-8x7B-Instruct-v0.1.
 * **Contact**: swallow[at]nlp.c.titech.ac.jp
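Since the Model Details section names the base architecture (Mixtral) and the tokenizer (shared with Mixtral-8x7B-Instruct-v0.1), a minimal loading sketch with Hugging Face `transformers` may be useful. The hub repository id `tokyotech-llm/Swallow-MX-8x7b-NVE-v0.1`, the bf16 dtype, and the sample prompt are assumptions not taken from the diff above; adjust them to the actual model card.

```python
# A minimal sketch, assuming the model is published under the hub id below.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tokyotech-llm/Swallow-MX-8x7b-NVE-v0.1"  # assumed repository id

# The README states the tokenizer is the same as Mixtral-8x7B-Instruct-v0.1,
# so AutoTokenizer can resolve it directly from the model repository.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # Mixtral-class MoE checkpoints are large; bf16 halves memory
    device_map="auto",           # shard across available devices
)

# Simple greedy-ish generation to sanity-check the load.
inputs = tokenizer("Tokyo Institute of Technology is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```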