Florents-Tselai committed
Commit a599fc6
1 Parent(s): f60a82c
Update README.md

README.md CHANGED
@@ -13,21 +13,14 @@ base_model:

# Meltemi 7B Instruct v1.5 gguf

-This is [Meltemi 7B Instruct v1.5](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1.5) published in `gguf`, [llama.cpp](https://github.com/ggerganov/llama.cpp)-compatible format.
-
-Meltemi is the first Greek Large Language Model (LLM), trained by the [Institute for Language and Speech Processing](https://www.athenarc.gr/en/ilsp) at [Athena Research & Innovation Center](https://www.athenarc.gr/en).
-Meltemi is built on top of [Mistral-7B-Instruct](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1), extending its capabilities for Greek through continual pretraining on a large corpus of high-quality and locally relevant Greek texts.
-
+This is [Meltemi 7B Instruct v1.5](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1.5), the first Greek Large Language Model (LLM), published in the `gguf`, [llama.cpp](https://github.com/ggerganov/llama.cpp)-compatible format.

# Model Information

- Vocabulary extension of the Mistral 7B tokenizer with Greek tokens for lower costs and faster inference (**1.52** vs. 6.80 tokens/word for Greek)
- 8192 context length
-- Fine-tuning has been done with the [Odds Ratio Preference Optimization (ORPO)](https://arxiv.org/abs/2403.07691) algorithm using 97k preference data:
-  * 89,730 Greek preference data, mostly translated versions of high-quality datasets on Hugging Face
-  * 7,342 English preference data
-- Our alignment procedure is based on the [TRL - Transformer Reinforcement Learning](https://huggingface.co/docs/trl/index) library and partially on the [Hugging Face finetuning recipes](https://github.com/huggingface/alignment-handbook)

+For more details, please refer to the original model card: [Meltemi 7B Instruct v1.5](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1.5).

# Instruction format

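A side note on the alignment recipe removed in this commit: the deleted bullets describe ORPO fine-tuning with the TRL library on prompt/chosen/rejected preference data. Below is a minimal, hypothetical sketch of what such a run looks like with TRL's `ORPOTrainer`; the dataset path, hyperparameters, and argument names are illustrative assumptions, not Meltemi's actual training setup.

```python
# Minimal ORPO sketch with TRL (illustrative; not Meltemi's actual recipe).
# Assumes: pip install trl transformers datasets, and a JSONL preference
# dataset with "prompt", "chosen", "rejected" fields (hypothetical path).
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base = "mistralai/Mistral-7B-Instruct-v0.1"  # Meltemi's base model
model = AutoModelForCausalLM.from_pretrained(base)
tokenizer = AutoTokenizer.from_pretrained(base)

train_dataset = load_dataset("json", data_files="greek_preferences.jsonl", split="train")

args = ORPOConfig(
    output_dir="meltemi-orpo",
    beta=0.1,                  # weight of the odds-ratio term in the ORPO loss
    max_length=8192,           # matches the model's context length
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
)

trainer = ORPOTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    processing_class=tokenizer,  # named `tokenizer=` in older TRL releases
)
trainer.train()
```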
@@ -52,4 +45,3 @@ llama-server -m ./Meltemi-7B-Instruct-v1.5-F16.gguf --port 8080
```


-For more details, please refer to the original model: https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1.5
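As a usage note on the `llama-server` command shown in the hunk header above: recent llama.cpp builds expose an OpenAI-compatible chat endpoint, so once the server is running on port 8080 the model can be queried as sketched below. The endpoint path follows llama.cpp's server documentation; the prompt and sampling values are just examples.

```python
# Query the llama-server started with:
#   llama-server -m ./Meltemi-7B-Instruct-v1.5-F16.gguf --port 8080
# Recent llama.cpp builds expose an OpenAI-compatible /v1/chat/completions.
import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "messages": [
            {"role": "user", "content": "Πες μου για το Αιγαίο."},  # "Tell me about the Aegean."
        ],
        "temperature": 0.7,
        "max_tokens": 256,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```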