Safetensors
Lithuanian
llama
Lt-Llama-2-7b-hf / README.md
artena's picture
Update README.md
3d8482f verified
metadata
license: llama2
language:
  - lt
datasets:
  - uonlp/CulturaX

Model Card for Model ID

Lt-Llama2 is a family of pretrained and fine-tuned generative text models for Lithuanian. This is the repository for the foundational 7B model. Links to other models can be found at the bottom of this page.

Model Details

Model Description

Neurotechnology company marks the first open-source initiative dedicated to developing a large language model (LLM) specialized in Lithuanian. The company has created and publicly released a collection of Lithuanian LLMs, available both as foundational models and instructional variants.

  • Developed by: Neurotechnology
  • Language(s): Lithuanian
  • License: Llama2 Community License Agreement
  • Continual pretrained from model: Llama-2-7b

Model Sources

Intended Use

Intended Use Cases

Lt-Llama2 is designed for research purposes in Lithuanian. The base models can be tailored for various natural language tasks, while the instruction models are geared towards assistant-like conversational interactions.

Prohibited use

Utilizing the model in ways that breach the license, violate any applicable laws or regulations, or involve languages other than Lithuanian.

How to Get Started with the Model

Use the code below to get started with the model.

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("neurotechnology/Lt-Llama-2-7b-hf")
model = AutoModelForCausalLM.from_pretrained("neurotechnology/Lt-Llama-2-7b-hf")
input_text = "Kartą gyveno senelis ir senelė "
input_ids = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**input_ids, max_new_tokens=100)
print(tokenizer.decode(outputs[0]))

Benchmarks

Model Average ARC MMLU Winogrande HellaSwag GSM8k TruthfulQA
Llama-2-7b 29.16 27.82 26.98 51.93 28.35 2.5 37.38
Lt-Llama2-7b 32.90 43.18 26.01 53.67 33.17 0.0 41.38

RoLlama2 Model Family

Model Link
Lt-Llama2-7b link
Lt-Llama2-7b-instruct link
Lt-Llama2-13b link
Lt-Llama2-13b-instruct link

Citation

@misc{nakvosas2024openllama2modellithuanian,
      title={Open Llama2 Model for the Lithuanian Language},
      author={Artūras Nakvosas and Povilas Daniušis and Vytas Mulevičius},
      year={2024},
      eprint={2408.12963},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2408.12963},
}