Model Summary

GritLM is a generative representational instruction tuned language model. It unifies text representation (embedding) and text generation into a single model achieving state-of-the-art performance on both types of tasks.

Repository: ContextualAI/gritlm
Paper: https://arxiv.org/abs/2402.09906
Logs: https://wandb.ai/muennighoff/gritlm/runs/id130s1m/overview
Script: https://github.com/ContextualAI/gritlm/blob/main/scripts/training/train_gritlm_8x7b.sh

Model	Description
GritLM 7B	Mistral 7B finetuned using GRIT
GritLM 8x7B	Mixtral 8x7B finetuned using GRIT

Use

The model usage is documented here.

Citation

@misc{muennighoff2024generative,
      title={Generative Representational Instruction Tuning}, 
      author={Niklas Muennighoff and Hongjin Su and Liang Wang and Nan Yang and Furu Wei and Tao Yu and Amanpreet Singh and Douwe Kiela},
      year={2024},
      eprint={2402.09906},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Downloads last month: 3,764

Safetensors

Model size

46.7B params

Tensor type

BF16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for GritLM/GritLM-8x7B

Quantizations

2 models

Dataset used to train GritLM/GritLM-8x7B

Space using GritLM/GritLM-8x7B 1

Collection including GritLM/GritLM-8x7B

GritLM

Collection

Generative Representational Instruction Tuning (GRIT) • 64 items • Updated Apr 17 • 7

Evaluation results

accuracy on MTEB AmazonCounterfactualClassification (en)
test set self-reported

80.478
ap on MTEB AmazonCounterfactualClassification (en)
test set self-reported

44.388
f1 on MTEB AmazonCounterfactualClassification (en)
test set self-reported

74.336
accuracy on MTEB AmazonPolarityClassification
test set self-reported

96.322
ap on MTEB AmazonPolarityClassification
test set self-reported

94.803
f1 on MTEB AmazonPolarityClassification
test set self-reported

96.321
accuracy on MTEB AmazonReviewsClassification (en)
test set self-reported

57.184
f1 on MTEB AmazonReviewsClassification (en)
test set self-reported

55.945
map_at_1 on MTEB ArguAna
test set self-reported

34.353
map_at_10 on MTEB ArguAna
test set self-reported

50.773

View on Papers With Code