metadata

language:
  - en
license: other
tags:
  - facebook
  - nvidia
  - meta
  - pytorch
  - llama
  - llama-3
  - mlx
pipeline_tag: text-generation
license_name: llama3
license_link: LICENSE

mlx-community/Llama3-ChatQA-1.5-8B-4bit

This model was converted to MLX format from mlx-community/Llama3-ChatQA-1.5-8B using mlx-lm version 0.12.0.

Model added by Prince Canuma.

Refer to the original model card for more details on the model.

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Llama3-ChatQA-1.5-8B-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)