Edit model card

Coding model comming soon!

Uploaded model

  • Developed by: NotAiLOL
  • License: apache-2.0
  • Finetuned from model : unsloth/Qwen2-0.5B-Instruct-bnb-4bit

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Details

This model was trained on microsoft/orca-math-word-problems-200k for 3 epochs with rsLoRA + QLoRA.

Training Loss Graph image/png

The model follows the Alpaca format:

<|im_start|>system
You are a professional mathematician.|im_end|>

<|im_start|>user
{}<|im_end|>

<|im_start|>assistant
{}
Downloads last month
9
Safetensors
Model size
494M params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for NotASI/Qwen2-0.5B-Math

Finetuned
(8)
this model

Dataset used to train NotASI/Qwen2-0.5B-Math

Collection including NotASI/Qwen2-0.5B-Math