Uploaded model

Developed by: NotAiLOL
License: apache-2.0
Finetuned from model: unsloth/Qwen2-0.5B-Instruct-bnb-4bit

This Qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Details

This model was trained on microsoft/orca-math-word-problems-200k for 3 epochs with rsLoRA + QLoRA.
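As a rough illustration of what rsLoRA changes relative to plain LoRA: standard LoRA scales the adapter update by alpha / r, while rank-stabilized LoRA scales it by alpha / sqrt(r), so the update does not shrink as quickly when the rank grows. QLoRA, separately, refers to training such adapters on top of a 4-bit quantized base model. The card does not state the rank or alpha that were used, so the values below are purely hypothetical, and `lora_scaling` is an illustrative helper, not part of any library.

```python
import math


def lora_scaling(alpha: float, rank: int, use_rslora: bool = False) -> float:
    """Return the factor applied to the low-rank update B @ A.

    Standard LoRA uses alpha / rank; rsLoRA uses alpha / sqrt(rank).
    """
    if rank <= 0:
        raise ValueError("rank must be a positive integer")
    if use_rslora:
        return alpha / math.sqrt(rank)
    return alpha / rank


# Hypothetical hyperparameters (not taken from the model card).
alpha = 16.0
for rank in (16, 64):
    standard = lora_scaling(alpha, rank)
    stabilized = lora_scaling(alpha, rank, use_rslora=True)
    print(f"r={rank}: LoRA scale={standard}, rsLoRA scale={stabilized}")
```

With these assumed values, raising the rank from 16 to 64 cuts the standard LoRA scale from 1.0 to 0.25, while the rsLoRA scale only drops from 4.0 to 2.0.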

[Figure: Training loss graph]

The model expects prompts in the following ChatML-style template:

<|im_start|>system
You are a professional mathematician.<|im_end|>

<|im_start|>user
{}<|im_end|>

<|im_start|>assistant
{}
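
A minimal sketch of filling in the template above for inference. It assumes the system line ends with the full `<|im_end|>` token and that the blank lines between turns are part of the template as printed; whether training used exactly this whitespace is not confirmed by the card. `PROMPT_TEMPLATE` and `build_prompt` are hypothetical names chosen for illustration.

```python
# Template copied from the prompt format shown above. The first slot is the
# user's question; the second slot (the assistant's reply) is left empty at
# inference time so the model can generate it.
PROMPT_TEMPLATE = (
    "<|im_start|>system\n"
    "You are a professional mathematician.<|im_end|>\n"
    "\n"
    "<|im_start|>user\n"
    "{}<|im_end|>\n"
    "\n"
    "<|im_start|>assistant\n"
    "{}"
)


def build_prompt(question: str, answer: str = "") -> str:
    """Fill the template with a question and an optional reference answer."""
    return PROMPT_TEMPLATE.format(question.strip(), answer)


prompt = build_prompt("A farmer has 12 crates with 7 apples each. How many apples are there?")
print(prompt)
```

The resulting string can then be tokenized and passed to whichever generation API is used to load the model; generation would typically be stopped when the model emits `<|im_end|>`.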

Weights format: Safetensors
Model size: 494M params
Tensor type: FP16

Model tree for NotASI/Qwen2-0.5B-Math

Base model: unsloth/Qwen2-0.5B-Instruct-bnb-4bit
Finetuned: this model
