
language: en

rawpowertools/MH_250T_L_Qwen2_500M Model Data

Base_Model: unsloth/Qwen2-0.5B

Training_Data: mh_250_train

Eval_Input: mh_small_test

Epochs: 5

Rank: 32

Alpha: 32

LR: 0.0005

LR_Scheduler: linear

ClearML: http://clearml.rptinternal.com:8080/projects/d061c7fcfaa049b69a4ee1ff0ed89be2/experiments/beb81b4f2f9b402b92f65791f6c8917e/output/log
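The hyperparameters above can be read as follows. This is a minimal sketch, assuming `Rank` and `Alpha` refer to a LoRA-style adapter (the card does not say so explicitly) and that the `linear` scheduler decays the learning rate to zero with no warmup. The `AdapterConfig` class, the helper functions, and the `total_steps` value are hypothetical and are not taken from the training run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AdapterConfig:
    """Values copied from the card; the class itself is illustrative only."""
    epochs: int = 5
    rank: int = 32
    alpha: int = 32
    lr: float = 0.0005
    lr_scheduler: str = "linear"


def lora_scaling(cfg: AdapterConfig) -> float:
    # In standard LoRA the low-rank update is scaled by alpha / rank.
    return cfg.alpha / cfg.rank


def linear_lr(cfg: AdapterConfig, step: int, total_steps: int) -> float:
    # Assumption: no warmup, linear decay from cfg.lr down to 0 at total_steps.
    remaining = max(0.0, 1.0 - step / total_steps)
    return cfg.lr * remaining


cfg = AdapterConfig()
total_steps = 1000  # hypothetical; the card does not state the step count

print("LoRA scaling factor:", lora_scaling(cfg))
print("LR at start:", linear_lr(cfg, 0, total_steps))
print("LR at midpoint:", linear_lr(cfg, total_steps // 2, total_steps))
```

With `Rank` equal to `Alpha`, the scaling factor under this assumption is 1, so the adapter update is applied unscaled.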

Format: Safetensors

Model_Size: 494M params

Tensor_Type: BF16