iqwiki-kor
/

Qwen2.5-3B-MP-RM

Text Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

JW17 commited on Oct 7

Commit

41f8e32

•

1 Parent(s): 98eac45

End of training

Files changed (2) hide show

README.md +1 -1
config.json +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # Qwen2.5-3B-Instruct-E80
-This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) on an unknown dataset.
 ## Model description

 # Qwen2.5-3B-Instruct-E80
+This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) on the iqwiki-kor/mhp-108k dataset.
 ## Model description

config.json CHANGED Viewed

@@ -30,7 +30,7 @@
   "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.45.1",
-  "use_cache": false,
   "use_sliding_window": false,
   "vocab_size": 151666
 }

   "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.45.1",
+  "use_cache": true,
   "use_sliding_window": false,
   "vocab_size": 151666
 }