raincandy-u/Coder1.8-ORPO-TEST

raincandy-u committed on Apr 19

Commit 4538042 • 1 Parent(s): 809e1a0

Update README.md

Files changed (1)

README.md +7 -7

@@ -23,13 +23,13 @@ This is a test model and may generate incorrect responses. Use at your own risk.
 ## Train Details
-Base: Qwen1.5-1.8B
-Training Data: ~20k [code examples](https://huggingface.co/datasets/reciprocate/dpo_ultra-capybara-code_filtered-best)
-Epochs: 1
-Method: ORPO
-Hardware: 2 x A40
-Quantization: 4-bit QLora
-Lora Rank/Alpha: 16
+- Base: Qwen1.5-1.8B
+- Training Data: ~20k [code examples](https://huggingface.co/datasets/reciprocate/dpo_ultra-capybara-code_filtered-best)
+- Epochs: 1
+- Method: ORPO
+- Hardware: 2 x A40
+- Quantization: 4-bit QLora
+- Lora Rank/Alpha: 16
 # Limitations
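
For readers who want the Train Details in machine-readable form, the list in this diff can be captured as a small Python structure. This is only a sketch: the `TrainDetails` class and its field names are hypothetical and not part of the repository, and it assumes "Lora Rank/Alpha: 16" means both the rank and the alpha are 16.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainDetails:
    """Hypothetical container mirroring the README's Train Details list."""

    base_model: str = "Qwen1.5-1.8B"
    approx_examples: int = 20_000  # "~20k" code examples in the README
    epochs: int = 1
    method: str = "ORPO"
    gpu_count: int = 2  # "2 x A40"
    gpu_model: str = "A40"
    quant_bits: int = 4  # 4-bit QLoRA
    lora_rank: int = 16
    lora_alpha: int = 16  # assumption: "Rank/Alpha: 16" means both are 16

    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / rank.
        return self.lora_alpha / self.lora_rank


details = TrainDetails()
print(details.method, details.lora_scaling())
```

Under that assumption the LoRA scaling factor alpha / rank works out to 1.0; if the author used a different alpha, the factor changes accordingly.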