Update README.md (#17)

Files changed (1) hide show

README.md CHANGED Viewed

@@ -221,7 +221,7 @@ Developers should apply responsible AI best practices and are responsible for en
 * Architecture: Phi-3 Mini has 3.8B parameters and is a dense decoder-only Transformer model. The model is fine-tuned with Supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) to ensure alignment with human preferences and safety guidlines.
 * Inputs: Text. It is best suited for prompts using chat format.
-* Context length: 128K tokens
 * GPUS: 512 H100-80G
 * Training time: 7 days
 * Training data: 3.3T tokens

 * Architecture: Phi-3 Mini has 3.8B parameters and is a dense decoder-only Transformer model. The model is fine-tuned with Supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) to ensure alignment with human preferences and safety guidlines.
 * Inputs: Text. It is best suited for prompts using chat format.
+* Context length: 4K tokens
 * GPUS: 512 H100-80G
 * Training time: 7 days
 * Training data: 3.3T tokens