Update README.md #4
opened by cyente

README.md CHANGED
@@ -27,7 +27,7 @@ Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (

 **This repo contains the 1.5B Qwen2.5-Coder model**, which has the following features:
 - Type: Causal Language Models
-- Training Stage: Pretraining
+- Training Stage: Pretraining
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings
 - Number of Parameters: 1.54B
 - Number of Paramaters (Non-Embedding): 1.31B
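For context, the feature list in the diff describes the base, pretraining-only checkpoint. Below is a minimal sketch of loading it for plain text completion with the Hugging Face transformers library; the repo id `Qwen/Qwen2.5-Coder-1.5B` is assumed from the PR context and is not stated in the diff itself.

```python
# Minimal sketch (assumed repo id): load the base Qwen2.5-Coder-1.5B checkpoint
# and run plain text completion. No chat template is used, since the card
# describes a pretraining-only (non-instruct) model.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-Coder-1.5B"  # assumption, not confirmed by the diff
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

# Code-completion style prompt; the base model simply continues the text.
prompt = "def fibonacci(n):\n"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```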