deepseek-ai
/

DeepSeek-V2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (1)

This is by far the best model I have seen until now.

#8 opened 5 months ago by

How many tokens per second when using Deepseek-V2(236B) as inference model in 8*A100

#7 opened 6 months ago by

Can DeepSeek-V2 run on two nodes (each with 4 A100)?

#5 opened 6 months ago by

Calculation of _mscale during YARN RoPE scaling

#4 opened 6 months ago by

keyError: 'sdpa'

#3 opened 7 months ago by

Smaller Models

#2 opened 7 months ago by

KV Cache for compress_kv or key-value states

#1 opened 7 months ago by