deepseek-ai/DeepSeek-V2-Chat

Text Generation

text-generation-inference

Inference Endpoints

NAN issue using FP16 to load the model

#15 opened 11 days ago by

ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. Run `pip install flash_attn`

#14 opened 4 months ago by

How much memory is needed if you make the 128k context length

#13 opened 5 months ago by

Implement MLA inference optimizations to DeepseekV2Attention

#12 opened 5 months ago by

Can you provide a sample code for training with DeepSpeed ZeRO3?

#10 opened 6 months ago by

Ollama support

#9 opened 6 months ago by

MoE offloading strategy?

#8 opened 6 months ago by

Update README.md

#7 opened 6 months ago by VanishingPsychopath

kv cache

#6 opened 6 months ago by

function/tool calling support

#5 opened 6 months ago by

fail to run the example

#4 opened 6 months ago by

GPTQ plz

#3 opened 6 months ago by Parkerlambert123

vllm support

#2 opened 6 months ago by

llama.cpp support

#1 opened 6 months ago by