DeepInfra / Llama-2-70b-chat-hf-trt-fp8
License: llama2
main
2 contributors
History: 14 commits
Latest commit: Pernekhan, "Set legacy to True after initialization" (95d9654), 12 months ago
Name            Size      Last commit message                          Updated
ensemble        -         update models for newer trt 0.6.1 version    12 months ago
postprocessing  -         bugfix                                       12 months ago
preprocessing   -         Set legacy to True after initialization      12 months ago
tensorrt_llm    -         update models for newer trt 0.6.1 version    12 months ago
.gitattributes  1.91 kB   add trtllm weights                           12 months ago
.gitignore      6 Bytes   add smaller files                            12 months ago
README.md       24 Bytes  initial commit                               12 months ago