Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-7B-Chat-Int8
like
8
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
8-bit precision
gptq
arxiv:
5 papers
Model card
Files
Files and versions
Community
1
Train
Use this model
main
Qwen-7B-Chat-Int8
1 contributor
History:
18 commits
yangapku
update wechat
e531de0
10 months ago
assets
update wechat
10 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
LICENSE
6.9 kB
upload model
12 months ago
NOTICE
15.3 kB
update
10 months ago
README.md
30.7 kB
update wechat
10 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
upload model
12 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
upload model
12 months ago
config.json
1.2 kB
update
10 months ago
configuration_qwen.py
2.35 kB
upload model
12 months ago
cpp_kernels.py
1.92 kB
upload model
12 months ago
generation_config.json
273 Bytes
update
10 months ago
model-00001-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00002-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00003-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00004-of-00005.safetensors
1.8 GB
LFS
upload model
12 months ago
model-00005-of-00005.safetensors
1.24 GB
LFS
upload model
12 months ago
model.safetensors.index.json
65.7 kB
upload model
12 months ago
modeling_qwen.py
55.6 kB
update modeling_qwen.py
10 months ago
quantize_config.json
214 Bytes
update int8 quantization info
12 months ago
qwen.tiktoken
2.56 MB
upload model
12 months ago
qwen_generation_utils.py
14.6 kB
upload model
12 months ago
tokenization_qwen.py
9.62 kB
upload model
12 months ago
tokenizer_config.json
174 Bytes
update
10 months ago