Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-7B-Chat-Int8
like
8
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
8-bit precision
gptq
arxiv:
5 papers
Model card
Files
Files and versions
Community
1
Train
Use this model
7a7e29c
Qwen-7B-Chat-Int8
1 contributor
History:
7 commits
yangapku
Upload 3 files
7a7e29c
12 months ago
assets
Upload 3 files
12 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
LICENSE
6.9 kB
upload model
12 months ago
NOTICE
2.7 kB
upload model
12 months ago
README.md
30.6 kB
update int8 quantization info
12 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
upload model
12 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
upload model
12 months ago
config.json
1.2 kB
upload model
12 months ago
configuration_qwen.py
2.35 kB
upload model
12 months ago
cpp_kernels.py
1.92 kB
upload model
12 months ago
generation_config.json
249 Bytes
update default generate hyperparams
12 months ago
model-00001-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00002-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00003-of-00005.safetensors
2.03 GB
LFS
upload model
12 months ago
model-00004-of-00005.safetensors
1.8 GB
LFS
upload model
12 months ago
model-00005-of-00005.safetensors
1.24 GB
LFS
upload model
12 months ago
model.safetensors.index.json
65.7 kB
upload model
12 months ago
modeling_qwen.py
57.6 kB
update modeling_qwen.py
12 months ago
quantize_config.json
214 Bytes
update int8 quantization info
12 months ago
qwen.tiktoken
2.56 MB
upload model
12 months ago
qwen_generation_utils.py
14.6 kB
upload model
12 months ago
tokenization_qwen.py
9.62 kB
upload model
12 months ago
tokenizer_config.json
173 Bytes
upload model
12 months ago