Qwen
/

Qwen-7B-Chat-Int8

Text Generation

8-bit precision

Model card Files Files and versions Community

Qwen-7B-Chat-Int8 / modeling_qwen.py

Commit History

remove fix-sized causal mask

dcef457

yangapku commited on Nov 14, 2023

add kernel file check in modeling_qwen.py

c94803d

yangapku commited on Nov 5, 2023

update modeling.py

24ac14a

yangapku commited on Oct 26, 2023

update modeling_qwen.py

502a463

yangapku commited on Oct 16, 2023

update batch inference

1241954

yangapku commited on Oct 14, 2023

upload model

ce1512e

yangapku commited on Oct 11, 2023