Qwen-7B-Chat-Int8 / modeling_qwen.py

Commit History

remove fix-sized causal mask
dcef457

yangapku commited on

add kernel file check in modeling_qwen.py
c94803d

yangapku commited on

update modeling.py
24ac14a

yangapku commited on

update modeling_qwen.py
502a463

yangapku commited on

update batch inference
1241954

yangapku commited on

upload model
ce1512e

yangapku commited on