LLaMA-2-7B-32K / modeling_flash_llama.py

Commit History

Add _support_flash_attn_2 to Llama 2 32k (#37)
061211f
verified

arshzahed commited on

Correct the output dtype of rmsnorm_func (#13)
aef6d89

juewang ag0 commited on

remove torch.jit
4ec6edc

juewang commited on

init
cf6ad2b

juewang commited on