dictalm-7b / modeling_megatron_gpt.py

Commit History

Updated flash attention usage
95780d0

Shaltiel commited on

Upload 11 files
f4d185c

Shaltiel commited on