frankminors123 committed
Commit b5a333d
Parent(s): 0671c1f
Update README.md
README.md CHANGED
@@ -12,6 +12,6 @@ tags:
 
 The training data contains approximately 400 million tokens which come from high-quality code datasets on HuggingFace.
 
-In addition, we applied `memory_efficient_attention` to the pre-training, which saves us a significant amount of GPU memory.
+In addition, we applied `memory_efficient_attention` to the pre-training, which saves us a significant amount of GPU memory. If you want to quickly apply this technique to your LLaMA model, you can refer to my GitHub: https://github.com/FrankMinions/memory_efficient_adapter.
 
 Our model can be used for SFT, and we hope to contribute more valuable work in the Chinese field.
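For readers who want a concrete picture of what `memory_efficient_attention` refers to in a LLaMA-style model, below is a minimal sketch assuming the xformers implementation (`xformers.ops.memory_efficient_attention`); the adapter in the linked repository may wire this in differently, and the shapes shown are only illustrative.

```python
# Minimal sketch (not the author's adapter): replacing standard scaled-dot-product
# attention in a LLaMA-style attention layer with xformers' memory-efficient kernel.
import torch
import xformers.ops as xops


def memory_efficient_llama_attention(q, k, v, dropout_p: float = 0.0):
    """q, k, v: [batch, seq_len, num_heads, head_dim], the layout xformers expects.

    A naive implementation materializes a [seq_len, seq_len] attention matrix per
    head; memory_efficient_attention computes the same softmax(QK^T / sqrt(d)) V
    result in tiles, so that matrix is never stored, which is where the GPU
    memory saving comes from.
    """
    return xops.memory_efficient_attention(
        q, k, v,
        attn_bias=xops.LowerTriangularMask(),  # causal mask for a decoder-only model
        p=dropout_p,
    )


# Example usage with illustrative LLaMA-7B-like shapes (requires a CUDA device):
if torch.cuda.is_available():
    q = k = v = torch.randn(1, 2048, 32, 128, device="cuda", dtype=torch.float16)
    out = memory_efficient_llama_attention(q, k, v)  # -> [1, 2048, 32, 128]
```

In a full model, a call like this would replace only the attention score computation inside each attention layer's forward pass, leaving the projections and the rest of the architecture untouched.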