YaxinLuo committed on
Commit
70def18
1 Parent(s): fddf168

Create README.md

  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
+ ---
+ license: mit
+ language:
+ - en
+ ---
+ # Model Card for $\gamma$-MoD
+
+ $\gamma$-MoD improves the computational efficiency of Multimodal Large Language Models (MLLMs) by replacing redundant dense layers with Mixture-of-Depth (MoD) layers. This plug-and-play strategy reduces computational cost while maintaining performance.
+ ## Model Details
+ ### Model Description
+
+ $\gamma$-MoD focuses on reducing the number of activated tokens, that is, the tokens each layer actually processes. It converts dense MLLM layers into sparse MoD layers, aiming for better efficiency than existing methods and making MLLMs more practical in resource-constrained environments.
+ Key features include:
+ 1. ARank Metric: Guides which redundant dense layers are replaced with MoD layers.
+ 2. Shared Vision-Language Router: Routes tokens from both modalities through a single router.
+ 3. Masked Routing Learning: Prevents critical tokens from being skipped during model adaptation.
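
The routing idea behind the features above can be sketched in a few lines. This is a minimal, generic Mixture-of-Depth-style illustration, not code from the Gamma-MOD repository: the names `route_tokens`, `sparse_layer`, `capacity`, and `protected` are hypothetical, the router scores are assumed to be given, and the layer is simplified to a per-token function. A single score list covering a mixed image-and-text sequence stands in for a shared router, and `protected` loosely mirrors the goal of never skipping critical tokens.

```python
import math


def route_tokens(scores, capacity, protected=()):
    """Pick which token positions a sparse layer processes.

    scores: one router score per token (higher means more important).
    capacity: fraction of tokens the layer may process, in (0, 1].
    protected: positions that must never be skipped (hypothetical parameter).
    Returns the sorted list of positions to process.
    """
    n = len(scores)
    budget = max(1, math.ceil(capacity * n))
    keep = set(protected)
    # Fill the remaining budget with the highest-scoring unprotected tokens.
    ranked = sorted(
        (i for i in range(n) if i not in keep),
        key=lambda i: scores[i],
        reverse=True,
    )
    keep.update(ranked[: max(0, budget - len(keep))])
    return sorted(keep)


def sparse_layer(tokens, scores, layer_fn, capacity, protected=()):
    """Apply layer_fn to routed tokens only; skipped tokens pass through unchanged."""
    active = set(route_tokens(scores, capacity, protected))
    return [layer_fn(t) if i in active else t for i, t in enumerate(tokens)]


if __name__ == "__main__":
    scores = [0.1, 0.9, 0.4, 0.7]
    print(route_tokens(scores, 0.5))                   # [1, 3]
    print(route_tokens(scores, 0.5, protected=(0,)))   # [0, 1]
```

With a capacity of 0.5, only half of the tokens go through the layer, which is where the compute saving in this kind of design comes from; the skipped tokens simply bypass it.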
+
+ - **Developed by:** Yaxin Luo
+ - **License:** MIT License
+ - **Finetuned from model:** Vicuna-v1.5-7B
+
+ ### Model Sources
+
+ - **Repository:** https://github.com/Yaxin9Luo/Gamma-MOD
+ - **Paper:** https://arxiv.org/abs/2410.13859
+ - **Demo:** https://yaxin9luo.github.io/gamma-mod-webpage/
+
+ ## Citation
+
+ **BibTeX:**
+ ```
+ @misc{luo2024gammamodexploringmixtureofdepthadaptation,
+   title={$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models},
+   author={Yaxin Luo and Gen Luo and Jiayi Ji and Yiyi Zhou and Xiaoshuai Sun and Zhiqiang Shen and Rongrong Ji},
+   year={2024},
+   eprint={2410.13859},
+   archivePrefix={arXiv},
+   primaryClass={cs.CV},
+   url={https://arxiv.org/abs/2410.13859},
+ }
+ ```