Feature Selective Anchor-Free Module for Single-Shot Object Detection Paper โข 1903.00621 โข Published Mar 2, 2019
AMC: AutoML for Model Compression and Acceleration on Mobile Devices Paper โข 1802.03494 โข Published Feb 10, 2018
Channel Pruning for Accelerating Very Deep Neural Networks Paper โข 1707.06168 โข Published Jul 19, 2017
Upcycling Large Language Models into Mixture of Experts Paper โข 2410.07524 โข Published Oct 10 โข 3 โข 1
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper โข 2408.10188 โข Published Aug 19 โข 51
Upcycling Large Language Models into Mixture of Experts Paper โข 2410.07524 โข Published Oct 10 โข 3
Upcycling Large Language Models into Mixture of Experts Paper โข 2410.07524 โข Published Oct 10 โข 3
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper โข 2403.07816 โข Published Mar 12 โข 39