Experts Weights Averaging: A New General Training Scheme for Vision Transformers Paper • 2308.06093 • Published Aug 11, 2023 • 2
Weight Averaging Improves Knowledge Distillation under Domain Shift Paper • 2309.11446 • Published Sep 20, 2023 • 1
SWAMP: Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning Paper • 2305.14852 • Published May 24, 2023 • 1
Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging Paper • 2306.16788 • Published Jun 29, 2023 • 1