Shears
Collection
Shears Models (Shears: Unstructured Sparsity with Neural Low-rank Adapter Search)
•
14 items
•
Updated
The sparsified MPT-7B with 50% sparsity as a base model in Shears.
@article{munoz2024shears,
title = {Shears: Unstructured Sparsity with Neural Low-rank Adapter Search},
author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
journal={The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-2024)},
year={2024}
}
Thanks to the work Wanda (paper, code), which provides a simple but effective pruning approach.
Apache-2.0