Collections including paper arxiv:2305.17190
- Wide Residual Networks
  Paper • 1605.07146 • Published • 2
- Characterizing signal propagation to close the performance gap in unnormalized ResNets
  Paper • 2101.08692 • Published • 2
- Pareto-Optimal Quantized ResNet Is Mostly 4-bit
  Paper • 2105.03536 • Published • 2
- When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations
  Paper • 2106.01548 • Published • 2

- Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
  Paper • 2305.15265 • Published • 1
- Mesa: A Memory-saving Training Framework for Transformers
  Paper • 2111.11124 • Published • 1
- Full Parameter Fine-tuning for Large Language Models with Limited Resources
  Paper • 2306.09782 • Published • 29
- Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models
  Paper • 2106.02679 • Published • 1

- Sparse Backpropagation for MoE Training
  Paper • 2310.00811 • Published • 2
- The Forward-Forward Algorithm: Some Preliminary Investigations
  Paper • 2212.13345 • Published • 2
- Fine-Tuning Language Models with Just Forward Passes
  Paper • 2305.17333 • Published • 2
- Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation
  Paper • 2309.13192 • Published • 1