Yongchang Hao's picture

7 2 1

Yongchang Hao

yongchanghao

·

https://yongchanghao.github.io

AI & ML interests

None yet

Recent Activity

Reacted to their post with 🔥 24 days ago

We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below). Read more about the work at https://huggingface.co/papers/2410.20650

authored a paper 25 days ago

Teacher Forcing Recovers Reward Functions for Text Generation

posted an update 27 days ago

We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below). Read more about the work at https://huggingface.co/papers/2410.20650

View all activity

Organizations

yongchanghao's activity

Reacted to their post with 🔥 24 days ago

Post

3733

We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below).

Read more about the work at NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks (2410.20650)

authored a paper 25 days ago

Teacher Forcing Recovers Reward Functions for Text Generation

Paper • 2210.08708 • Published Oct 17, 2022

posted an update 27 days ago

Post

3733

We just released a paper (NeuZip) that compresses VRAM in a lossless manner to run larger models. This should be particularly useful when VRAM is insufficient during training/inference. Specifically, we look inside each floating number and find that the exponents are highly compressible (as shown in the figure below).

Read more about the work at NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks (2410.20650)

upvoted a paper 27 days ago

Flora: Low-Rank Adapters Are Secretly Gradient Compressors

Paper • 2402.03293 • Published Feb 5 • 6

commented a paper 27 days ago

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28 • 16 •

upvoted a paper 27 days ago

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28 • 16

New activity in EleutherAI/coqa about 1 month ago

Raw JSON files from the paper's official host

#2 opened about 1 month ago by

updated a dataset about 1 month ago

EleutherAI/coqa

Updated Oct 3 • 3.7k • 2

New activity in THUDM/LongBench 2 months ago

Convert dataset to Parquet

#5 opened 2 months ago by

Convert dataset to Parquet

#4 opened 2 months ago by

Convert dataset to Parquet

#3 opened 2 months ago by

New activity in Anthropic/hh-rlhf over 1 year ago

Identical 'chosen' and 'rejected'

#7 opened over 1 year ago by

liked a dataset almost 2 years ago

stanfordnlp/SHP

Viewer • Updated Oct 10, 2023 • 386k • 1.66k • 295

New activity in google/bigbench almost 2 years ago

Missing tasks

#5 opened almost 2 years ago by