Article Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique By lyogavin • Nov 30, 2023 • 23
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation Paper • 2407.14505 • Published Jul 19 • 25
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Paper • 2410.19313 • Published 30 days ago • 18 • 5
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Paper • 2410.13861 • Published Oct 17 • 53
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration Paper • 2410.02367 • Published Oct 3 • 47