27 20 169

Kaizhao Liang

kz919

https://kyleliang919.github.io/

AI & ML interests

Multimodal foundational model

Recent Activity

liked a model about 8 hours ago

deepseek-ai/DeepSeek-V2-Lite

liked a model 1 day ago

deepseek-ai/DeepSeek-V2.5

Organizations

kz919's activity

upvoted 2 papers 3 months ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4 • 89

Memory-Efficient LLM Training with Online Subspace Descent

Paper • 2408.12857 • Published Aug 23 • 12

upvoted an article 4 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 166

upvoted a paper 4 months ago

Longhorn: State Space Models are Amortized Online Learners

Paper • 2407.14207 • Published Jul 19 • 17

upvoted a paper 5 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84

upvoted an article 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

upvoted 3 papers 6 months ago

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Paper • 2402.04347 • Published Feb 6 • 13

Towards Modular LLMs by Building and Reusing a Library of LoRAs

Paper • 2405.11157 • Published May 18 • 26

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Paper • 2405.07518 • Published May 13 • 24

upvoted a paper 7 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 254

upvoted 4 papers 9 months ago

Efficiently Adapting Pretrained Language Models To New Languages

Paper • 2311.05741 • Published Nov 9, 2023 • 11

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27 • 19

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27 • 189

upvoted a paper 10 months ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 138

upvoted 2 papers about 1 year ago

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Paper • 2306.02858 • Published Jun 5, 2023 • 18

PIE: Simulating Disease Progression via Progressive Image Editing

Paper • 2309.11745 • Published Sep 21, 2023 • 3

upvoted 3 papers over 1 year ago