Lei Zhang's picture

6 8 4

Lei Zhang

Lemoncoke

·

AI & ML interests

None yet

Organizations

Lemoncoke's activity

upvoted a paper about 1 month ago

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Paper • 2409.18943 • Published Sep 27 • 26

upvoted an article about 1 month ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21

• 22

upvoted 3 papers about 1 month ago

One Shot Learning as Instruction Data Prospector for Large Language Models

Paper • 2312.10302 • Published Dec 16, 2023 • 3

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Paper • 2405.15232 • Published May 24 • 2

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

Paper • 2406.17419 • Published Jun 25 • 16

upvoted 2 papers about 2 months ago

Marathon: A Race Through the Realm of Long Context with Large Language Models

Paper • 2312.09542 • Published Dec 15, 2023 • 1

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 125

upvoted a collection about 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 14 items • Updated Sep 25 • 85