CheeryLJH (Jiaheng Liu)

upvoted a paper 4 days ago

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published 8 days ago • 45

upvoted a paper 7 days ago

Pixel-Space Post-Training of Latent Diffusion Models

Paper • 2409.17565 • Published 8 days ago • 18

upvoted 2 papers 9 days ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published 11 days ago • 24

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published 10 days ago • 40

upvoted a paper 15 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 16 days ago • 120

upvoted a paper 28 days ago

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Paper • 2409.01944 • Published about 1 month ago • 44

upvoted a paper about 1 month ago

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17 • 51

upvoted a paper about 2 months ago

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15 • 31

upvoted a paper 2 months ago

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23 • 20

upvoted an article 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 244

upvoted 5 papers 3 months ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25 • 20

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21 • 60

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20 • 21

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Paper • 2406.11817 • Published Jun 17 • 13

upvoted 5 papers 4 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 48

upvoted 5 papers 6 months ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 62

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9 • 33

MuPT: A Generative Symbolic Music Pretrained Transformer

Paper • 2404.06393 • Published Apr 9 • 14

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5 • 12

upvoted a collection 6 months ago

M-A-P Full Paper List

Collection

25 items • Updated 6 days ago • 4

upvoted a collection 7 months ago

LLM Leaderboard best models ❤️‍🔥

Collection

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated Jun 22 • 399

upvoted a paper 7 months ago

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 82

upvoted a paper 9 months ago

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Paper • 2401.06951 • Published Jan 13 • 24

Jiaheng Liu

AI & ML interests

Organizations

CheeryLJH's activity