Chuanming Liu's picture

Chuanming Liu

Chuanming

·

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Organizations

Chuanming's activity

upvoted a collection 4 months ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 43 items • Updated 4 days ago • 53

upvoted a paper 4 months ago

RewardBench: Evaluating Reward Models for Language Modeling

Paper • 2403.13787 • Published Mar 20 • 19

upvoted 2 articles 4 months ago

Article

Putting RL back in RLHF

Jun 12

• 60

Article

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

Jun 5

• 17

upvoted 3 collections 4 months ago

YOLOv10

This collection hosts the YOLOv10 model releases • 16 items • Updated Jun 3 • 16

Yi-1.5 (2024/05)

10 items • Updated May 20 • 89

SimPO

This collections contains a list of SimPO and baseline models. • 49 items • Updated 26 days ago • 13

upvoted 2 collections 5 months ago

LLM architecture

90 items • Updated Jul 18 • 6

OpenELM Pretrained Models

4 items • Updated Jun 19 • 46

upvoted 2 articles 5 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22

• 78

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 221

upvoted 3 collections 6 months ago

LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated Jun 22 • 398

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17 • 56

MoEs papers reading list

57 items • Updated 8 days ago • 133

upvoted a paper 7 months ago

NExT-GPT: Any-to-Any Multimodal LLM

Paper • 2309.05519 • Published Sep 11, 2023 • 78

upvoted a collection 7 months ago

occiglot-eu5-7b-v0.1

First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. • 10 items • Updated Mar 7 • 21

upvoted a paper 8 months ago

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

Paper • 2309.14509 • Published Sep 25, 2023 • 17

upvoted 2 collections 8 months ago

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。 • 32 items • Updated Feb 18 • 53

MoE-LLaVA Model

9 items • Updated Feb 2 • 8

upvoted a paper 8 months ago

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31 • 21

upvoted a collection 8 months ago

AIM

AIM: Autoregressive Image Models • 5 items • Updated Jun 19 • 48

upvoted 2 collections 9 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 212

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 64

upvoted a collection 10 months ago

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 112