Qian Liu PRO

SivilTaram

http://siviltaram.github.io/

AI & ML interests

Cooking cool things

Recent Activity

New activity 3 days ago

OpenCoder-LLM/opc-sft-stage1

updated a dataset 3 days ago

OpenCoder-LLM/opc-sft-stage1

updated a dataset 3 days ago

OpenCoder-LLM/opc-sft-stage2

Articles

RegMix: Data Mixture as Regression for Language Model Pre-training

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Efficient Table Pre-training without Real Data: An Introduction to TAPEX

SivilTaram's activity

upvoted a paper 5 days ago

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15 • 6

upvoted a paper 9 days ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published 10 days ago • 42

upvoted a collection 9 days ago

OpenCoder Model

OpenCoder Models • 9 items • Updated 2 days ago • 9

upvoted 2 collections 12 days ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated 6 days ago • 36

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 3 days ago • 70

upvoted a paper 13 days ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 14 days ago • 108

upvoted a paper 14 days ago

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published 17 days ago • 32

upvoted a collection 24 days ago

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale" • 18 items • Updated 29 days ago • 2

upvoted a paper about 1 month ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9 • 7

upvoted a collection about 2 months ago

ProX Dataset

a collection of pre-training corpora refined by ProX • 5 items • Updated Oct 18 • 5

upvoted a paper about 2 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

upvoted a paper 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

upvoted a collection 2 months ago

MagpieLM

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated Sep 22 • 15

upvoted an article 3 months ago

Meet Yi-Coder: A Small but Mighty LLM for Code

Article • Sep 4 • 12

upvoted a paper 3 months ago

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild

Paper • 2409.03753 • Published Sep 5 • 18

upvoted a collection 3 months ago

OLMoE

Artifacts for open mixture-of-experts language models. • 13 items • Updated 7 days ago • 25

upvoted 4 papers 3 months ago

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Paper • 2409.00509 • Published Aug 31 • 38

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19 • 51