Qian Liu's picture

Qian Liu PRO

SivilTaram

·

http://siviltaram.github.io/

AI & ML interests

Cooking cool things

Recent Activity

New activity 3 days ago

OpenCoder-LLM/opc-sft-stage1

updated a dataset 3 days ago

OpenCoder-LLM/opc-sft-stage1

updated a dataset 3 days ago

OpenCoder-LLM/opc-sft-stage2

Articles

RegMix: Data Mixture as Regression for Language Model Pre-training

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Efficient Table Pre-training without Real Data: An Introduction to TAPEX

Organizations

SivilTaram's activity

New activity in OpenCoder-LLM/opc-sft-stage1 3 days ago

License

#5 opened 3 days ago by

New activity in OpenCoder-LLM/opc-annealing-corpus 7 days ago

License

#3 opened 7 days ago by

New activity in OpenCoder-LLM/fineweb-code-corpus 10 days ago

Code elements inside web page are badly processed for FineWeb

#2 opened 10 days ago by

commented a paper about 1 month ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9 • 7 •

New activity in SivilTaram/starcoder2-documentation about 1 month ago

release plan for the rest of the-stack-v2-train-extras

#2 opened about 1 month ago by

New activity in microsoft/tapex-large-finetuned-wtq 3 months ago

is it possible to support multiple languages, like Chinese?

#5 opened 4 months ago by

New activity in bigcode/the-stack-v2 3 months ago

"Documentation" data?

#8 opened 8 months ago by

Where is the-stack-v2-train-extras?

#17 opened 8 months ago by

question about starcoder 2 jupyter notebook conversion

#29 opened 4 months ago by

commented 3 papers 4 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52 •

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52 •

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52 •

New activity in sail/regmix-data 5 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

New activity in sail/regmix-data-sample 5 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

commented 3 papers 5 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1 • 35 •

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1 • 35 •

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14 • 38 •

New activity in sail/Sailor-14B-Chat 6 months ago

Adding `safetensors` variant of this model

#1 opened 6 months ago by

New activity in sail/Sailor-7B 7 months ago

Any plan to open source the 200 B token dataset?

#2 opened 7 months ago by

New activity in aisingapore/sea-lion-7b-instruct 7 months ago

Consider Reporting Performance of Sailor-Chat

#1 opened 8 months ago by