Yixin Song's picture

Yixin Song

yixinsong

·

AI & ML interests

None yet

Recent Activity

liked a dataset 26 days ago

BAAI/Infinity-MM

liked a dataset 28 days ago

dyyyyyyyy/ScaleQuest-Math

liked a dataset 30 days ago

Organizations

yixinsong's activity

New activity in PowerInfer/TurboSparse-Mistral-Instruct 2 months ago

problems about sample strategies

#1 opened 2 months ago by

New activity in yixinsong/persona 3 months ago

[bot] Conversion to Parquet

#1 opened 3 months ago by

parquet-converter

New activity in BAAI/Infinity-Instruct 4 months ago

0729聊天数据集有计划开源吗？

#16 opened 4 months ago by

New activity in HuggingFaceTB/SmolLM-1.7B 4 months ago

MMLU doesn't match on lm-evaluation-harness

#2 opened 4 months ago by

New activity in SparseLLM/relu2-5B 5 months ago

Inference API not working properly. Lack of proper modeling file?

#1 opened 5 months ago by

New activity in SparseLLM/relu-5B 5 months ago

Difference between SparseLLM/relu and SparseLLM/reglu - lack of modeling file?

#1 opened 5 months ago by

commented 3 papers 5 months ago

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10 • 36 •

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10 • 22 •

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10 • 36 •

New activity in migtissera/Tess-v2.5-Qwen2-72B 5 months ago

Nice work! Do we have plan for opening source the datasets?

#1 opened 5 months ago by

New activity in TIGER-Lab/MMLU-Pro 6 months ago

Script for evaluation?

#7 opened 6 months ago by

New activity in Vezora/Mistral-22B-v0.1 6 months ago

Any update about the merge method?

#8 opened 6 months ago by

commented a paper 8 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 603 •

New activity in PowerInfer/Bamboo-DPO-v0.1-gguf 8 months ago

DPO vs non-DPO

#1 opened 8 months ago by

lazyDataScientist

New activity in TencentARC/Mistral_Pro_8B_v0.1 9 months ago

Does this model just trained on cosmopedia?

#1 opened 9 months ago by

New activity in Crystalcareai/Qwen1.5-8x7b 9 months ago

Evaluation result?

#1 opened 9 months ago by

New activity in nampdn-ai/tiny-webtext 10 months ago

TinyWeb usage?

#2 opened 10 months ago by

New activity in Mihaiii/Pallas-0.5-LASER-0.6 11 months ago

Memory reduction

#2 opened 11 months ago by

New activity in cerebras/SlimPajama-627B 11 months ago

How to just download one chunk?

#7 opened about 1 year ago by

New activity in cerebras/SlimPajama-627B about 1 year ago

How to just download one chunk?

#7 opened about 1 year ago by