Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 4 days ago • 88
Large Scale Transfer Learning for Tabular Data via Language Modeling Paper • 2406.12031 • Published Jun 17 • 9
TabuLa-8B Collection Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 • 4 items • Updated Jun 19 • 10
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17 • 19
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 54
📚 FineWeb-Edu Collection FineWeb-Edu datasets, classifier and ablation model • 5 items • Updated Jun 12 • 11
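A minimal sketch of how the FineWeb-Edu data could be sampled with the `datasets` library; the repository id `HuggingFaceFW/fineweb-edu`, the `sample-10BT` config, and the `text` field are assumptions, not quoted from the card above — check the collection page for the exact names.

```python
# Sketch: stream a few FineWeb-Edu examples without downloading the full dataset.
# Repo id, config name, and column name below are assumptions.
from datasets import load_dataset

ds = load_dataset(
    "HuggingFaceFW/fineweb-edu",  # assumed repo id
    name="sample-10BT",           # assumed config
    split="train",
    streaming=True,
)

for i, example in enumerate(ds):
    print(example["text"][:200])  # assumed "text" column
    if i >= 2:
        break
```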
Switch-Transformers release Collection This release includes various MoE (Mixture of Experts) models based on the T5 architecture. The base models use 8 to 256 experts. • 9 items • Updated Jul 31 • 15
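A minimal sketch of running one of the released Switch-Transformers checkpoints with `transformers`; the repo id `google/switch-base-8` (the 8-expert base model) is an assumption based on the collection description.

```python
# Sketch: load an 8-expert Switch-Transformers base model (assumed repo id)
# and run it through the standard T5-style text-to-text interface.
from transformers import AutoTokenizer, SwitchTransformersForConditionalGeneration

model_id = "google/switch-base-8"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = SwitchTransformersForConditionalGeneration.from_pretrained(model_id)

inputs = tokenizer(
    "translate English to German: The house is wonderful.",
    return_tensors="pt",
)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```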
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published 13 days ago • 20
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 50
Model Depot Collection Leading generative models packaged in OpenVino format optimized for use on AI PCs • 50 items • Updated 21 days ago • 5
Functionary V3.2 Collection Fine-tuning Llama-3.1 using our own prompt template for function calling • 3 items • Updated Oct 16 • 1
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 13 days ago • 168
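A minimal sketch of loading one of the compact SmolLM2 checkpoints for on-device inference; the repo id `HuggingFaceTB/SmolLM2-135M-Instruct` is an assumption, the collection itself only lists the 1.7B, 360M, and 135M sizes.

```python
# Sketch: run the smallest SmolLM2 instruct variant (assumed repo id) locally.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM2-135M-Instruct"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

messages = [{"role": "user", "content": "Name three on-device uses for a small LLM."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```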
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published 27 days ago • 42
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated 16 days ago • 17