Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12 • 57
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community Paper • 2408.08291 • Published Aug 15 • 9
Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields Paper • 2408.03822 • Published Aug 7 • 9
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6 • 33
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models Paper • 2408.02085 • Published Aug 4 • 17
Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers Paper • 2408.05506 • Published Aug 10 • 8
HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors Paper • 2408.06019 • Published Aug 12 • 13
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 114
OpenResearcher: Unleashing AI for Accelerated Scientific Research Paper • 2408.06941 • Published Aug 13 • 29
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 12 items • Updated May 28 • 136
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines Paper • 2408.01050 • Published Aug 2 • 8
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech Paper • 2306.14145 • Published Jun 25, 2023 • 1
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs Paper • 2408.07055 • Published Aug 13 • 65
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech Paper • 2008.00768 • Published Aug 3, 2020 • 1
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization Paper • 2402.01692 • Published Jan 23 • 1
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Paper • 2010.08136 • Published Oct 16, 2020 • 1
DDK: Distilling Domain Knowledge for Efficient Large Language Models Paper • 2407.16154 • Published Jul 23 • 20
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 93
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11 • 29
MUSCLE: A Model Update Strategy for Compatible LLM Evolution Paper • 2407.09435 • Published Jul 12 • 20
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Paper • 2407.02490 • Published Jul 2 • 23
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 54
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper • 2406.07394 • Published Jun 11 • 21
abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 91
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published Jun 10 • 36
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees Paper • 2406.16858 • Published Jun 24 • 1
World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13 • 36
HyperAttention: Long-context Attention in Near-Linear Time Paper • 2310.05869 • Published Oct 9, 2023 • 2
Creativity Has Left the Chat: The Price of Debiasing Language Models Paper • 2406.05587 • Published Jun 8 • 1
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7 • 26
The Geometry of Categorical and Hierarchical Concepts in Large Language Models Paper • 2406.01506 • Published Jun 3 • 3
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning Paper • 2312.01552 • Published Dec 4, 2023 • 30
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity Paper • 2305.13169 • Published May 22, 2023 • 3
Offline Regularised Reinforcement Learning for Large Language Models Alignment Paper • 2405.19107 • Published May 29 • 13
LLMs achieve adult human performance on higher-order theory of mind tasks Paper • 2405.18870 • Published May 29 • 16
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks Paper • 2402.04396 • Published Feb 6 • 1
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • 2212.09720 • Published Dec 19, 2022 • 3
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper • 2405.19327 • Published May 29 • 43
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Paper • 2405.15071 • Published May 23 • 35
SimPO: Simple Preference Optimization with a Reference-Free Reward Paper • 2405.14734 • Published May 23 • 9
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29 • 118
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9 • 6
Orca 2: Teaching Small Language Models How to Reason Paper • 2311.11045 • Published Nov 18, 2023 • 70
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29 • 46
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • Jun 4 • 69
Article Unleashing the Power of Logprobs in Language Models: A Practical Guide By Andyrasika • Jan 12 • 1
Article 💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data By alvarobartt • Dec 1, 2023 • 1
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • Apr 24 • 56