Aymeric Roucher's picture

Aymeric Roucher

m-ric

·

http://aymeric-roucher.github.io

AI & ML interests

MLE at Hugging Face 🤗 LLMs, Agents, RAG, Multimodal.

Recent Activity

liked a Space about 15 hours ago

andrewrreed/phoenix-arize-observability-demo

posted an update about 19 hours ago

Made a new app to visualize the LLM race ⇒ 𝗡𝗼 𝗘𝘂𝗿𝗼𝗽𝗲𝗮𝗻 𝗰𝗼𝗺𝗽𝗮𝗻𝘆 𝗶𝗻 𝘁𝗵𝗲 𝘁𝗼𝗽 𝟭𝟬 🇪🇺❌ See the app here 👉 https://huggingface.co/spaces/m-ric/llm-race-to-the-top I've adapted an app by @andrewrreed that tracks progress of LLMs (https://huggingface.co/spaces/andrewrreed/closed-vs-open-arena-elo), on the Chatbot Arena leaderboard, to compare companies from different countries. The outcome is quite sad, as a Frenchman and European. The top 10 is exclusively US 🇺🇸 and Chinese 🇨🇳 companies (after great Chinese LLM releases recently, like the Qwen2.5 series), with the notable exception of Mistral AI 🇫🇷. American companies are making fast progress, Chinese ones even faster. Europe is at risk of being left behind. And the EU AI Act hasn't even come into force yet to slow down the EU market. We need to wake up 😬 ⚠️ Caution: This Chatbot Arena ELO ranking is not the most accurate, especially at high scores like this, because LLM makers can game it to some extent.

upvoted an article about 24 hours ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

View all activity

Articles

Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge

Our Transformers Code Agent beats the GAIA benchmark!

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

License to Call: Introducing Transformers Agents 2.0

Open-source LLMs as LangChain Agents

Organizations

m-ric's activity

upvoted an article about 24 hours ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

By

•

2 days ago

• 17

upvoted an article 3 days ago

Article

Halo: Open Source Health Tracking with Wearables

By

•

4 days ago

• 72

upvoted an article 4 days ago

Article

Decoding Strategies in Large Language Models

By

•

25 days ago

• 38

upvoted a paper 8 days ago

Watermark Anything with Localized Messages

Paper • 2411.07231 • Published 12 days ago • 19

upvoted 2 papers 11 days ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published 18 days ago • 63

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 16 days ago • 108

upvoted an article 11 days ago

Article

Hugging Face Welcomes the Qwen2.5-Coder Series

By

•

11 days ago

• 6

upvoted a paper 15 days ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published 23 days ago • 48

upvoted a paper 18 days ago

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published 19 days ago • 24

upvoted 2 papers 21 days ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published about 1 month ago • 199

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Paper • 2410.23168 • Published 24 days ago • 22

upvoted a paper 25 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 29 days ago • 79

upvoted a paper 26 days ago

Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers

Paper • 2409.08916 • Published Sep 13 • 2

upvoted 2 papers about 1 month ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17 • 53

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22 • 29

upvoted an article about 1 month ago

Article

A Short Summary of Chinese AI Global Expansion

Oct 3

• 15

upvoted 3 papers about 1 month ago

Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

Paper • 2401.00448 • Published Dec 31, 2023 • 28

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Paper • 2206.10789 • Published Jun 22, 2022 • 4

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17 • 30

upvoted an article about 1 month ago

Article

Welcome, Gradio 5

Oct 9

• 72