Julien Chaumond PRO

julien-c

https://huggingface.co

AI & ML interests

<3 ML/AI for everyone, building products to propel communities fwd

Articles

XetHub is joining Hugging Face!

Hugging Face partners with Wiz Research to Improve AI Security

Introducing Storage Regions on the HF Hub

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

How to train a new language model from scratch using Transformers and Tokenizers

julien-c's activity

upvoted a paper 1 day ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published 8 days ago • 43

upvoted a collection 2 days ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 3 days ago • 276

upvoted an article 2 days ago

Article

Llama can now see and run on your device - welcome Llama 3.2

4 days ago

• 122

upvoted 2 collections 3 days ago

Llama 3.2 3B & 1B GGUF Quants

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated 3 days ago • 33

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 3 days ago • 178

upvoted a paper 3 days ago

Improvements to SDXL in NovelAI Diffusion V3

Paper • 2409.15997 • Published 5 days ago • 9

upvoted a collection 5 days ago

Wonder Tools picks

Notable demo apps for exploring useful ways to capitalize on AI • 12 items • Updated 16 days ago • 9

upvoted 3 papers 5 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 10 days ago • 115

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 9 days ago • 119

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published 9 days ago • 64

upvoted an article 6 days ago

Article

Exploring the Daily Papers Page on Hugging Face

6 days ago

• 18

upvoted a collection 6 days ago

Core ML Segment Anything 2

4 items • Updated 15 days ago • 20

upvoted a collection 10 days ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 10 days ago • 192

upvoted 2 articles 10 days ago

Article

Introducing Community Tools on HuggingChat

13 days ago

• 26

Article

Introducing the SQL Console on Datasets

12 days ago

• 15

upvoted a paper 12 days ago

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published 14 days ago • 2

upvoted a paper 18 days ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 120

upvoted an article 19 days ago

Article

The Environmental Impacts of AI -- Primer

25 days ago

• 26

upvoted a paper 20 days ago

ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution

Paper • 2408.15993 • Published Aug 28 • 7

upvoted an article 24 days ago

Article

Hugging Face partners with TruffleHog to Scan for Secrets

25 days ago

• 9

upvoted 5 papers 24 days ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published about 1 month ago • 50

VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters

Paper • 2408.17253 • Published 29 days ago • 35

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Paper • 2409.00509 • Published 28 days ago • 38

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published 25 days ago • 76

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Paper • 2409.01437 • Published 26 days ago • 70

upvoted an article 26 days ago

Article

Scaling robotics datasets with video encoding

Aug 27

• 33

upvoted 4 articles about 2 months ago

Article

Introduction to ggml

Aug 13

• 95

Article

Parquet in Action: A Beginners Guide

Aug 14

• 3

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

Article

XetHub is joining Hugging Face!

Aug 8

• 77

upvoted a paper about 2 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 74

upvoted an article about 2 months ago

Article

Gradio joins Hugging Face!

Dec 21, 2021

• 3

upvoted 3 papers about 2 months ago

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

Paper • 2408.00874 • Published Aug 1 • 40

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

Meltemi: The first open Large Language Model for Greek

Paper • 2407.20743 • Published Jul 30 • 67

upvoted 3 collections about 2 months ago

Gemma Scope Release

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Aug 11 • 13

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 31 • 11

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

upvoted an article about 2 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 58

upvoted a collection 2 months ago

Research projects on top of vLLM

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29 • 12

upvoted 2 articles 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29

• 206

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 196

upvoted a collection 2 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 3 days ago • 584

upvoted 3 articles 2 months ago

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22

• 54

Article

Querying Datasets with the Datasets Explorer Chrome Extension

Jul 19

• 6

Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

Jul 19

• 17

upvoted 5 papers 2 months ago

E-BATCH: Energy-Efficient and High-Throughput RNN Batching

Paper • 2009.10656 • Published Sep 22, 2020 • 1

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 48

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 123

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Paper • 2407.07775 • Published Jul 10 • 3

upvoted an article 3 months ago

Article

The Rise of Agentic Data Generation

Jul 15

• 74

upvoted 7 papers 3 months ago

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Paper • 2407.03963 • Published Jul 4 • 15

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 80

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10 • 52

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 64

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9 • 41

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 47

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11 • 50

upvoted an article 3 months ago

Article

How to run Gemini Nano locally in your browser

Jul 11

• 42