Gabriel Martín Blázquez's picture

Gabriel Martín Blázquez

gabrielmbmb

·

https://gabrielmb.com

AI & ML interests

ML Engineer

Recent Activity

upvoted a paper 6 days ago

Thinking LLMs: General Instruction Following with Thought Generation

liked a dataset 8 days ago

microsoft/orca-agentinstruct-1M-v1

liked a model 8 days ago

numind/NuExtract-1.5-smol

View all activity

Articles

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Organizations

gabrielmbmb's activity

upvoted a paper 6 days ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14 • 16

upvoted a paper 9 days ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 11 days ago • 59

upvoted a collection 13 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 5 days ago • 229

upvoted a collection 23 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 2 days ago • 174

upvoted an article 24 days ago

Article

Code a simple RAG from scratch

By

•

25 days ago

• 8

upvoted a collection about 1 month ago

🍓 Ichigo v0.3

The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated 12 days ago • 17

upvoted 6 articles about 1 month ago

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

Oct 22

• 41

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21

• 18

Article

How to build a custom text classifier without days of human labeling

By

•

Oct 17

• 55

Article

Fixing Gradient Accumulation

Oct 16

• 41

Article

How to optimize your data labelling project with custom interfaces

By

•

Oct 16

• 18

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14

• 55

upvoted a paper about 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

upvoted an article about 2 months ago

Article

Improving Parquet Dedupe on Hugging Face Hub

Oct 5

• 31

upvoted a paper about 2 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3 • 52

upvoted an article about 2 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

By

•

Sep 27

• 35

upvoted a paper about 2 months ago

Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

Paper • 2409.12903 • Published Sep 19 • 21

upvoted a collection about 2 months ago

Useful Spaces

20 items • Updated 7 days ago • 5

upvoted an article about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 169

upvoted an article 2 months ago

Article

Exploring the Daily Papers Page on Hugging Face

Sep 23

• 39