Manuel Romero's picture

Manuel Romero PRO

mrm8488

·

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

liked a dataset about 23 hours ago

HuggingFaceTB/smoltalk

upvoted a paper 1 day ago

Hymba: A Hybrid-head Architecture for Small Language Models

View all activity

Organizations

mrm8488's activity

upvoted a paper 1 day ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published 3 days ago • 23

upvoted 3 articles about 1 month ago

Article

Allegro: Advanced Video Generation Model

By

•

Oct 22

• 55

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

By

•

Oct 21

• 27

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14

• 55

upvoted an article about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 169

upvoted 2 collections about 2 months ago

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 20 items • Updated 2 days ago • 40

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 9 days ago • 273

upvoted an article 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 203

upvoted 2 collections 3 months ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated Sep 4 • 11

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5 • 20

upvoted an article 3 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 28

upvoted a collection 3 months ago

embeddings-spanish-models 🎯

A collection with embeddings models I fine-tuned for better performance in Spanish texts. • 3 items • Updated Aug 30 • 2

upvoted 4 articles 3 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 73

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 85

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29

• 245

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 19

upvoted a collection 3 months ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46

upvoted 2 articles 4 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 59

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22

• 55

upvoted a collection 4 months ago

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22 • 41