Razvan's picture

Razvan

razvanab

·

AI & ML interests

None yet

Recent Activity

liked a Space about 15 hours ago

yeq6x/Image2Body_gradio

liked a Space about 15 hours ago

Djrango/qwen2vl-flux-mini-demo

liked a model about 15 hours ago

showlab/ShowUI-2B

View all activity

Organizations

razvanab's activity

upvoted a collection 28 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated 1 day ago • 97

upvoted a collection 2 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated about 11 hours ago • 275

upvoted a paper 2 months ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20 • 67

upvoted an article 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 273

upvoted 2 collections 2 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 201

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated about 7 hours ago • 392

upvoted a collection 3 months ago

xLAM models

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 29 days ago • 43

upvoted a collection 4 months ago

Neo-Models

Neo • 9 items • Updated May 29 • 17

upvoted 2 collections 5 months ago

InternLM2.5

14 items • Updated Sep 14 • 70

Diffusion model Spaces

315 items • Updated Oct 22 • 31

upvoted a collection 6 months ago

The SPRIGHT T2I collection

This collection contains the datasets, model, paper, and demo associated with the SPRIGHT (SPatially RIGHT) release. • 5 items • Updated Apr 2 • 5

upvoted 2 collections 7 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 14 days ago • 500

Arctic

A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated Apr 24 • 23

upvoted a paper 7 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 253

upvoted a collection 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

upvoted 2 collections 8 months ago

Pile-T5

T5 trained on the Pile with Llama Tokenizer • 4 items • Updated Jul 6 • 17

Audio Spaces

103 items • Updated 25 days ago • 11

upvoted a paper 8 months ago

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29 • 47

upvoted a collection 8 months ago

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 91

upvoted a paper 8 months ago

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Paper • 2403.13745 • Published Mar 20 • 11