Anthonny OLIME

Citaman

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

stabilityai/stable-diffusion-3.5-medium

liked a model 19 days ago

stabilityai/stable-diffusion-3.5-large

liked a model 19 days ago

HuggingFaceTB/SmolLM2-135M-Instruct

Citaman's activity

upvoted an article 2 months ago

Token Merging for fast LLM inference : Background and first trials with Mistral

Article • Apr 30 • 3

upvoted a paper 4 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

upvoted an article 5 months ago

How I train a LoRA: m3lt style training overview

Article • Jul 1 • 47

upvoted 3 papers 5 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 95

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13 • 86

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11 • 52

upvoted a collection 6 months ago

Universal token classification

Collection

Collection of universal token classification (UTC) models that can solve many information extraction tasks in a prompt-tuned manner. • 11 items • Updated Sep 10 • 12

upvoted 3 papers 6 months ago

upvoted an article 6 months ago

GPU Poor Savior: Revolutionizing Low-Bit Open Source LLMs and Cost-Effective Edge Computing

Article • May 25 • 9

upvoted 2 articles 7 months ago

Transformers

Article • Jul 2 • 5

Diffusion Models

Article • May 19 • 13

upvoted 6 papers 8 months ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

Paper • 2403.18795 • Published Mar 27 • 18

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 24

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

Paper • 2403.18818 • Published Mar 27 • 25

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27 • 52

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27 • 44

upvoted a collection 8 months ago

MGM

Collection

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3 • 46