n3rdium (Ishan Gajbhiye)

upvoted a paper 13 days ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published 16 days ago • 35

upvoted a collection about 1 month ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 9 days ago • 429

upvoted 3 papers 2 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23 • 25

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22 • 23

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Paper • 2408.09702 • Published Aug 19 • 9

upvoted 2 papers 5 months ago

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24 • 53

Semantica: An Adaptable Image-Conditioned Diffusion Model

Paper • 2405.14857 • Published May 23 • 8

upvoted 2 collections 6 months ago

RecurrentGemma Release

Collection

8 items • Updated Jul 31 • 39

Top Mini LLM

Collection

Collection of top mini llms • 5 items • Updated Oct 1 • 10

upvoted an article 6 months ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

May 3

• 13

upvoted a paper 7 months ago

EdgeFusion: On-Device Text-to-Image Generation

Paper • 2404.11925 • Published Apr 18 • 21

upvoted 2 collections 7 months ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 3 days ago • 41

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Sep 18 • 206

upvoted 7 papers 9 months ago

Machine Unlearning for Image-to-Image Generative Models

Paper • 2402.00351 • Published Feb 1 • 12

AToM: Amortized Text-to-Mesh using 2D Diffusion

Paper • 2402.00867 • Published Feb 1 • 10

SymbolicAI: A framework for logic-based approaches combining generative models and solvers

Paper • 2402.00854 • Published Feb 1 • 19

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1 • 21

Efficient Exploration for LLMs

Paper • 2402.00396 • Published Feb 1 • 21

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Paper • 2402.00769 • Published Feb 1 • 20

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31 • 59

Ishan Gajbhiye

AI & ML interests

Organizations

n3rdium's activity

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Llama 3.2

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Scalable Autoregressive Image Generation with Mamba

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Semantica: An Adaptable Image-Conditioned Diffusion Model

RecurrentGemma Release

Top Mini LLM

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

EdgeFusion: On-Device Text-to-Image Generation

xLAM models

Qwen1.5

Machine Unlearning for Image-to-Image Generative Models

AToM: Amortized Text-to-Mesh using 2D Diffusion

SymbolicAI: A framework for logic-based approaches combining generative models and solvers

Can Large Language Models Understand Context?

Efficient Exploration for LLMs

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research