𝒕𝒂𝒏𝒗𝒊𝒓's picture

𝒕𝒂𝒏𝒗𝒊𝒓

Tanvir1337

·

AI & ML interests

Deep Learning, Generative Adversarial Networks, Transformer, Diffusion, SOTA Foundation Models

Recent Activity

reacted to Symbol-LLM's post with 🚀 about 1 hour ago

updated a dataset about 4 hours ago

Tanvir1337/CrackStation-password-list

updated a dataset about 4 hours ago

Tanvir1337/CrackStation-password-list

Organizations

Tanvir1337's activity

upvoted a collection 11 days ago

GPT-Generated Unified Format (GGUF)

ease of reading • 17 items • Updated 2 days ago • 10

upvoted a collection 12 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 4 days ago • 70

upvoted 3 collections 20 days ago

Flux LoRA Collections

Flux THE LoRA • 80 items • Updated about 4 hours ago • 26

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 5 hours ago • 172

upvoted a paper 21 days ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a collection 21 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95

upvoted a paper 23 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 50

upvoted a collection 23 days ago

D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc

Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 57 items • Updated about 14 hours ago • 3

upvoted a collection 24 days ago

GGUF Image Model Quants

List of GGUF quants for text to image base models. • 9 items • Updated 23 days ago • 12

upvoted a paper 27 days ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 80

upvoted 4 articles about 1 month ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By

•

Oct 20

• 30

Article

Mamba Out

By

•

Oct 18

• 8

Article

AI is turning nuclear: a review

By

•

Oct 20

• 11

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

By

•

Oct 21

• 27

upvoted a collection about 1 month ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 89

upvoted a paper about 1 month ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

upvoted an article about 1 month ago

Article

Accelerate 1.0.0

Sep 13

• 50

upvoted an article about 2 months ago

Article

Practical 3D Asset Generation: A Step-by-Step Guide

Aug 1, 2023

• 5

upvoted a collection about 2 months ago

Emu3

5 items • Updated 27 days ago • 65