Diwank Tomer PRO

diwank · https://diwank.name

Recent Activity

updated a collection about 19 hours ago

liked a dataset about 19 hours ago

HuggingFaceTB/smoltalk

Articles

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

diwank's activity

upvoted a paper about 21 hours ago

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published 2 days ago • 8

upvoted a collection 3 days ago

InternVL 2.5

Better than InternVL 2.0 • 16 items • Updated about 23 hours ago • 7

upvoted 2 papers 5 days ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published 9 days ago • 52

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 16 days ago • 48

upvoted a paper 7 days ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published 9 days ago • 66

upvoted a paper 8 days ago

PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

Paper • 2411.08307 • Published 10 days ago • 6

upvoted 2 papers 10 days ago

Watermark Anything with Localized Messages

Paper • 2411.07231 • Published 12 days ago • 19

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published 11 days ago • 24

upvoted 2 papers 12 days ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 16 days ago • 34

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 91

upvoted a collection 15 days ago

LipSync and Face Operations

10 items • Updated 16 days ago • 24

upvoted a paper 23 days ago

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13 • 3

upvoted an article 24 days ago

Decoding Strategies in Large Language Models

Article • 25 days ago • 38

upvoted a collection 24 days ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11 • 6

upvoted a paper 24 days ago

The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

Paper • 2304.01412 • Published Apr 3, 2023 • 2

upvoted a collection 28 days ago

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text • 6 items • Updated Oct 21 • 1

upvoted a paper 28 days ago

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10 • 3

upvoted a collection 28 days ago

Mono-InternVL

A Pioneering Monolithic MLLM • 2 items • Updated Oct 21 • 4

upvoted an article about 1 month ago

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

Article • Oct 21 • 27

upvoted a paper about 1 month ago

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Paper • 2410.11190 • Published Oct 15 • 20