Fynn Kröger

fynnkroeger

AI & ML interests

None yet

Organizations

None yet

fynnkroeger's activity

upvoted a paper about 3 hours ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published 2 days ago • 32

upvoted a paper 11 days ago

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published 12 days ago • 28

upvoted a paper 27 days ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 88

upvoted a paper about 1 month ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17 • 27

upvoted 2 papers about 2 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24 • 41

MaskBit: Embedding-free Image Generation via Bit Tokens

Paper • 2409.16211 • Published Sep 24 • 16

upvoted a paper 2 months ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16 • 38

upvoted 8 papers 3 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters

Paper • 2408.17253 • Published Aug 30 • 35

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29 • 92

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22 • 25

Towards Conversational Diagnostic AI

Paper • 2401.05654 • Published Jan 11 • 16

MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning

Paper • 2408.11001 • Published Aug 20 • 11

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20 • 56

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15 • 44

upvoted 4 papers 4 months ago

POA: Pre-training Once for Models of All Sizes

Paper • 2408.01031 • Published Aug 2 • 26

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 107

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16 • 54

upvoted a paper 5 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5 • 52