Michael Barry's picture

Michael Barry

MichaelBarryUK

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

upvoted a paper about 19 hours ago

upvoted a paper about 19 hours ago

Organizations

None yet

MichaelBarryUK's activity

upvoted a paper about 14 hours ago

Stylecodes: Encoding Stylistic Information For Image Generation

Paper • 2411.12811 • Published 2 days ago • 6

upvoted 4 papers about 19 hours ago

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published 1 day ago • 23

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published 11 days ago • 9

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published 4 days ago • 12

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published 5 days ago • 35

upvoted 7 papers 2 days ago

Drowning in Documents: Consequences of Scaling Reranker Inference

Paper • 2411.11767 • Published 3 days ago • 16

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published 10 days ago • 15

SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers

Paper • 2411.10510 • Published 6 days ago • 8

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published 6 days ago • 37

AnimateAnything: Consistent and Controllable Animation for Video Generation

Paper • 2411.10836 • Published 5 days ago • 18

Generative World Explorer

Paper • 2411.11844 • Published 3 days ago • 55

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Paper • 2411.10669 • Published 6 days ago • 9

upvoted a paper 4 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published 6 days ago • 88

upvoted 4 papers 8 days ago

Can sparse autoencoders be used to decompose and interpret steering vectors?

Paper • 2411.08790 • Published 8 days ago • 8

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published 9 days ago • 13

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published 9 days ago • 24

Hardware and Software Platform Inference

Paper • 2411.05197 • Published 14 days ago • 3

upvoted 3 papers 10 days ago

Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published 13 days ago • 6

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published 12 days ago • 18

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published 10 days ago • 28