Inui

Norm · https://normxu.github.io/

AI & ML interests

Video Diffusion; Large Language Model; Object Detection; OCR

Recent Activity

upvoted a paper 1 day ago

Open-Sora Plan: Open-Source Large Video Generation Model

updated a collection 1 day ago

Image / Video Gen

Norm's activity

upvoted a paper 1 day ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 6 days ago • 22

updated 3 collections 1 day ago

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 25 items • Updated 1 day ago • 6

VAE

4 items • Updated 1 day ago

TI2V Research

7 items • Updated 1 day ago

upvoted a paper 1 day ago

Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations

Paper • 2410.10792 • Published Oct 14 • 27

upvoted a paper 5 days ago

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published 16 days ago • 17

updated a collection 8 days ago

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 25 items • Updated 1 day ago • 6

upvoted a paper 8 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published 12 days ago • 42

updated a collection 8 days ago

TI2V Research

7 items • Updated 1 day ago

liked a model 8 days ago

Djrango/Qwen2vl-Flux

Text-to-Image • Updated 7 days ago • 358

upvoted a paper 9 days ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published 12 days ago • 55

updated a collection 11 days ago

Multimodal Language Model

What matters besides the data recipe when training a multimodal language model? • 25 items • Updated 11 days ago • 1

upvoted a paper 11 days ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published 13 days ago • 40

liked a model 11 days ago

allenai/Llama-3.1-Tulu-3-8B

Text Generation • Updated 8 days ago • 8.72k • 85

updated a collection 14 days ago

TI2V Research

7 items • Updated 1 day ago

updated a collection 17 days ago

Video2Video

1 item • Updated 17 days ago