Sherman Chann

152334H

https://152334H.github.io

Recent Activity

upvoted a paper about 2 months ago

updated a collection about 2 months ago

mycollection1

152334H's activity

upvoted 3 papers about 2 months ago

upvoted a paper 2 months ago

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13 • 47

upvoted 5 papers 3 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29 • 52

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Paper • 2409.01944 • Published Sep 3 • 44

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22 • 50

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22 • 89

upvoted 9 papers 4 months ago

ShieldGemma: Generative AI Content Moderation Based on Gemma

Paper • 2407.21772 • Published Jul 31 • 13

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 107

Video-to-Audio Generation with Hidden Alignment

Paper • 2407.07464 • Published Jul 10 • 16

Controlling Space and Time with Diffusion Models

Paper • 2407.07860 • Published Jul 10 • 16

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10 • 40

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 67

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11 • 50

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11 • 31

GTA: A Benchmark for General Tool Agents

Paper • 2407.08713 • Published Jul 11 • 14

upvoted 2 papers 5 months ago

MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions

Paper • 2407.06358 • Published Jul 8 • 18

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4 • 16