-
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization
Paper • 2410.12957 • Published • 7 -
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Paper • 2410.12183 • Published • 3 -
Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key
Paper • 2410.10210 • Published • 3 -
MedMobile: A mobile-sized language model with expert-level clinical capabilities
Paper • 2410.09019 • Published • 8
talisman
smithxcelrot
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet