MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization • Paper • 2410.12957 • Published 16 days ago • 7
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration • Paper • 2410.12183 • Published 16 days ago • 3
Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key • Paper • 2410.10210 • Published 18 days ago • 3
MedMobile: A mobile-sized language model with expert-level clinical capabilities • Paper • 2410.09019 • Published 21 days ago • 8