VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Paper • 2407.12594 • Published Jul 17 • 19
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 5 days ago • 486
CAST: Character labeling in Animation using Self-supervision by Tracking Paper • 2201.07619 • Published Jan 19, 2022 • 2