LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 8 days ago • 94
view post Post 2495 Reply Let’s dive into the exciting releases from the Chinese community last week 🔥🚀More details 👉 https://huggingface.co/zh-ai-communityCode model:✨Qwen 2.5 coder by Alibaba Qwen Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f✨OpenCoder by InflyAI - Fully open code model🙌 infly/opencoder-672cec44bbb86c39910fb55eImage model: ✨Hunyuan3D-1.0 by Tencent tencent/Hunyuan3D-1MLLM: ✨JanusFlow by DeepSeek deepseek-ai/JanusFlow-1.3B deepseek-ai/JanusFlow-1.3B✨Mono-InternVL-2B by OpenGVlab OpenGVLab/Mono-InternVL-2BVideo model: ✨CogVideoX 1.5 by ChatGLM THUDM/CogVideoX1.5-5B-SATAudio model: ✨Fish Agent by FishAudio fishaudio/fish-agent-v0.1-3bDataset: ✨OPI dataset by BAAIBeijing BAAI/OPI 🔥 10 10 👀 4 4 🚀 2 2 +
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 16 days ago • 109
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models Paper • 2411.05830 • Published 18 days ago • 20
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 19 days ago • 178
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models Paper • 2411.07140 • Published 12 days ago • 33
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Paper • 2411.07232 • Published 12 days ago • 60
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Paper • 2411.07199 • Published 12 days ago • 43