MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Paper • 2409.05840 • Published 26 days ago • 45
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published Sep 4 • 72
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3 • 78
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing Paper • 2406.05534 • Published Jun 8 • 3
CLAIR and APO Collection Data and Models for the paper "Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment" • 8 items • Updated Aug 14 • 3
Word Sense Linking Collection Word Sense Linking is the task designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory. • 5 items • Updated Aug 5 • 3
Cerebras DocChat Collection GPT-4 Level Conversational QA Trained In a Few Hours • 5 items • Updated Aug 21 • 3
Llama Scope Collection An Open-Source Suite of 416 Sparse Autoencoders on Llama-3.1-8B • 1 item • Updated Sep 3 • 4
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper • 2401.08417 • Published Jan 16 • 31
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17 • 18
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning Paper • 2206.08657 • Published Jun 17, 2022 • 2
Korean Datasets I've released so far. Collection 지금까지 업로드한 한국어 데이터셋 콜렉션입니다. • 8 items • Updated May 24 • 16
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24 • 55
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • Jun 23 • 33
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 102
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 65
ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization Paper • 2402.09320 • Published Feb 14 • 6
Journal Club Collection Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21 • 26
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Paper • 2309.06380 • Published Sep 12, 2023 • 32
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records Paper • 2308.14089 • Published Aug 27, 2023 • 28