Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published Oct 24 • 40
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published Oct 24 • 42
KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model Paper • 2311.11564 • Published Nov 20, 2023 • 1
Harnessing the Power of David against Goliath: Exploring Instruction Data Generation without Using Closed-Source Models Paper • 2308.12711 • Published Aug 24, 2023 • 1
Rethinking Negative Instances for Generative Named Entity Recognition Paper • 2402.16602 • Published Feb 26 • 3
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure Paper • 2406.17276 • Published Jun 25
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM Paper • 2408.12076 • Published Aug 22 • 12
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published Oct 3 • 10
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published Oct 3 • 10
GNER Collection We introduce GNER, a Generative Named Entity Recognition framework, which demonstrates enhanced zero-shot capabilities across unseen entity domains. • 7 items • Updated Feb 28 • 7
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Paper • 2401.17093 • Published Jan 30 • 19
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models Paper • 2309.09506 • Published Sep 18, 2023 • 14
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch Paper • 2309.10706 • Published Sep 19, 2023 • 16
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch Paper • 2309.10706 • Published Sep 19, 2023 • 16
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation Paper • 2305.09515 • Published May 16, 2023 • 2