levanduc
's Collections
LLM-Papers
updated
PDFTriage: Question Answering over Long, Structured Documents
Paper
•
2309.08872
•
Published
•
53
Adapting Large Language Models via Reading Comprehension
Paper
•
2309.09530
•
Published
•
77
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
•
2310.09263
•
Published
•
39
Context-Aware Meta-Learning
Paper
•
2310.10971
•
Published
•
16
Data-Centric Financial Large Language Models
Paper
•
2310.17784
•
Published
•
14
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
Modeling Likewise
Paper
•
2310.19019
•
Published
•
9
Contrastive Chain-of-Thought Prompting
Paper
•
2311.09277
•
Published
•
34
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
70
Context Tuning for Retrieval Augmented Generation
Paper
•
2312.05708
•
Published
•
16
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
56
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
79
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
181
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
Paper
•
2401.04398
•
Published
•
21
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
•
2401.08967
•
Published
•
28
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper
•
2401.15024
•
Published
•
69
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
•
2402.05140
•
Published
•
20
AutoMathText: Autonomous Data Selection with Language Models for
Mathematical Texts
Paper
•
2402.07625
•
Published
•
11
How to Train Data-Efficient LLMs
Paper
•
2402.09668
•
Published
•
40
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
•
2402.10986
•
Published
•
77
Knowledge Fusion of Large Language Models
Paper
•
2401.10491
•
Published
•
3
SaulLM-7B: A pioneering Large Language Model for Law
Paper
•
2403.03883
•
Published
•
75
RAFT: Adapting Language Model to Domain Specific RAG
Paper
•
2403.10131
•
Published
•
67
TnT-LLM: Text Mining at Scale with Large Language Models
Paper
•
2403.12173
•
Published
•
19
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Paper
•
2403.13372
•
Published
•
62
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small
Reference Models
Paper
•
2405.20541
•
Published
•
21
Towards a Personal Health Large Language Model
Paper
•
2406.06474
•
Published
•
18
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Paper
•
2406.09170
•
Published
•
24
Instruction Pre-Training: Language Models are Supervised Multitask
Learners
Paper
•
2406.14491
•
Published
•
85
The FineWeb Datasets: Decanting the Web for the Finest Text Data at
Scale
Paper
•
2406.17557
•
Published
•
86
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
•
2406.19215
•
Published
•
29
Show Less, Instruct More: Enriching Prompts with Definitions and
Guidelines for Zero-Shot NER
Paper
•
2407.01272
•
Published
•
8
LETS-C: Leveraging Language Embedding for Time Series Classification
Paper
•
2407.06533
•
Published
•
2
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Paper
•
2407.09413
•
Published
•
9
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Paper
•
2407.12854
•
Published
•
29
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
•
2407.18961
•
Published
•
39
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal
Domain
Paper
•
2407.19584
•
Published
•
62
Visual Riddles: a Commonsense and World Knowledge Challenge for Large
Vision and Language Models
Paper
•
2407.19474
•
Published
•
23
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
•
2407.18248
•
Published
•
31
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper
•
2407.20183
•
Published
•
38
LAMBDA: A Large Model Based Data Agent
Paper
•
2407.17535
•
Published
•
34
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Paper
•
2407.15017
•
Published
•
33
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
•
2407.03502
•
Published
•
48
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Paper
•
2408.14717
•
Published
•
24
Foundation Models for Music: A Survey
Paper
•
2408.14340
•
Published
•
42
Efficient Detection of Toxic Prompts in Large Language Models
Paper
•
2408.11727
•
Published
•
12
Sapiens: Foundation for Human Vision Models
Paper
•
2408.12569
•
Published
•
89
Controllable Text Generation for Large Language Models: A Survey
Paper
•
2408.12599
•
Published
•
63
TableBench: A Comprehensive and Complex Benchmark for Table Question
Answering
Paper
•
2408.09174
•
Published
•
51
OLMoE: Open Mixture-of-Experts Language Models
Paper
•
2409.02060
•
Published
•
77
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Paper
•
2409.05840
•
Published
•
45
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
•
2409.02795
•
Published
•
72
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
Language Models
Paper
•
2409.11136
•
Published
•
21
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
135
A Controlled Study on Long Context Extension and Generalization in LLMs
Paper
•
2409.12181
•
Published
•
43
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic
reasoning
Paper
•
2409.12183
•
Published
•
36
A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language
Models: An Experimental Analysis up to 405B
Paper
•
2409.11055
•
Published
•
16
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector
Retrieval
Paper
•
2409.10516
•
Published
•
39
Guiding Vision-Language Model Selection for Visual Question-Answering
Across Tasks, Domains, and Knowledge Types
Paper
•
2409.09269
•
Published
•
7
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Paper
•
2409.15277
•
Published
•
34
HelloBench: Evaluating Long Text Generation Capabilities of Large
Language Models
Paper
•
2409.16191
•
Published
•
41
Boosting Healthcare LLMs Through Retrieved Context
Paper
•
2409.15127
•
Published
•
19
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Paper
•
2409.17481
•
Published
•
46
Instruction Following without Instruction Tuning
Paper
•
2409.14254
•
Published
•
27
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Paper
•
2409.20566
•
Published
•
52
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper
•
2410.00531
•
Published
•
29
Law of the Weakest Link: Cross Capabilities of Large Language Models
Paper
•
2409.19951
•
Published
•
53
Embodied-RAG: General non-parametric Embodied Memory for Retrieval and
Generation
Paper
•
2409.18313
•
Published
•
3
Not All LLM Reasoners Are Created Equal
Paper
•
2410.01748
•
Published
•
27
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
Foundation Models
Paper
•
2410.02740
•
Published
•
52
Video Instruction Tuning With Synthetic Data
Paper
•
2410.02713
•
Published
•
37
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Paper
•
2410.03017
•
Published
•
25
LLMs Know More Than They Show: On the Intrinsic Representation of LLM
Hallucinations
Paper
•
2410.02707
•
Published
•
48
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for
Data-Driven Scientific Discovery
Paper
•
2410.05080
•
Published
•
19
Personalized Visual Instruction Tuning
Paper
•
2410.07113
•
Published
•
69
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for
Enhanced Following of Instructions with Multiple Constraints
Paper
•
2410.06458
•
Published
•
8
MLE-bench: Evaluating Machine Learning Agents on Machine Learning
Engineering
Paper
•
2410.07095
•
Published
•
6
MathCoder2: Better Math Reasoning from Continued Pretraining on
Model-translated Mathematical Code
Paper
•
2410.08196
•
Published
•
44
MLLM as Retriever: Interactively Learning Multimodal Retrieval for
Embodied Agents
Paper
•
2410.03450
•
Published
•
36
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge
with Curriculum Preference Learning
Paper
•
2410.06508
•
Published
•
10
Vector-ICL: In-context Learning with Continuous Vector Representations
Paper
•
2410.05629
•
Published
•
3
Baichuan-Omni Technical Report
Paper
•
2410.08565
•
Published
•
84
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via
Inference-time Hybrid Information Structurization
Paper
•
2410.08815
•
Published
•
42
From Generalist to Specialist: Adapting Vision Language Models via
Task-Specific Visual Instruction Tuning
Paper
•
2410.06456
•
Published
•
35
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Paper
•
2410.10563
•
Published
•
37
Thinking LLMs: General Instruction Following with Thought Generation
Paper
•
2410.10630
•
Published
•
17
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Paper
•
2410.11096
•
Published
•
12
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language
Models
Paper
•
2410.13085
•
Published
•
20
WorldCuisines: A Massive-Scale Benchmark for Multilingual and
Multicultural Visual Question Answering on Global Cuisines
Paper
•
2410.12705
•
Published
•
29
TransAgent: Transfer Vision-Language Foundation Models with
Heterogeneous Agent Collaboration
Paper
•
2410.12183
•
Published
•
3
UCFE: A User-Centric Financial Expertise Benchmark for Large Language
Models
Paper
•
2410.14059
•
Published
•
53
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial
Samples
Paper
•
2410.14669
•
Published
•
35
Improve Vision Language Model Chain-of-thought Reasoning
Paper
•
2410.16198
•
Published
•
17
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Paper
•
2410.17215
•
Published
•
12
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Paper
•
2410.19168
•
Published
•
19
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum
Reinforcement Learning
Paper
•
2411.02337
•
Published
•
36
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge
in RAG Systems
Paper
•
2411.02959
•
Published
•
64
Self-Consistency Preference Optimization
Paper
•
2411.04109
•
Published
•
14