Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 8
ProAgent: From Robotic Process Automation to Agentic Process Automation Paper • 2311.10751 • Published Nov 2, 2023 • 8
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning Paper • 2311.11501 • Published Nov 20, 2023 • 33
Instruction-Following Evaluation for Large Language Models Paper • 2311.07911 • Published Nov 14, 2023 • 19
DiLoCo: Distributed Low-Communication Training of Language Models Paper • 2311.08105 • Published Nov 14, 2023 • 14
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Paper • 2311.07689 • Published Nov 13, 2023 • 7
LASER: LLM Agent with State-Space Exploration for Web Navigation Paper • 2309.08172 • Published Sep 15, 2023 • 11
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval Paper • 2310.15511 • Published Oct 24, 2023 • 4
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 96
A Zero-Shot Language Agent for Computer Control with Structured Reflection Paper • 2310.08740 • Published Oct 12, 2023 • 14
Ranking LLM-Generated Loop Invariants for Program Verification Paper • 2310.09342 • Published Oct 13, 2023 • 2
ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations Paper • 2203.13602 • Published Mar 25, 2022 • 1
GoLLIE Collection We present GoLLIE, a Large Language Model trained to follow annotation guidelines that outperforms previous approaches on zero-shot IE. • 4 items • Updated Mar 11 • 17
HeaP: Hierarchical Policies for Web Actions using LLMs Paper • 2310.03720 • Published Oct 5, 2023 • 6
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation Paper • 2310.03214 • Published Oct 5, 2023 • 17
How FaR Are Large Language Models From Agents with Theory-of-Mind? Paper • 2310.03051 • Published Oct 4, 2023 • 34
LMDX: Language Model-based Document Information Extraction and Localization Paper • 2309.10952 • Published Sep 19, 2023 • 64
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer Paper • 2308.06873 • Published Aug 14, 2023 • 25
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 239
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Paper • 2307.08581 • Published Jul 17, 2023 • 27
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80
Building Cooperative Embodied Agents Modularly with Large Language Models Paper • 2307.02485 • Published Jul 5, 2023 • 11
GLIMMER: generalized late-interaction memory reranker Paper • 2306.10231 • Published Jun 17, 2023 • 7
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models Paper • 2306.11698 • Published Jun 20, 2023 • 12