Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning Paper • 2305.13660 • Published May 23, 2023
ConFit: Improving Resume-Job Matching using Data Augmentation and Contrastive Learning Paper • 2401.16349 • Published Jan 29
LIONs: An Empirically Optimized Approach to Align Language Models Paper • 2407.06542 • Published Jul 9
Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning Paper • 2410.02052 • Published Oct 2 • 9
Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning Paper • 2410.02052 • Published Oct 2 • 9
Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning Paper • 2410.02052 • Published Oct 2 • 9 • 2
LION-datasets Collection Datasets used to train the LION pipeline. Paper: https://arxiv.org/abs/2407.06542; Code: https://github.com/Columbia-NLP-Lab/LionAlignment • 9 items • Updated Jul 10
LION-series Collection Models trained using the LION pipeline. Paper: https://arxiv.org/abs/2407.06542; Code: https://github.com/Columbia-NLP-Lab/LionAlignment • 6 items • Updated Jul 10