SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
Abstract
Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose SOTOPIA-π, an interactive learning method that improves the social intelligence of language agents. The method applies behavior cloning and self-reinforcement training to social interaction data filtered according to large language model (LLM) ratings. We show that our training method allows a 7B LLM to reach the social goal completion ability of an expert model (a GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers a difficulty in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of language agents trained specifically for social interaction.
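As a rough illustration of the data-filtering idea in the abstract, the sketch below keeps only interaction episodes whose LLM-assigned rating clears a threshold, then separates them into a behavior-cloning set (expert episodes) and a self-reinforcement set (the agent's own episodes). The `Episode` structure, the `filter_episodes` helper, the 0-10 rating scale, and the threshold of 7.0 are all assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Episode:
    """One simulated social interaction (hypothetical structure)."""
    source: str          # "expert" (e.g. a GPT-4-based agent) or "self" (the agent being trained)
    dialogue: List[str]  # turns of the conversation
    rating: float        # goal-completion score from an LLM rater (0-10 scale assumed)


def filter_episodes(episodes: List[Episode], source: str, min_rating: float) -> List[Episode]:
    """Keep episodes from `source` whose LLM rating is at least `min_rating`."""
    return [e for e in episodes if e.source == source and e.rating >= min_rating]


# Toy data: ratings here are made up for the example.
episodes = [
    Episode("expert", ["Could we split the bill?", "Sure, that works."], 9.0),
    Episode("expert", ["Give me the book.", "No."], 3.0),
    Episode("self", ["Can I borrow your notes?", "Of course."], 8.0),
    Episode("self", ["I want it.", "Why?"], 5.0),
]

THRESHOLD = 7.0  # assumed cutoff, not taken from the paper

# Behavior cloning would fine-tune on highly rated expert episodes;
# self-reinforcement would fine-tune on the agent's own highly rated episodes.
bc_data = filter_episodes(episodes, "expert", THRESHOLD)
sr_data = filter_episodes(episodes, "self", THRESHOLD)

print(len(bc_data), len(sr_data))
```

In a sketch like this, the filtered sets would then serve as supervised fine-tuning data; how the ratings are produced and how the cutoff is chosen are the parts the abstract leaves to the full paper.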
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs (2024)
- Self-Rewarding Language Models (2024)
- Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards (2024)
- Academically intelligent LLMs are not necessarily socially intelligent (2024)
- Large Language Model-based Human-Agent Collaboration for Complex Task Solving (2024)