AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
Abstract
Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core MetaAgent with the AgentToken strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.
Community
AgentStore is a scalable platform that integrates diverse digital agents to automate complex computer tasks. It introduces a MetaAgent with an AgentToken strategy for efficient management and coordination. This system significantly improves agentic task performance, achieving stunning success rate on benchmarks like OSWorld, demonstrating enhanced specialization and generalization capabilities for digital assistants.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning (2024)
- Agent S: An Open Agentic Framework that Uses Computers Like a Human (2024)
- MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration (2024)
- HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale (2024)
- Agent-Oriented Planning in Multi-Agent Systems (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper