AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Jia, Chengyou, Luo, Minnan, Dang, Zhuohang, Sun, Qiushi, Xu, Fangzhi, Hu, Junlin, Xie, Tianbao, Wu, Zhiyong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Jia, Chengyou
Luo, Minnan
Dang, Zhuohang
Sun, Qiushi
Xu, Fangzhi
Hu, Junlin
Xie, Tianbao
Wu, Zhiyong
description Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core \textbf{MetaAgent} with the \textbf{AgentToken} strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.
doi_str_mv 10.48550/arxiv.2410.18603
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2410_18603</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2410_18603</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2410_186033</originalsourceid><addsrcrecordid>eNqFjrsOgkAQRbexMOoHWDk_IIKAIXaGqFhjT0YcyCbLLtldjPr1jsTe6s7j3OQIsYzCIMnSNNygfcpHsE34EGW7MJ4KOrSkfemNpT2UNSq8KYKL9tRa9NJoMA0U5MkaBskMDsYGh4Oyp1qikm-6w5m_lmfnITddP3CDEcc7aj8XkwaVo8UvZ2J1Ol7zYj0KVb2VHdpX9RWrRrH4P_EB_ldEwQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant</title><source>arXiv.org</source><creator>Jia, Chengyou ; Luo, Minnan ; Dang, Zhuohang ; Sun, Qiushi ; Xu, Fangzhi ; Hu, Junlin ; Xie, Tianbao ; Wu, Zhiyong</creator><creatorcontrib>Jia, Chengyou ; Luo, Minnan ; Dang, Zhuohang ; Sun, Qiushi ; Xu, Fangzhi ; Hu, Junlin ; Xie, Tianbao ; Wu, Zhiyong</creatorcontrib><description>Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core \textbf{MetaAgent} with the \textbf{AgentToken} strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.</description><identifier>DOI: 10.48550/arxiv.2410.18603</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Robotics</subject><creationdate>2024-10</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2410.18603$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2410.18603$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Jia, Chengyou</creatorcontrib><creatorcontrib>Luo, Minnan</creatorcontrib><creatorcontrib>Dang, Zhuohang</creatorcontrib><creatorcontrib>Sun, Qiushi</creatorcontrib><creatorcontrib>Xu, Fangzhi</creatorcontrib><creatorcontrib>Hu, Junlin</creatorcontrib><creatorcontrib>Xie, Tianbao</creatorcontrib><creatorcontrib>Wu, Zhiyong</creatorcontrib><title>AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant</title><description>Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core \textbf{MetaAgent} with the \textbf{AgentToken} strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrsOgkAQRbexMOoHWDk_IIKAIXaGqFhjT0YcyCbLLtldjPr1jsTe6s7j3OQIsYzCIMnSNNygfcpHsE34EGW7MJ4KOrSkfemNpT2UNSq8KYKL9tRa9NJoMA0U5MkaBskMDsYGh4Oyp1qikm-6w5m_lmfnITddP3CDEcc7aj8XkwaVo8UvZ2J1Ol7zYj0KVb2VHdpX9RWrRrH4P_EB_ldEwQ</recordid><startdate>20241024</startdate><enddate>20241024</enddate><creator>Jia, Chengyou</creator><creator>Luo, Minnan</creator><creator>Dang, Zhuohang</creator><creator>Sun, Qiushi</creator><creator>Xu, Fangzhi</creator><creator>Hu, Junlin</creator><creator>Xie, Tianbao</creator><creator>Wu, Zhiyong</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241024</creationdate><title>AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant</title><author>Jia, Chengyou ; Luo, Minnan ; Dang, Zhuohang ; Sun, Qiushi ; Xu, Fangzhi ; Hu, Junlin ; Xie, Tianbao ; Wu, Zhiyong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2410_186033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Jia, Chengyou</creatorcontrib><creatorcontrib>Luo, Minnan</creatorcontrib><creatorcontrib>Dang, Zhuohang</creatorcontrib><creatorcontrib>Sun, Qiushi</creatorcontrib><creatorcontrib>Xu, Fangzhi</creatorcontrib><creatorcontrib>Hu, Junlin</creatorcontrib><creatorcontrib>Xie, Tianbao</creatorcontrib><creatorcontrib>Wu, Zhiyong</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jia, Chengyou</au><au>Luo, Minnan</au><au>Dang, Zhuohang</au><au>Sun, Qiushi</au><au>Xu, Fangzhi</au><au>Hu, Junlin</au><au>Xie, Tianbao</au><au>Wu, Zhiyong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant</atitle><date>2024-10-24</date><risdate>2024</risdate><abstract>Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core \textbf{MetaAgent} with the \textbf{AgentToken} strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.</abstract><doi>10.48550/arxiv.2410.18603</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2410.18603
ispartof
issn
language eng
recordid cdi_arxiv_primary_2410_18603
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Robotics
title AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T23%3A54%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AgentStore:%20Scalable%20Integration%20of%20Heterogeneous%20Agents%20As%20Specialized%20Generalist%20Computer%20Assistant&rft.au=Jia,%20Chengyou&rft.date=2024-10-24&rft_id=info:doi/10.48550/arxiv.2410.18603&rft_dat=%3Carxiv_GOX%3E2410_18603%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true