AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling...
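Main authors: | Jia, Chengyou; Luo, Minnan; Dang, Zhuohang; Sun, Qiushi; Xu, Fangzhi; Hu, Junlin; Xie, Tianbao; Wu, Zhiyong |
---|---|
Format: | Article |
Language: | eng |
Subjects: | Computer Science - Artificial Intelligence; Computer Science - Robotics |
Online access: | Order full text |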
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Jia, Chengyou; Luo, Minnan; Dang, Zhuohang; Sun, Qiushi; Xu, Fangzhi; Hu, Junlin; Xie, Tianbao; Wu, Zhiyong |
description | Digital agents capable of automating complex computer tasks have attracted
considerable attention due to their immense potential to enhance human-computer
interaction. However, existing agent methods exhibit deficiencies in their
generalization and specialization capabilities, especially in handling
open-ended computer tasks in real-world environments. Inspired by the rich
functionality of the App store, we present AgentStore, a scalable platform
designed to dynamically integrate heterogeneous agents for automating computer
tasks. AgentStore empowers users to integrate third-party agents, allowing the
system to continuously enrich its capabilities and adapt to rapidly evolving
operating systems. Additionally, we propose a novel core MetaAgent
with the AgentToken strategy to efficiently manage diverse agents and
utilize their specialized and generalist abilities for both domain-specific and
system-wide tasks. Extensive experiments on three challenging benchmarks
demonstrate that AgentStore surpasses the limitations of previous systems with
narrow capabilities, particularly achieving a significant improvement from
11.21% to 23.85% on the OSWorld benchmark, more than doubling the previous
results. Comprehensive quantitative and qualitative results further demonstrate
AgentStore's ability to enhance agent systems in both generalization and
specialization, underscoring its potential for developing the specialized
generalist computer assistant. All our code will be made publicly available at
https://chengyou-jia.github.io/AgentStore-Home. |
doi_str_mv | 10.48550/arxiv.2410.18603 |
format | Article |
creationdate | 2024-10-24 |
linktorsrc | https://arxiv.org/abs/2410.18603 |
oa | free_for_read |
rights | http://creativecommons.org/licenses/by/4.0 |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2410.18603 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2410_18603 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence; Computer Science - Robotics |
title | AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T23%3A54%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AgentStore:%20Scalable%20Integration%20of%20Heterogeneous%20Agents%20As%20Specialized%20Generalist%20Computer%20Assistant&rft.au=Jia,%20Chengyou&rft.date=2024-10-24&rft_id=info:doi/10.48550/arxiv.2410.18603&rft_dat=%3Carxiv_GOX%3E2410_18603%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |