EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Embodied artificial intelligence emphasizes the role of an agent's body in generating human-like behaviors. The recent efforts on EmbodiedAI pay a lot of attention to building up machine learning models to possess perceiving, planning, and acting abilities, thereby enabling real-time interactio...

Bibliographic details
Main authors: Gao, Chen; Zhao, Baining; Zhang, Weichen; Mao, Jinzhu; Zhang, Jun; Zheng, Zhiheng; Man, Fanhang; Fang, Jianjie; Zhou, Zile; Cui, Jinqiang; Chen, Xinlei; Li, Yong
Format: Article
Language: eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Gao, Chen
Zhao, Baining
Zhang, Weichen
Mao, Jinzhu
Zhang, Jun
Zheng, Zhiheng
Man, Fanhang
Fang, Jianjie
Zhou, Zile
Cui, Jinqiang
Chen, Xinlei
Li, Yong
description Embodied artificial intelligence emphasizes the role of an agent's body in generating human-like behaviors. The recent efforts on EmbodiedAI pay a lot of attention to building up machine learning models to possess perceiving, planning, and acting abilities, thereby enabling real-time interaction with the world. However, most works focus on bounded indoor environments, such as navigation in a room or manipulating a device, with limited exploration of embodying the agents in open-world scenarios. That is, embodied intelligence in the open and outdoor environment is less explored, for which one potential reason is the lack of high-quality simulators, benchmarks, and datasets. To address it, in this paper, we construct a benchmark platform for embodied intelligence evaluation in real-world city environments. Specifically, we first construct a highly realistic 3D simulation environment based on the real buildings, roads, and other elements in a real city. In this environment, we combine historically collected data and simulation algorithms to conduct simulations of pedestrian and vehicle flows with high fidelity. Further, we designed a set of evaluation tasks covering different EmbodiedAI abilities. Moreover, we provide a complete set of input and output interfaces for access, enabling embodied agents to easily take task requirements and current environmental observations as input and then make decisions and obtain performance evaluations. On the one hand, it expands the capability of existing embodied intelligence to higher levels. On the other hand, it has a higher practical value in the real world and can support more potential applications for artificial general intelligence. Based on this platform, we evaluate some popular large language models for embodied intelligence capabilities of different dimensions and difficulties.
doi_str_mv 10.48550/arxiv.2410.09604
format Article
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2410.09604
ispartof
issn
language eng
recordid cdi_arxiv_primary_2410_09604
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Robotics
title EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T02%3A11%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=EmbodiedCity:%20A%20Benchmark%20Platform%20for%20Embodied%20Agent%20in%20Real-world%20City%20Environment&rft.au=Gao,%20Chen&rft.date=2024-10-12&rft_id=info:doi/10.48550/arxiv.2410.09604&rft_dat=%3Carxiv_GOX%3E2410_09604%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true