Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs

We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2022-06
Hauptverfasser:	Paul, Nathalie, Wirtz, Tim, Wrobel, Stefan, Kister, Alexander
Format:	Artikel
Sprache:	eng
Schlagworte:	Costs Learning Multiagent systems Nodes Route planning Vehicle routing
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Paul, Nathalie Wirtz, Tim Wrobel, Stefan Kister, Alexander
description	We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only its own cost. Our multi-agent reinforcement learning approach, the so-called multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to solve the problem by iteratively rewriting solutions. Parallel agent action execution and partial observability require new rewriting rules for the game. We propose the introduction of a so-called pool in the system which serves as a collection point for unvisited nodes. It enables agents to act simultaneously and exchange nodes in a conflict-free manner. We realize limited disclosure of agent-specific costs by only sharing them during learning. During inference, each agents acts decentrally, solely based on its own cost. First empirical results on small problem sizes demonstrate that we reach a performance close to the employed OR-Tools benchmark which operates in the perfect cost information setting.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2676388135</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2676388135</sourcerecordid><originalsourceid>FETCH-proquest_journals_26763881353</originalsourceid><addsrcrecordid>eNqNjsEKgkAUAJcgSMp_eNBZ0N3WvIYVHSpCoquIPXVlc2vfLv5-HvqATnOYOcyMBVyIJMo2nC9YSNTHcczTLZdSBOx28dqpaNfi4OCK3lYaChytcmihMRYe2KlaIxTGOzW0MCrXwVm9puAJe0W1NuQtgmkgN-RoxeZNpQnDH5dsfTzc81P0tubjkVzZG2-HSZXTRCqyLBFS_Fd9AUDsPso</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2676388135</pqid></control><display><type>article</type><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><source>Free E- Journals</source><creator>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</creator><creatorcontrib>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</creatorcontrib><description>We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only its own cost. Our multi-agent reinforcement learning approach, the so-called multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to solve the problem by iteratively rewriting solutions. Parallel agent action execution and partial observability require new rewriting rules for the game. We propose the introduction of a so-called pool in the system which serves as a collection point for unvisited nodes. It enables agents to act simultaneously and exchange nodes in a conflict-free manner. We realize limited disclosure of agent-specific costs by only sharing them during learning. During inference, each agents acts decentrally, solely based on its own cost. First empirical results on small problem sizes demonstrate that we reach a performance close to the employed OR-Tools benchmark which operates in the perfect cost information setting.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Costs ; Learning ; Multiagent systems ; Nodes ; Route planning ; Vehicle routing</subject><ispartof>arXiv.org, 2022-06</ispartof><rights>2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,785</link.rule.ids></links><search><creatorcontrib>Paul, Nathalie</creatorcontrib><creatorcontrib>Wirtz, Tim</creatorcontrib><creatorcontrib>Wrobel, Stefan</creatorcontrib><creatorcontrib>Kister, Alexander</creatorcontrib><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><title>arXiv.org</title><description>We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only its own cost. Our multi-agent reinforcement learning approach, the so-called multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to solve the problem by iteratively rewriting solutions. Parallel agent action execution and partial observability require new rewriting rules for the game. We propose the introduction of a so-called pool in the system which serves as a collection point for unvisited nodes. It enables agents to act simultaneously and exchange nodes in a conflict-free manner. We realize limited disclosure of agent-specific costs by only sharing them during learning. During inference, each agents acts decentrally, solely based on its own cost. First empirical results on small problem sizes demonstrate that we reach a performance close to the employed OR-Tools benchmark which operates in the perfect cost information setting.</description><subject>Costs</subject><subject>Learning</subject><subject>Multiagent systems</subject><subject>Nodes</subject><subject>Route planning</subject><subject>Vehicle routing</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjsEKgkAUAJcgSMp_eNBZ0N3WvIYVHSpCoquIPXVlc2vfLv5-HvqATnOYOcyMBVyIJMo2nC9YSNTHcczTLZdSBOx28dqpaNfi4OCK3lYaChytcmihMRYe2KlaIxTGOzW0MCrXwVm9puAJe0W1NuQtgmkgN-RoxeZNpQnDH5dsfTzc81P0tubjkVzZG2-HSZXTRCqyLBFS_Fd9AUDsPso</recordid><startdate>20220613</startdate><enddate>20220613</enddate><creator>Paul, Nathalie</creator><creator>Wirtz, Tim</creator><creator>Wrobel, Stefan</creator><creator>Kister, Alexander</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220613</creationdate><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><author>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26763881353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Costs</topic><topic>Learning</topic><topic>Multiagent systems</topic><topic>Nodes</topic><topic>Route planning</topic><topic>Vehicle routing</topic><toplevel>online_resources</toplevel><creatorcontrib>Paul, Nathalie</creatorcontrib><creatorcontrib>Wirtz, Tim</creatorcontrib><creatorcontrib>Wrobel, Stefan</creatorcontrib><creatorcontrib>Kister, Alexander</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Paul, Nathalie</au><au>Wirtz, Tim</au><au>Wrobel, Stefan</au><au>Kister, Alexander</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</atitle><jtitle>arXiv.org</jtitle><date>2022-06-13</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only its own cost. Our multi-agent reinforcement learning approach, the so-called multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to solve the problem by iteratively rewriting solutions. Parallel agent action execution and partial observability require new rewriting rules for the game. We propose the introduction of a so-called pool in the system which serves as a collection point for unvisited nodes. It enables agents to act simultaneously and exchange nodes in a conflict-free manner. We realize limited disclosure of agent-specific costs by only sharing them during learning. During inference, each agents acts decentrally, solely based on its own cost. First empirical results on small problem sizes demonstrate that we reach a performance close to the employed OR-Tools benchmark which operates in the perfect cost information setting.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2022-06
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2676388135
source	Free E- Journals
subjects	Costs Learning Multiagent systems Nodes Route planning Vehicle routing
title	Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T13%3A13%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Multi-Agent%20Neural%20Rewriter%20for%20Vehicle%20Routing%20with%20Limited%20Disclosure%20of%20Costs&rft.jtitle=arXiv.org&rft.au=Paul,%20Nathalie&rft.date=2022-06-13&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2676388135%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2676388135&rft_id=info:pmid/&rfr_iscdi=true