Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs
We interpret solving the multi-vehicle routing problem as a team Markov game with partially observable costs. For a given set of customers to serve, the playing agents (vehicles) have the common goal to determine the team-optimal agent routes with minimal total cost. Each agent thereby observes only...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Paul, Nathalie Wirtz, Tim Wrobel, Stefan Kister, Alexander |
description | We interpret solving the multi-vehicle routing problem as a team Markov game
with partially observable costs. For a given set of customers to serve, the
playing agents (vehicles) have the common goal to determine the team-optimal
agent routes with minimal total cost. Each agent thereby observes only its own
cost. Our multi-agent reinforcement learning approach, the so-called
multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to
solve the problem by iteratively rewriting solutions. Parallel agent action
execution and partial observability require new rewriting rules for the game.
We propose the introduction of a so-called pool in the system which serves as a
collection point for unvisited nodes. It enables agents to act simultaneously
and exchange nodes in a conflict-free manner. We realize limited disclosure of
agent-specific costs by only sharing them during learning. During inference,
each agents acts decentrally, solely based on its own cost. First empirical
results on small problem sizes demonstrate that we reach a performance close to
the employed OR-Tools benchmark which operates in the perfect cost information
setting. |
doi_str_mv | 10.48550/arxiv.2206.05990 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2206_05990</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2206_05990</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-b4a8f0e46ff60a681b22c7e467b954fb239a3e6cfc6892212cc7357b204a5c33</originalsourceid><addsrcrecordid>eNotz0tOwzAYBGBvWKDCAVjVF0hw_IyXVXhKKaAWsY1s87u15DbIdijcnlJYjUYajfQhdNWQmrdCkGuTvsJnTSmRNRFak3P0spxiCdViA_uCn2BKJuIVHFIokLAfE36DbXAR8GqcSthv8CGULe7D7jh4xzchuzjmKQEePe7GXPIFOvMmZrj8zxla392-dg9V_3z_2C36ykhFKstN6wlw6b0kRraNpdSpY1dWC-4tZdowkM472WpKG-qcYkJZSrgRjrEZmv-9nkTDRwo7k76HX9lwkrEf4y1JBA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><source>arXiv.org</source><creator>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</creator><creatorcontrib>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</creatorcontrib><description>We interpret solving the multi-vehicle routing problem as a team Markov game
with partially observable costs. For a given set of customers to serve, the
playing agents (vehicles) have the common goal to determine the team-optimal
agent routes with minimal total cost. Each agent thereby observes only its own
cost. Our multi-agent reinforcement learning approach, the so-called
multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to
solve the problem by iteratively rewriting solutions. Parallel agent action
execution and partial observability require new rewriting rules for the game.
We propose the introduction of a so-called pool in the system which serves as a
collection point for unvisited nodes. It enables agents to act simultaneously
and exchange nodes in a conflict-free manner. We realize limited disclosure of
agent-specific costs by only sharing them during learning. During inference,
each agents acts decentrally, solely based on its own cost. First empirical
results on small problem sizes demonstrate that we reach a performance close to
the employed OR-Tools benchmark which operates in the perfect cost information
setting.</description><identifier>DOI: 10.48550/arxiv.2206.05990</identifier><language>eng</language><subject>Computer Science - Computer Science and Game Theory ; Computer Science - Learning ; Computer Science - Multiagent Systems</subject><creationdate>2022-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,782,887</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2206.05990$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2206.05990$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Paul, Nathalie</creatorcontrib><creatorcontrib>Wirtz, Tim</creatorcontrib><creatorcontrib>Wrobel, Stefan</creatorcontrib><creatorcontrib>Kister, Alexander</creatorcontrib><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><description>We interpret solving the multi-vehicle routing problem as a team Markov game
with partially observable costs. For a given set of customers to serve, the
playing agents (vehicles) have the common goal to determine the team-optimal
agent routes with minimal total cost. Each agent thereby observes only its own
cost. Our multi-agent reinforcement learning approach, the so-called
multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to
solve the problem by iteratively rewriting solutions. Parallel agent action
execution and partial observability require new rewriting rules for the game.
We propose the introduction of a so-called pool in the system which serves as a
collection point for unvisited nodes. It enables agents to act simultaneously
and exchange nodes in a conflict-free manner. We realize limited disclosure of
agent-specific costs by only sharing them during learning. During inference,
each agents acts decentrally, solely based on its own cost. First empirical
results on small problem sizes demonstrate that we reach a performance close to
the employed OR-Tools benchmark which operates in the perfect cost information
setting.</description><subject>Computer Science - Computer Science and Game Theory</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Multiagent Systems</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0tOwzAYBGBvWKDCAVjVF0hw_IyXVXhKKaAWsY1s87u15DbIdijcnlJYjUYajfQhdNWQmrdCkGuTvsJnTSmRNRFak3P0spxiCdViA_uCn2BKJuIVHFIokLAfE36DbXAR8GqcSthv8CGULe7D7jh4xzchuzjmKQEePe7GXPIFOvMmZrj8zxla392-dg9V_3z_2C36ykhFKstN6wlw6b0kRraNpdSpY1dWC-4tZdowkM472WpKG-qcYkJZSrgRjrEZmv-9nkTDRwo7k76HX9lwkrEf4y1JBA</recordid><startdate>20220613</startdate><enddate>20220613</enddate><creator>Paul, Nathalie</creator><creator>Wirtz, Tim</creator><creator>Wrobel, Stefan</creator><creator>Kister, Alexander</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220613</creationdate><title>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</title><author>Paul, Nathalie ; Wirtz, Tim ; Wrobel, Stefan ; Kister, Alexander</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-b4a8f0e46ff60a681b22c7e467b954fb239a3e6cfc6892212cc7357b204a5c33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Science and Game Theory</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Multiagent Systems</topic><toplevel>online_resources</toplevel><creatorcontrib>Paul, Nathalie</creatorcontrib><creatorcontrib>Wirtz, Tim</creatorcontrib><creatorcontrib>Wrobel, Stefan</creatorcontrib><creatorcontrib>Kister, Alexander</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Paul, Nathalie</au><au>Wirtz, Tim</au><au>Wrobel, Stefan</au><au>Kister, Alexander</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs</atitle><date>2022-06-13</date><risdate>2022</risdate><abstract>We interpret solving the multi-vehicle routing problem as a team Markov game
with partially observable costs. For a given set of customers to serve, the
playing agents (vehicles) have the common goal to determine the team-optimal
agent routes with minimal total cost. Each agent thereby observes only its own
cost. Our multi-agent reinforcement learning approach, the so-called
multi-agent Neural Rewriter, builds on the single-agent Neural Rewriter to
solve the problem by iteratively rewriting solutions. Parallel agent action
execution and partial observability require new rewriting rules for the game.
We propose the introduction of a so-called pool in the system which serves as a
collection point for unvisited nodes. It enables agents to act simultaneously
and exchange nodes in a conflict-free manner. We realize limited disclosure of
agent-specific costs by only sharing them during learning. During inference,
each agents acts decentrally, solely based on its own cost. First empirical
results on small problem sizes demonstrate that we reach a performance close to
the employed OR-Tools benchmark which operates in the perfect cost information
setting.</abstract><doi>10.48550/arxiv.2206.05990</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2206.05990 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2206_05990 |
source | arXiv.org |
subjects | Computer Science - Computer Science and Game Theory Computer Science - Learning Computer Science - Multiagent Systems |
title | Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T19%3A23%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-Agent%20Neural%20Rewriter%20for%20Vehicle%20Routing%20with%20Limited%20Disclosure%20of%20Costs&rft.au=Paul,%20Nathalie&rft.date=2022-06-13&rft_id=info:doi/10.48550/arxiv.2206.05990&rft_dat=%3Carxiv_GOX%3E2206_05990%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |