Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance

Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely m...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2021, Vol.9, p.46788-46799
1. Verfasser: Tanimoto, Akira
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 46799
container_issue
container_start_page 46788
container_title IEEE access
container_volume 9
creator Tanimoto, Akira
description Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of O(2^{nH}) , where n is the number of components and H is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys O(n) complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.
doi_str_mv 10.1109/ACCESS.2021.3059244
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2021_3059244</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9354750</ieee_id><doaj_id>oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5</doaj_id><sourcerecordid>2509076927</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</originalsourceid><addsrcrecordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2509076927</pqid></control><display><type>article</type><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Web of Science - Science Citation Index Expanded - 2021&lt;img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /&gt;</source><creator>Tanimoto, Akira</creator><creatorcontrib>Tanimoto, Akira</creatorcontrib><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3059244</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>artificial intelligence ; Combinatorial analysis ; combinatorial optimization ; Complexity ; Computer Science ; Computer Science, Information Systems ; Condition monitoring ; decision support systems ; Degradation ; Dependence ; Dynamic programming ; Economics ; Engineering ; Engineering, Electrical &amp; Electronic ; Infrastructure ; Machine learning ; Maintenance costs ; Maintenance engineering ; Optimization ; Planning ; Prediction models ; Predictive maintenance ; reinforcement learning ; Scalability ; Science &amp; Technology ; Technology ; Telecommunications ; Uncertainty ; Vehicle dynamics</subject><ispartof>IEEE access, 2021, Vol.9, p.46788-46799</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>6</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000637162400001</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</citedby><cites>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</cites><orcidid>0000-0003-0459-3993</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9354750$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,865,2103,2115,4025,27638,27928,27929,27930,39263,54938</link.rule.ids></links><search><creatorcontrib>Tanimoto, Akira</creatorcontrib><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><title>IEEE access</title><addtitle>Access</addtitle><addtitle>IEEE ACCESS</addtitle><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><subject>artificial intelligence</subject><subject>Combinatorial analysis</subject><subject>combinatorial optimization</subject><subject>Complexity</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Condition monitoring</subject><subject>decision support systems</subject><subject>Degradation</subject><subject>Dependence</subject><subject>Dynamic programming</subject><subject>Economics</subject><subject>Engineering</subject><subject>Engineering, Electrical &amp; Electronic</subject><subject>Infrastructure</subject><subject>Machine learning</subject><subject>Maintenance costs</subject><subject>Maintenance engineering</subject><subject>Optimization</subject><subject>Planning</subject><subject>Prediction models</subject><subject>Predictive maintenance</subject><subject>reinforcement learning</subject><subject>Scalability</subject><subject>Science &amp; Technology</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Uncertainty</subject><subject>Vehicle dynamics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>HGBXW</sourceid><sourceid>DOA</sourceid><recordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Tanimoto, Akira</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid></search><sort><creationdate>2021</creationdate><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><author>Tanimoto, Akira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>artificial intelligence</topic><topic>Combinatorial analysis</topic><topic>combinatorial optimization</topic><topic>Complexity</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Condition monitoring</topic><topic>decision support systems</topic><topic>Degradation</topic><topic>Dependence</topic><topic>Dynamic programming</topic><topic>Economics</topic><topic>Engineering</topic><topic>Engineering, Electrical &amp; Electronic</topic><topic>Infrastructure</topic><topic>Machine learning</topic><topic>Maintenance costs</topic><topic>Maintenance engineering</topic><topic>Optimization</topic><topic>Planning</topic><topic>Prediction models</topic><topic>Predictive maintenance</topic><topic>reinforcement learning</topic><topic>Scalability</topic><topic>Science &amp; Technology</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Uncertainty</topic><topic>Vehicle dynamics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tanimoto, Akira</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tanimoto, Akira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><stitle>IEEE ACCESS</stitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>46788</spage><epage>46799</epage><pages>46788-46799</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2021.3059244</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2021, Vol.9, p.46788-46799
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2021_3059244
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />
subjects artificial intelligence
Combinatorial analysis
combinatorial optimization
Complexity
Computer Science
Computer Science, Information Systems
Condition monitoring
decision support systems
Degradation
Dependence
Dynamic programming
Economics
Engineering
Engineering, Electrical & Electronic
Infrastructure
Machine learning
Maintenance costs
Maintenance engineering
Optimization
Planning
Prediction models
Predictive maintenance
reinforcement learning
Scalability
Science & Technology
Technology
Telecommunications
Uncertainty
Vehicle dynamics
title Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T22%3A50%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combinatorial%20Q-Learning%20for%20Condition-Based%20Infrastructure%20Maintenance&rft.jtitle=IEEE%20access&rft.au=Tanimoto,%20Akira&rft.date=2021&rft.volume=9&rft.spage=46788&rft.epage=46799&rft.pages=46788-46799&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3059244&rft_dat=%3Cproquest_cross%3E2509076927%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2509076927&rft_id=info:pmid/&rft_ieee_id=9354750&rft_doaj_id=oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5&rfr_iscdi=true