Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance
Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely m...
Gespeichert in:
Veröffentlicht in: | IEEE access 2021, Vol.9, p.46788-46799 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 46799 |
---|---|
container_issue | |
container_start_page | 46788 |
container_title | IEEE access |
container_volume | 9 |
creator | Tanimoto, Akira |
description | Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of O(2^{nH}) , where n is the number of components and H is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys O(n) complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step. |
doi_str_mv | 10.1109/ACCESS.2021.3059244 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2021_3059244</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9354750</ieee_id><doaj_id>oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5</doaj_id><sourcerecordid>2509076927</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</originalsourceid><addsrcrecordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2509076927</pqid></control><display><type>article</type><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /></source><creator>Tanimoto, Akira</creator><creatorcontrib>Tanimoto, Akira</creatorcontrib><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3059244</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>artificial intelligence ; Combinatorial analysis ; combinatorial optimization ; Complexity ; Computer Science ; Computer Science, Information Systems ; Condition monitoring ; decision support systems ; Degradation ; Dependence ; Dynamic programming ; Economics ; Engineering ; Engineering, Electrical & Electronic ; Infrastructure ; Machine learning ; Maintenance costs ; Maintenance engineering ; Optimization ; Planning ; Prediction models ; Predictive maintenance ; reinforcement learning ; Scalability ; Science & Technology ; Technology ; Telecommunications ; Uncertainty ; Vehicle dynamics</subject><ispartof>IEEE access, 2021, Vol.9, p.46788-46799</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>6</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000637162400001</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</citedby><cites>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</cites><orcidid>0000-0003-0459-3993</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9354750$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,865,2103,2115,4025,27638,27928,27929,27930,39263,54938</link.rule.ids></links><search><creatorcontrib>Tanimoto, Akira</creatorcontrib><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><title>IEEE access</title><addtitle>Access</addtitle><addtitle>IEEE ACCESS</addtitle><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><subject>artificial intelligence</subject><subject>Combinatorial analysis</subject><subject>combinatorial optimization</subject><subject>Complexity</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Condition monitoring</subject><subject>decision support systems</subject><subject>Degradation</subject><subject>Dependence</subject><subject>Dynamic programming</subject><subject>Economics</subject><subject>Engineering</subject><subject>Engineering, Electrical & Electronic</subject><subject>Infrastructure</subject><subject>Machine learning</subject><subject>Maintenance costs</subject><subject>Maintenance engineering</subject><subject>Optimization</subject><subject>Planning</subject><subject>Prediction models</subject><subject>Predictive maintenance</subject><subject>reinforcement learning</subject><subject>Scalability</subject><subject>Science & Technology</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Uncertainty</subject><subject>Vehicle dynamics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>HGBXW</sourceid><sourceid>DOA</sourceid><recordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Tanimoto, Akira</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid></search><sort><creationdate>2021</creationdate><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><author>Tanimoto, Akira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>artificial intelligence</topic><topic>Combinatorial analysis</topic><topic>combinatorial optimization</topic><topic>Complexity</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Condition monitoring</topic><topic>decision support systems</topic><topic>Degradation</topic><topic>Dependence</topic><topic>Dynamic programming</topic><topic>Economics</topic><topic>Engineering</topic><topic>Engineering, Electrical & Electronic</topic><topic>Infrastructure</topic><topic>Machine learning</topic><topic>Maintenance costs</topic><topic>Maintenance engineering</topic><topic>Optimization</topic><topic>Planning</topic><topic>Prediction models</topic><topic>Predictive maintenance</topic><topic>reinforcement learning</topic><topic>Scalability</topic><topic>Science & Technology</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Uncertainty</topic><topic>Vehicle dynamics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tanimoto, Akira</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tanimoto, Akira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><stitle>IEEE ACCESS</stitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>46788</spage><epage>46799</epage><pages>46788-46799</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2021.3059244</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2021, Vol.9, p.46788-46799 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_crossref_primary_10_1109_ACCESS_2021_3059244 |
source | IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /> |
subjects | artificial intelligence Combinatorial analysis combinatorial optimization Complexity Computer Science Computer Science, Information Systems Condition monitoring decision support systems Degradation Dependence Dynamic programming Economics Engineering Engineering, Electrical & Electronic Infrastructure Machine learning Maintenance costs Maintenance engineering Optimization Planning Prediction models Predictive maintenance reinforcement learning Scalability Science & Technology Technology Telecommunications Uncertainty Vehicle dynamics |
title | Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T22%3A50%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combinatorial%20Q-Learning%20for%20Condition-Based%20Infrastructure%20Maintenance&rft.jtitle=IEEE%20access&rft.au=Tanimoto,%20Akira&rft.date=2021&rft.volume=9&rft.spage=46788&rft.epage=46799&rft.pages=46788-46799&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3059244&rft_dat=%3Cproquest_cross%3E2509076927%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2509076927&rft_id=info:pmid/&rft_ieee_id=9354750&rft_doaj_id=oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5&rfr_iscdi=true |