Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance

Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely m...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE access 2021, Vol.9, p.46788-46799
1. Verfasser:	Tanimoto, Akira
Format:	Artikel
Sprache:	eng
Schlagworte:	artificial intelligence Combinatorial analysis combinatorial optimization Complexity Computer Science Computer Science, Information Systems Condition monitoring decision support systems Degradation Dependence Dynamic programming Economics Engineering Engineering, Electrical & Electronic Infrastructure Machine learning Maintenance costs Maintenance engineering Optimization Planning Prediction models Predictive maintenance reinforcement learning Scalability Science & Technology Technology Telecommunications Uncertainty Vehicle dynamics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	46799
container_issue
container_start_page	46788
container_title	IEEE access
container_volume	9
creator	Tanimoto, Akira
description	Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of O(2^{nH}) , where n is the number of components and H is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys O(n) complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.
doi_str_mv	10.1109/ACCESS.2021.3059244
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2021_3059244</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9354750</ieee_id><doaj_id>oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5</doaj_id><sourcerecordid>2509076927</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</originalsourceid><addsrcrecordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2509076927</pqid></control><display><type>article</type><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /></source><creator>Tanimoto, Akira</creator><creatorcontrib>Tanimoto, Akira</creatorcontrib><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3059244</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>artificial intelligence ; Combinatorial analysis ; combinatorial optimization ; Complexity ; Computer Science ; Computer Science, Information Systems ; Condition monitoring ; decision support systems ; Degradation ; Dependence ; Dynamic programming ; Economics ; Engineering ; Engineering, Electrical & Electronic ; Infrastructure ; Machine learning ; Maintenance costs ; Maintenance engineering ; Optimization ; Planning ; Prediction models ; Predictive maintenance ; reinforcement learning ; Scalability ; Science & Technology ; Technology ; Telecommunications ; Uncertainty ; Vehicle dynamics</subject><ispartof>IEEE access, 2021, Vol.9, p.46788-46799</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>6</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000637162400001</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</citedby><cites>FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</cites><orcidid>0000-0003-0459-3993</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9354750$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,865,2103,2115,4025,27638,27928,27929,27930,39263,54938</link.rule.ids></links><search><creatorcontrib>Tanimoto, Akira</creatorcontrib><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><title>IEEE access</title><addtitle>Access</addtitle><addtitle>IEEE ACCESS</addtitle><description><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></description><subject>artificial intelligence</subject><subject>Combinatorial analysis</subject><subject>combinatorial optimization</subject><subject>Complexity</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Condition monitoring</subject><subject>decision support systems</subject><subject>Degradation</subject><subject>Dependence</subject><subject>Dynamic programming</subject><subject>Economics</subject><subject>Engineering</subject><subject>Engineering, Electrical & Electronic</subject><subject>Infrastructure</subject><subject>Machine learning</subject><subject>Maintenance costs</subject><subject>Maintenance engineering</subject><subject>Optimization</subject><subject>Planning</subject><subject>Prediction models</subject><subject>Predictive maintenance</subject><subject>reinforcement learning</subject><subject>Scalability</subject><subject>Science & Technology</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Uncertainty</subject><subject>Vehicle dynamics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>HGBXW</sourceid><sourceid>DOA</sourceid><recordid>eNqNUV1rFTEUXETB0vYX9GXBR9lrvjd5rEutF24RqT6HbHJScrlNapJF_Pdm3VJ9NC8nDDNzDjNdd4XRDmOkPlxP0839_Y4ggncUcUUYe9WdESzUQDkVr__5v-0uSzmi9mSD-HjW3U7pcQ7R1JSDOfVfhwOYHEN86H3K_ZSiCzWkOHw0BVy_jz6bUvNi65KhvzMhVogmWrjo3nhzKnD5PM-7759uvk2fh8OX2_10fRgsQ7IOAuxsvJQIRgzCE444Gy0eDeXSWj8T42bsOMxGcuYwwsozYEo1zHAqR3re7Tdfl8xRP-XwaPIvnUzQf4CUH7TJNdgTaIEktURI593IOG9WVFg7i9lh4Mjz5vVu83rK6ccCpepjWnJs5-t2mEKjUGTdSDeWzamUDP5lK0Z6LUBvBei1AP1cQFPJTfUT5uSLDdBCelG2AgQdsSBs7QJPoZo15SktsTbp-_-XNvbVxg4Af1mKtlw5or8BvEehtQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Tanimoto, Akira</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid></search><sort><creationdate>2021</creationdate><title>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</title><author>Tanimoto, Akira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-6ecbaf880e71e6f250547c17a358ccfb2adb1d5eba854d1019f4e4991d5a53873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>artificial intelligence</topic><topic>Combinatorial analysis</topic><topic>combinatorial optimization</topic><topic>Complexity</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Condition monitoring</topic><topic>decision support systems</topic><topic>Degradation</topic><topic>Dependence</topic><topic>Dynamic programming</topic><topic>Economics</topic><topic>Engineering</topic><topic>Engineering, Electrical & Electronic</topic><topic>Infrastructure</topic><topic>Machine learning</topic><topic>Maintenance costs</topic><topic>Maintenance engineering</topic><topic>Optimization</topic><topic>Planning</topic><topic>Prediction models</topic><topic>Predictive maintenance</topic><topic>reinforcement learning</topic><topic>Scalability</topic><topic>Science & Technology</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Uncertainty</topic><topic>Vehicle dynamics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tanimoto, Akira</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tanimoto, Akira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><stitle>IEEE ACCESS</stitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>46788</spage><epage>46799</epage><pages>46788-46799</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract><![CDATA[Infrastructure maintenance planning is a large-scale optimization problem of planning when and on which components to carry out maintenance so as to keep the whole infrastructure in good condition with minimal maintenance cost. Recent advances in condition monitoring techniques have enabled timely maintenance in response to the condition of each part regardless of age. In addition to the condition, the spatial structure is also important for cost-efficiency in infrastructure maintenance since traveling costs and/or setup costs can be saved by simultaneous maintenance of neighboring components, which is called economic dependency. This optimization problem naively has a high computational complexity of <inline-formula> <tex-math notation="LaTeX">O(2^{nH}) </tex-math></inline-formula>, where <inline-formula> <tex-math notation="LaTeX">n </tex-math></inline-formula> is the number of components and <inline-formula> <tex-math notation="LaTeX">H </tex-math></inline-formula> is the planning horizon, and the predictive modeling of degradation is also a big issue. To solve this problem efficiently at scale, our proposed method utilizes two kinds of dynamic programming for temporal and spatial scalability and consequently enjoys <inline-formula> <tex-math notation="LaTeX">O(n) </tex-math></inline-formula> complexity at each time step. For temporal scalability, we utilize a direct modeling approach for the action value of maintenance instead of modeling degradation, namely, Q-learning. For spatial scalability, we exploit locality in economic dependency by means of a reasonable approximation of the Q-function. A typical baseline approach is to divide the whole infrastructure into fixed groups of neighboring components beforehand and determine if maintenance should be performed for all the components in each group at each time step. In contrast, our scalable method enables fully combinatorial optimization for each component at each time step. We demonstrate the advantage of our method in a simulated environment, and the resulting maintenance history intuitively illustrates the benefit of our dynamic grouping approach. We also show that our method has a kind of interpretability in the optimization at each time step.]]></abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2021.3059244</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-0459-3993</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2169-3536
ispartof	IEEE access, 2021, Vol.9, p.46788-46799
issn	2169-3536 2169-3536
language	eng
recordid	cdi_crossref_primary_10_1109_ACCESS_2021_3059244
source	IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />
subjects	artificial intelligence Combinatorial analysis combinatorial optimization Complexity Computer Science Computer Science, Information Systems Condition monitoring decision support systems Degradation Dependence Dynamic programming Economics Engineering Engineering, Electrical & Electronic Infrastructure Machine learning Maintenance costs Maintenance engineering Optimization Planning Prediction models Predictive maintenance reinforcement learning Scalability Science & Technology Technology Telecommunications Uncertainty Vehicle dynamics
title	Combinatorial Q-Learning for Condition-Based Infrastructure Maintenance
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T22%3A50%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combinatorial%20Q-Learning%20for%20Condition-Based%20Infrastructure%20Maintenance&rft.jtitle=IEEE%20access&rft.au=Tanimoto,%20Akira&rft.date=2021&rft.volume=9&rft.spage=46788&rft.epage=46799&rft.pages=46788-46799&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3059244&rft_dat=%3Cproquest_cross%3E2509076927%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2509076927&rft_id=info:pmid/&rft_ieee_id=9354750&rft_doaj_id=oai_doaj_org_article_6083c268dfd745519f36ccb6bd1e50f5&rfr_iscdi=true