Investigation of reinforcement learning for shape optimization of profile extrusion dies

Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or resid...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2022-12
Hauptverfasser:	Fricke, Clemens, Wolff, Daniel, Kemmerling, Marco, Elgeti, Stefanie
Format:	Artikel
Sprache:	eng
Schlagworte:	Computational grids Computer Science - Computational Engineering, Finance, and Science Computer Science - Learning Continuous extrusion Continuous production Evolutionary algorithms Extrusion dies Free form Investigations Machine learning Mathematics - Optimization and Control Optimization Profile extrusion Residual stress Shape optimization Training Two dimensional flow Velocity distribution
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Fricke, Clemens Wolff, Daniel Kemmerling, Marco Elgeti, Stefanie
description	Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or residual stresses inside the extrudate, the final shape of the manufactured part often deviates from the desired one. To avoid these deviations, the shape of the die can be computationally optimized, which has already been investigated in the literature using classical optimization approaches. A new approach in the field of shape optimization is the utilization of Reinforcement Learning (RL) as a learning-based optimization algorithm. RL is based on trial-and-error interactions of an agent with an environment. For each action, the agent is rewarded and informed about the subsequent state of the environment. While not necessarily superior to classical, e.g., gradient-based or evolutionary, optimization algorithms for one single problem, RL techniques are expected to perform especially well when similar optimization tasks are repeated since the agent learns a more general strategy for generating optimal shapes instead of concentrating on just one single problem. In this work, we investigate this approach by applying it to two 2D test cases. The flow-channel geometry can be modified by the RL agent using so-called Free-Form Deformation, a method where the computational mesh is embedded into a transformation spline, which is then manipulated based on the control-point positions. In particular, we investigate the impact of utilizing different agents on the training progress and the potential of wall time saving by utilizing multiple environments during training.
doi_str_mv	10.48550/arxiv.2212.12207
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2212_12207</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2758201943</sourcerecordid><originalsourceid>FETCH-LOGICAL-a957-6eb0684d38789d921309f9d63e1d38ea4bc35fd056b817afb2eef4c5361d13373</originalsourceid><addsrcrecordid>eNo9j8tqwzAQRUWh0JDmA7qqoGun0oxlycsS-ggEusmiOyPHo1QhkV3JCWm_vk5Suho43Dvcw9idFNPcKCUebTz6wxRAwlQCCH3FRoAoM5MD3LBJShshBBQalMIR-5iHA6Xer23v28BbxyP54Nq4oh2Fnm_JxuDDmg-Ip0_bEW-73u_8z3-hi63zW-J07OM-nWDjKd2ya2e3iSZ_d8yWL8_L2Vu2eH-dz54WmS2VzgqqRWHyBo02ZVOCRFG6simQ5MDI5vUKlWuEKmojtXU1ELl8pbCQjUTUOGb3l7dn7aqLfmfjd3XSr876Q-Lhkhh2fu0H12rT7mMYNlWglQEhyxzxF8T5XqQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2758201943</pqid></control><display><type>article</type><title>Investigation of reinforcement learning for shape optimization of profile extrusion dies</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Fricke, Clemens ; Wolff, Daniel ; Kemmerling, Marco ; Elgeti, Stefanie</creator><creatorcontrib>Fricke, Clemens ; Wolff, Daniel ; Kemmerling, Marco ; Elgeti, Stefanie</creatorcontrib><description>Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or residual stresses inside the extrudate, the final shape of the manufactured part often deviates from the desired one. To avoid these deviations, the shape of the die can be computationally optimized, which has already been investigated in the literature using classical optimization approaches. A new approach in the field of shape optimization is the utilization of Reinforcement Learning (RL) as a learning-based optimization algorithm. RL is based on trial-and-error interactions of an agent with an environment. For each action, the agent is rewarded and informed about the subsequent state of the environment. While not necessarily superior to classical, e.g., gradient-based or evolutionary, optimization algorithms for one single problem, RL techniques are expected to perform especially well when similar optimization tasks are repeated since the agent learns a more general strategy for generating optimal shapes instead of concentrating on just one single problem. In this work, we investigate this approach by applying it to two 2D test cases. The flow-channel geometry can be modified by the RL agent using so-called Free-Form Deformation, a method where the computational mesh is embedded into a transformation spline, which is then manipulated based on the control-point positions. In particular, we investigate the impact of utilizing different agents on the training progress and the potential of wall time saving by utilizing multiple environments during training.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2212.12207</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computational grids ; Computer Science - Computational Engineering, Finance, and Science ; Computer Science - Learning ; Continuous extrusion ; Continuous production ; Evolutionary algorithms ; Extrusion dies ; Free form ; Investigations ; Machine learning ; Mathematics - Optimization and Control ; Optimization ; Profile extrusion ; Residual stress ; Shape optimization ; Training ; Two dimensional flow ; Velocity distribution</subject><ispartof>arXiv.org, 2022-12</ispartof><rights>2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27923</link.rule.ids><backlink>$$Uhttps://doi.org/10.3934/acse.2023001$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2212.12207$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Fricke, Clemens</creatorcontrib><creatorcontrib>Wolff, Daniel</creatorcontrib><creatorcontrib>Kemmerling, Marco</creatorcontrib><creatorcontrib>Elgeti, Stefanie</creatorcontrib><title>Investigation of reinforcement learning for shape optimization of profile extrusion dies</title><title>arXiv.org</title><description>Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or residual stresses inside the extrudate, the final shape of the manufactured part often deviates from the desired one. To avoid these deviations, the shape of the die can be computationally optimized, which has already been investigated in the literature using classical optimization approaches. A new approach in the field of shape optimization is the utilization of Reinforcement Learning (RL) as a learning-based optimization algorithm. RL is based on trial-and-error interactions of an agent with an environment. For each action, the agent is rewarded and informed about the subsequent state of the environment. While not necessarily superior to classical, e.g., gradient-based or evolutionary, optimization algorithms for one single problem, RL techniques are expected to perform especially well when similar optimization tasks are repeated since the agent learns a more general strategy for generating optimal shapes instead of concentrating on just one single problem. In this work, we investigate this approach by applying it to two 2D test cases. The flow-channel geometry can be modified by the RL agent using so-called Free-Form Deformation, a method where the computational mesh is embedded into a transformation spline, which is then manipulated based on the control-point positions. In particular, we investigate the impact of utilizing different agents on the training progress and the potential of wall time saving by utilizing multiple environments during training.</description><subject>Computational grids</subject><subject>Computer Science - Computational Engineering, Finance, and Science</subject><subject>Computer Science - Learning</subject><subject>Continuous extrusion</subject><subject>Continuous production</subject><subject>Evolutionary algorithms</subject><subject>Extrusion dies</subject><subject>Free form</subject><subject>Investigations</subject><subject>Machine learning</subject><subject>Mathematics - Optimization and Control</subject><subject>Optimization</subject><subject>Profile extrusion</subject><subject>Residual stress</subject><subject>Shape optimization</subject><subject>Training</subject><subject>Two dimensional flow</subject><subject>Velocity distribution</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNo9j8tqwzAQRUWh0JDmA7qqoGun0oxlycsS-ggEusmiOyPHo1QhkV3JCWm_vk5Suho43Dvcw9idFNPcKCUebTz6wxRAwlQCCH3FRoAoM5MD3LBJShshBBQalMIR-5iHA6Xer23v28BbxyP54Nq4oh2Fnm_JxuDDmg-Ip0_bEW-73u_8z3-hi63zW-J07OM-nWDjKd2ya2e3iSZ_d8yWL8_L2Vu2eH-dz54WmS2VzgqqRWHyBo02ZVOCRFG6simQ5MDI5vUKlWuEKmojtXU1ELl8pbCQjUTUOGb3l7dn7aqLfmfjd3XSr876Q-Lhkhh2fu0H12rT7mMYNlWglQEhyxzxF8T5XqQ</recordid><startdate>20221223</startdate><enddate>20221223</enddate><creator>Fricke, Clemens</creator><creator>Wolff, Daniel</creator><creator>Kemmerling, Marco</creator><creator>Elgeti, Stefanie</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>AKZ</scope><scope>GOX</scope></search><sort><creationdate>20221223</creationdate><title>Investigation of reinforcement learning for shape optimization of profile extrusion dies</title><author>Fricke, Clemens ; Wolff, Daniel ; Kemmerling, Marco ; Elgeti, Stefanie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a957-6eb0684d38789d921309f9d63e1d38ea4bc35fd056b817afb2eef4c5361d13373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computational grids</topic><topic>Computer Science - Computational Engineering, Finance, and Science</topic><topic>Computer Science - Learning</topic><topic>Continuous extrusion</topic><topic>Continuous production</topic><topic>Evolutionary algorithms</topic><topic>Extrusion dies</topic><topic>Free form</topic><topic>Investigations</topic><topic>Machine learning</topic><topic>Mathematics - Optimization and Control</topic><topic>Optimization</topic><topic>Profile extrusion</topic><topic>Residual stress</topic><topic>Shape optimization</topic><topic>Training</topic><topic>Two dimensional flow</topic><topic>Velocity distribution</topic><toplevel>online_resources</toplevel><creatorcontrib>Fricke, Clemens</creatorcontrib><creatorcontrib>Wolff, Daniel</creatorcontrib><creatorcontrib>Kemmerling, Marco</creatorcontrib><creatorcontrib>Elgeti, Stefanie</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv Mathematics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fricke, Clemens</au><au>Wolff, Daniel</au><au>Kemmerling, Marco</au><au>Elgeti, Stefanie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Investigation of reinforcement learning for shape optimization of profile extrusion dies</atitle><jtitle>arXiv.org</jtitle><date>2022-12-23</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or residual stresses inside the extrudate, the final shape of the manufactured part often deviates from the desired one. To avoid these deviations, the shape of the die can be computationally optimized, which has already been investigated in the literature using classical optimization approaches. A new approach in the field of shape optimization is the utilization of Reinforcement Learning (RL) as a learning-based optimization algorithm. RL is based on trial-and-error interactions of an agent with an environment. For each action, the agent is rewarded and informed about the subsequent state of the environment. While not necessarily superior to classical, e.g., gradient-based or evolutionary, optimization algorithms for one single problem, RL techniques are expected to perform especially well when similar optimization tasks are repeated since the agent learns a more general strategy for generating optimal shapes instead of concentrating on just one single problem. In this work, we investigate this approach by applying it to two 2D test cases. The flow-channel geometry can be modified by the RL agent using so-called Free-Form Deformation, a method where the computational mesh is embedded into a transformation spline, which is then manipulated based on the control-point positions. In particular, we investigate the impact of utilizing different agents on the training progress and the potential of wall time saving by utilizing multiple environments during training.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2212.12207</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2022-12
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2212_12207
source	arXiv.org; Free E- Journals
subjects	Computational grids Computer Science - Computational Engineering, Finance, and Science Computer Science - Learning Continuous extrusion Continuous production Evolutionary algorithms Extrusion dies Free form Investigations Machine learning Mathematics - Optimization and Control Optimization Profile extrusion Residual stress Shape optimization Training Two dimensional flow Velocity distribution
title	Investigation of reinforcement learning for shape optimization of profile extrusion dies
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T11%3A10%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Investigation%20of%20reinforcement%20learning%20for%20shape%20optimization%20of%20profile%20extrusion%20dies&rft.jtitle=arXiv.org&rft.au=Fricke,%20Clemens&rft.date=2022-12-23&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2212.12207&rft_dat=%3Cproquest_arxiv%3E2758201943%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2758201943&rft_id=info:pmid/&rfr_iscdi=true