A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection
Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, t...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2018-04 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | La Cava, William Helmuth, Thomas Spector, Lee Moore, Jason H |
description | Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2072235368</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2072235368</sourcerecordid><originalsourceid>FETCH-proquest_journals_20722353683</originalsourceid><addsrcrecordid>eNqNjc0KwjAQhIMgWLTvEPAciBv7cxVRfADvJa1b2BCb2k1F394q3rx4Gphvhm8mEjBmo8otwEKkzE5rDXkBWWYSYXeyH0Jta_LEkRppu4u8jj6SCrXDJtIdp876JxPL0EqPD2oso2T0bxy6zwV7Jh869YtXYt5az5h-cynWx8N5f1KT9zYix8qFcZgMXIEuAExm8tL8t3oBos1GGw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2072235368</pqid></control><display><type>article</type><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><source>Free E- Journals</source><creator>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H</creator><creatorcontrib>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H</creatorcontrib><description>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Dimensional analysis ; Multiple objective analysis ; Optimization ; Regression analysis ; Statistical analysis ; Training</subject><ispartof>arXiv.org, 2018-04</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>La Cava, William</creatorcontrib><creatorcontrib>Helmuth, Thomas</creatorcontrib><creatorcontrib>Spector, Lee</creatorcontrib><creatorcontrib>Moore, Jason H</creatorcontrib><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><title>arXiv.org</title><description>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</description><subject>Dimensional analysis</subject><subject>Multiple objective analysis</subject><subject>Optimization</subject><subject>Regression analysis</subject><subject>Statistical analysis</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjc0KwjAQhIMgWLTvEPAciBv7cxVRfADvJa1b2BCb2k1F394q3rx4Gphvhm8mEjBmo8otwEKkzE5rDXkBWWYSYXeyH0Jta_LEkRppu4u8jj6SCrXDJtIdp876JxPL0EqPD2oso2T0bxy6zwV7Jh869YtXYt5az5h-cynWx8N5f1KT9zYix8qFcZgMXIEuAExm8tL8t3oBos1GGw</recordid><startdate>20180429</startdate><enddate>20180429</enddate><creator>La Cava, William</creator><creator>Helmuth, Thomas</creator><creator>Spector, Lee</creator><creator>Moore, Jason H</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180429</creationdate><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><author>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20722353683</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Dimensional analysis</topic><topic>Multiple objective analysis</topic><topic>Optimization</topic><topic>Regression analysis</topic><topic>Statistical analysis</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>La Cava, William</creatorcontrib><creatorcontrib>Helmuth, Thomas</creatorcontrib><creatorcontrib>Spector, Lee</creatorcontrib><creatorcontrib>Moore, Jason H</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>La Cava, William</au><au>Helmuth, Thomas</au><au>Spector, Lee</au><au>Moore, Jason H</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</atitle><jtitle>arXiv.org</jtitle><date>2018-04-29</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2018-04 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2072235368 |
source | Free E- Journals |
subjects | Dimensional analysis Multiple objective analysis Optimization Regression analysis Statistical analysis Training |
title | A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T21%3A39%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=A%20probabilistic%20and%20multi-objective%20analysis%20of%20lexicase%20selection%20and%20epsilon-lexicase%20selection&rft.jtitle=arXiv.org&rft.au=La%20Cava,%20William&rft.date=2018-04-29&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2072235368%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2072235368&rft_id=info:pmid/&rfr_iscdi=true |