A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection

Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, t...

Bibliographic details
Published in:	arXiv.org 2018-04
Main authors:	La Cava, William; Helmuth, Thomas; Spector, Lee; Moore, Jason H
Format:	Article
Language:	eng
Subjects:	Dimensional analysis; Multiple objective analysis; Optimization; Regression analysis; Statistical analysis; Training
Online access:	Full text

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H
description	Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2072235368</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2072235368</sourcerecordid><originalsourceid>FETCH-proquest_journals_20722353683</originalsourceid><addsrcrecordid>eNqNjc0KwjAQhIMgWLTvEPAciBv7cxVRfADvJa1b2BCb2k1F394q3rx4Gphvhm8mEjBmo8otwEKkzE5rDXkBWWYSYXeyH0Jta_LEkRppu4u8jj6SCrXDJtIdp876JxPL0EqPD2oso2T0bxy6zwV7Jh869YtXYt5az5h-cynWx8N5f1KT9zYix8qFcZgMXIEuAExm8tL8t3oBos1GGw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2072235368</pqid></control><display><type>article</type><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><source>Free E- Journals</source><creator>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H</creator><creatorcontrib>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason H</creatorcontrib><description>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. 
We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Dimensional analysis ; Multiple objective analysis ; Optimization ; Regression analysis ; Statistical analysis ; Training</subject><ispartof>arXiv.org, 2018-04</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>La Cava, William</creatorcontrib><creatorcontrib>Helmuth, Thomas</creatorcontrib><creatorcontrib>Spector, Lee</creatorcontrib><creatorcontrib>Moore, Jason H</creatorcontrib><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><title>arXiv.org</title><description>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. 
Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. 
We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</description><subject>Dimensional analysis</subject><subject>Multiple objective analysis</subject><subject>Optimization</subject><subject>Regression analysis</subject><subject>Statistical analysis</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjc0KwjAQhIMgWLTvEPAciBv7cxVRfADvJa1b2BCb2k1F394q3rx4Gphvhm8mEjBmo8otwEKkzE5rDXkBWWYSYXeyH0Jta_LEkRppu4u8jj6SCrXDJtIdp876JxPL0EqPD2oso2T0bxy6zwV7Jh869YtXYt5az5h-cynWx8N5f1KT9zYix8qFcZgMXIEuAExm8tL8t3oBos1GGw</recordid><startdate>20180429</startdate><enddate>20180429</enddate><creator>La Cava, William</creator><creator>Helmuth, Thomas</creator><creator>Spector, Lee</creator><creator>Moore, Jason H</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180429</creationdate><title>A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection</title><author>La Cava, William ; Helmuth, Thomas ; Spector, Lee ; Moore, Jason 
H</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20722353683</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Dimensional analysis</topic><topic>Multiple objective analysis</topic><topic>Optimization</topic><topic>Regression analysis</topic><topic>Statistical analysis</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>La Cava, William</creatorcontrib><creatorcontrib>Helmuth, Thomas</creatorcontrib><creatorcontrib>Spector, Lee</creatorcontrib><creatorcontrib>Moore, Jason H</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>La Cava, William</au><au>Helmuth, Thomas</au><au>Spector, Lee</au><au>Moore, Jason H</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>A probabilistic and multi-objective analysis of 
lexicase selection and epsilon-lexicase selection</atitle><jtitle>arXiv.org</jtitle><date>2018-04-29</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>Lexicase selection is a parent selection method that considers training cases individually, rather than in aggregate, when performing parent selection. Whereas previous work has demonstrated the ability of lexicase selection to solve difficult problems in program synthesis and symbolic regression, the central goal of this paper is to develop the theoretical underpinnings that explain its performance. To this end, we derive an analytical formula that gives the expected probabilities of selection under lexicase selection, given a population and its behavior. In addition, we expand upon the relation of lexicase selection to many-objective optimization methods to describe the behavior of lexicase selection, which is to select individuals on the boundaries of Pareto fronts in high-dimensional space. We show analytically why lexicase selection performs more poorly for certain sizes of population and training cases, and show why it has been shown to perform more poorly in continuous error spaces. To address this last concern, we propose new variants of epsilon-lexicase selection, a method that modifies the pass condition in lexicase selection to allow near-elite individuals to pass cases, thereby improving selection performance with continuous errors. We show that epsilon-lexicase outperforms several diversity-maintenance strategies on a number of real-world and synthetic regression problems.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2018-04
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2072235368
source	Free E- Journals
subjects	Dimensional analysis ; Multiple objective analysis ; Optimization ; Regression analysis ; Statistical analysis ; Training
title	A probabilistic and multi-objective analysis of lexicase selection and epsilon-lexicase selection
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T21%3A39%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=A%20probabilistic%20and%20multi-objective%20analysis%20of%20lexicase%20selection%20and%20epsilon-lexicase%20selection&rft.jtitle=arXiv.org&rft.au=La%20Cava,%20William&rft.date=2018-04-29&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2072235368%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2072235368&rft_id=info:pmid/&rfr_iscdi=true
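
The description field above says that lexicase selection considers training cases individually, and that epsilon-lexicase relaxes the pass condition so that near-elite individuals also pass a case. The following is a minimal Python sketch of that idea, not the authors' implementation. The function name `lexicase_select`, the `errors` matrix layout, and the fixed, user-supplied `epsilon` threshold are all assumptions made for illustration; the record does not say how the paper's variants define epsilon.

```python
import random


def lexicase_select(errors, epsilon=0.0, rng=random):
    """Pick one parent index from `errors`.

    errors[i][j] is the error of individual i on training case j
    (lower is better). `epsilon` is an assumed fixed tolerance:
    0.0 gives plain lexicase, a positive value lets near-elite
    individuals pass a case as well.
    """
    candidates = list(range(len(errors)))
    cases = list(range(len(errors[0])))
    rng.shuffle(cases)  # fresh random case ordering for each selection event
    for case in cases:
        if len(candidates) == 1:
            break
        # Best error on this case among the individuals still in the pool.
        best = min(errors[i][case] for i in candidates)
        # Pass condition: keep individuals within `epsilon` of the best.
        candidates = [i for i in candidates if errors[i][case] <= best + epsilon]
    # Any individuals still tied after all cases are chosen among at random.
    return rng.choice(candidates)


# Hypothetical error matrix: 3 individuals x 2 training cases.
errors = [
    [0.0, 5.0],  # specialist on case 0
    [5.0, 0.0],  # specialist on case 1
    [2.0, 2.0],  # moderate error on both cases
]
```

On this toy matrix, `lexicase_select(errors)` only ever returns 0 or 1, because whichever case comes first is won outright by its specialist. With `epsilon=2.0` the third individual survives the first case and then wins the second, so the call returns 2. This illustrates how loosening the pass condition changes which individuals can be selected when errors are continuous.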