$\alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling

Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporat...

Full description

Bibliographic details
Main authors:	Guo, Bicheng; Guo, Shuxuan; Shi, Miaojing; Chen, Peng; He, Shibo; Chen, Jiming; Yu, Kaicheng
Format:	Article
Language:	eng
Subjects:	Computer Science - Computer Vision and Pattern Recognition
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Guo, Bicheng ; Guo, Shuxuan ; Shi, Miaojing ; Chen, Peng ; He, Shibo ; Chen, Jiming ; Yu, Kaicheng
description	Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporating complex regularization techniques, abandoning the simplicity of the original DARTS that selects architectures based on the largest parametric value, namely $\alpha$. Moreover, we find that all the previous attempts only rely on classification labels, hence learning only single modal information and limiting the representation power of the shared network. To this end, we propose to additionally inject semantic information by formulating a patch recovery approach. Specifically, we exploit the recent trending masked image modeling and do not abandon the guidance from the downstream tasks during the search phase. Our method surpasses all previous DARTS variants and achieves state-of-the-art results on CIFAR-10, CIFAR-100, and ImageNet without complex manual-designed strategies.
doi_str_mv	10.48550/arxiv.2211.10105
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2211_10105</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2211_10105</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-a55fa0a2120ac4ccdb57b46266121765cae204de8af1047cf44dc2314c6332663</originalsourceid><addsrcrecordid>eNotj01Lw0AURWfjQqo_wJWzcJs43ynuQlu10FKw2Up4efOmGUxjmUax_960urpcuPfAYexOitxMrRWPkH7id66UlLkUUthr9g7doYUHPi_fqi3f9Eh8_ZnoiS_6FnqM_Y7PYwiUqB8iNB3xMmEbB8LhKxHfEoyVNye-huMHeb7cw-6M8NSN3xt2FaA70u1_Tlj1vKhmr9lq87KclasMXGEzsDaAACWVADSIvrFFY5xyTipZOItAShhPUwhSmAKDMR6Vlgad1uNKT9j9H_biVx9S3EM61WfP-uKpfwGAY0xb</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling</title><source>arXiv.org</source><creator>Guo, Bicheng ; Guo, Shuxuan ; Shi, Miaojing ; Chen, Peng ; He, Shibo ; Chen, Jiming ; Yu, Kaicheng</creator><creatorcontrib>Guo, Bicheng ; Guo, Shuxuan ; Shi, Miaojing ; Chen, Peng ; He, Shibo ; Chen, Jiming ; Yu, Kaicheng</creatorcontrib><description>Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporating complex regularization techniques, abandoning the simplicity of the original DARTS that selects architectures based on the largest parametric value, namely $\alpha$. Moreover, we find that all the previous attempts only rely on classification labels, hence learning only single modal information and limiting the representation power of the shared network. To this end, we propose to additionally inject semantic information by formulating a patch recovery approach. 
Specifically, we exploit the recent trending masked image modeling and do not abandon the guidance from the downstream tasks during the search phase. Our method surpasses all previous DARTS variants and achieves state-of-the-art results on CIFAR-10, CIFAR-100, and ImageNet without complex manual-designed strategies.</description><identifier>DOI: 10.48550/arxiv.2211.10105</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2022-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2211.10105$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2211.10105$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Guo, Bicheng</creatorcontrib><creatorcontrib>Guo, Shuxuan</creatorcontrib><creatorcontrib>Shi, Miaojing</creatorcontrib><creatorcontrib>Chen, Peng</creatorcontrib><creatorcontrib>He, Shibo</creatorcontrib><creatorcontrib>Chen, Jiming</creatorcontrib><creatorcontrib>Yu, Kaicheng</creatorcontrib><title>alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling</title><description>Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. 
Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporating complex regularization techniques, abandoning the simplicity of the original DARTS that selects architectures based on the largest parametric value, namely $\alpha$. Moreover, we find that all the previous attempts only rely on classification labels, hence learning only single modal information and limiting the representation power of the shared network. To this end, we propose to additionally inject semantic information by formulating a patch recovery approach. Specifically, we exploit the recent trending masked image modeling and do not abandon the guidance from the downstream tasks during the search phase. Our method surpasses all previous DARTS variants and achieves state-of-the-art results on CIFAR-10, CIFAR-100, and ImageNet without complex manual-designed strategies.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj01Lw0AURWfjQqo_wJWzcJs43ynuQlu10FKw2Up4efOmGUxjmUax_960urpcuPfAYexOitxMrRWPkH7id66UlLkUUthr9g7doYUHPi_fqi3f9Eh8_ZnoiS_6FnqM_Y7PYwiUqB8iNB3xMmEbB8LhKxHfEoyVNye-huMHeb7cw-6M8NSN3xt2FaA70u1_Tlj1vKhmr9lq87KclasMXGEzsDaAACWVADSIvrFFY5xyTipZOItAShhPUwhSmAKDMR6Vlgad1uNKT9j9H_biVx9S3EM61WfP-uKpfwGAY0xb</recordid><startdate>20221118</startdate><enddate>20221118</enddate><creator>Guo, Bicheng</creator><creator>Guo, Shuxuan</creator><creator>Shi, Miaojing</creator><creator>Chen, Peng</creator><creator>He, Shibo</creator><creator>Chen, Jiming</creator><creator>Yu, Kaicheng</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221118</creationdate><title>alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image 
Modeling</title><author>Guo, Bicheng ; Guo, Shuxuan ; Shi, Miaojing ; Chen, Peng ; He, Shibo ; Chen, Jiming ; Yu, Kaicheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-a55fa0a2120ac4ccdb57b46266121765cae204de8af1047cf44dc2314c6332663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Guo, Bicheng</creatorcontrib><creatorcontrib>Guo, Shuxuan</creatorcontrib><creatorcontrib>Shi, Miaojing</creatorcontrib><creatorcontrib>Chen, Peng</creatorcontrib><creatorcontrib>He, Shibo</creatorcontrib><creatorcontrib>Chen, Jiming</creatorcontrib><creatorcontrib>Yu, Kaicheng</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Guo, Bicheng</au><au>Guo, Shuxuan</au><au>Shi, Miaojing</au><au>Chen, Peng</au><au>He, Shibo</au><au>Chen, Jiming</au><au>Yu, Kaicheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling</atitle><date>2022-11-18</date><risdate>2022</risdate><abstract>Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporating complex regularization techniques, abandoning the simplicity of the original DARTS that selects architectures based on the largest parametric value, namely $\alpha$. 
Moreover, we find that all the previous attempts only rely on classification labels, hence learning only single modal information and limiting the representation power of the shared network. To this end, we propose to additionally inject semantic information by formulating a patch recovery approach. Specifically, we exploit the recent trending masked image modeling and do not abandon the guidance from the downstream tasks during the search phase. Our method surpasses all previous DARTS variants and achieves state-of-the-art results on CIFAR-10, CIFAR-100, and ImageNet without complex manual-designed strategies.</abstract><doi>10.48550/arxiv.2211.10105</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2211.10105
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2211_10105
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	$\alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T22%3A26%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=alpha$%20DARTS%20Once%20More:%20Enhancing%20Differentiable%20Architecture%20Search%20by%20Masked%20Image%20Modeling&rft.au=Guo,%20Bicheng&rft.date=2022-11-18&rft_id=info:doi/10.48550/arxiv.2211.10105&rft_dat=%3Carxiv_GOX%3E2211_10105%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true