$\alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling
Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning. Since the discovery that original DARTS will inevitably converge to poor architectures, recent works alleviate this by either designing rule-based architecture selection techniques or incorporat...
Main authors: | Guo, Bicheng; Guo, Shuxuan; Shi, Miaojing; Chen, Peng; He, Shibo; Chen, Jiming; Yu, Kaicheng |
---|---|
Format: | Article |
Language: | English |
Subjects: | Computer Science - Computer Vision and Pattern Recognition |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Guo, Bicheng ; Guo, Shuxuan ; Shi, Miaojing ; Chen, Peng ; He, Shibo ; Chen, Jiming ; Yu, Kaicheng |
description | Differentiable architecture search (DARTS) has been a mainstream direction in
automatic machine learning. Since the discovery that the original DARTS
inevitably converges to poor architectures, recent works have alleviated this by
either designing rule-based architecture selection techniques or incorporating
complex regularization techniques, abandoning the simplicity of the original
DARTS, which selects architectures based on the largest parametric value, namely
$\alpha$. Moreover, we find that all previous attempts rely only on
classification labels, hence learning only single-modal information and
limiting the representation power of the shared network. To this end, we
propose to additionally inject semantic information by formulating a patch
recovery approach. Specifically, we exploit the recently popular masked image
modeling without abandoning the guidance from the downstream tasks during the
search phase. Our method surpasses all previous DARTS variants and achieves
state-of-the-art results on CIFAR-10, CIFAR-100, and ImageNet without complex
manually designed strategies. |
doi_str_mv | 10.48550/arxiv.2211.10105 |
format | Article |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2211.10105 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2211_10105 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | $\alpha$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T22%3A26%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=alpha$%20DARTS%20Once%20More:%20Enhancing%20Differentiable%20Architecture%20Search%20by%20Masked%20Image%20Modeling&rft.au=Guo,%20Bicheng&rft.date=2022-11-18&rft_id=info:doi/10.48550/arxiv.2211.10105&rft_dat=%3Carxiv_GOX%3E2211_10105%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |