Skeletal Program Enumeration for Rigorous Compiler Testing

A program can be viewed as a syntactic structure P (syntactic skeleton) parameterized by a collection of the identifiers V (variable names). This paper introduces the skeletal program enumeration (SPE) problem: Given a fixed syntactic skeleton P and a set of variables V , enumerate a set of programs...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhang, Qirun, Sun, Chengnian, Su, Zhendong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Zhang, Qirun
Sun, Chengnian
Su, Zhendong
description A program can be viewed as a syntactic structure P (syntactic skeleton) parameterized by a collection of the identifiers V (variable names). This paper introduces the skeletal program enumeration (SPE) problem: Given a fixed syntactic skeleton P and a set of variables V , enumerate a set of programs P exhibiting all possible variable usage patterns within P. It proposes an effective realization of SPE for systematic, rigorous compiler testing by leveraging three important observations: (1) Programs with different variable usage patterns exhibit diverse control- and data-dependence information, and help exploit different compiler optimizations and stress-test compilers; (2) most real compiler bugs were revealed by small tests (i.e., small-sized P) --- this "small-scope" observation opens up SPE for practical compiler validation; and (3) SPE is exhaustive w.r.t. a given syntactic skeleton and variable set, and thus can offer a level of guarantee that is absent from all existing compiler testing techniques. The key challenge of SPE is how to eliminate the enormous amount of equivalent programs w.r.t. $\alpha$-conversion. Our main technical contribution is a novel algorithm for computing the canonical (and smallest) set of all non-$\alpha$-equivalent programs. We have realized our SPE technique and evaluated it using syntactic skeletons derived from GCC's testsuite. Our evaluation results on testing GCC and Clang are extremely promising. In less than six months, our approach has led to 217 confirmed bug reports, 104 of which have already been fixed, and the majority are long latent bugs despite the extensive prior efforts of automatically testing both compilers (e.g., Csmith and EMI). The results also show that our algorithm for enumerating non-$\alpha$-equivalent programs provides six orders of magnitude reduction, enabling processing the GCC test-suite in under a month.
doi_str_mv 10.48550/arxiv.1610.03148
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1610_03148</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1610_03148</sourcerecordid><originalsourceid>FETCH-LOGICAL-a678-39613d81a2af2bf6a8a5af8e0a32ba7b56abefe6831bea1e4920da5e7f7f5f423</originalsourceid><addsrcrecordid>eNotj81Kw0AURmfjQmofwJXzAqnzk5lM3UlotVBQNPtwh94bBpNMuUlF315bXR34FofvCHGr1aoMzql74K_0udL-d1BWl-FaPLx_YI8z9PKVc8cwyM14GpBhTnmUlFm-pS5zPk2yzsMx9ciywWlOY3cjrgj6CZf_XIhmu2nq52L_8rSrH_cF-CoUdu21PQQNBshE8hDAAQVUYE2EKjoPEQl9sDoiaCzXRh3AYUUVOSqNXYi7P-3lfHvkNAB_t-eI9hJhfwCgM0La</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Skeletal Program Enumeration for Rigorous Compiler Testing</title><source>arXiv.org</source><creator>Zhang, Qirun ; Sun, Chengnian ; Su, Zhendong</creator><creatorcontrib>Zhang, Qirun ; Sun, Chengnian ; Su, Zhendong</creatorcontrib><description>A program can be viewed as a syntactic structure P (syntactic skeleton) parameterized by a collection of the identifiers V (variable names). This paper introduces the skeletal program enumeration (SPE) problem: Given a fixed syntactic skeleton P and a set of variables V , enumerate a set of programs P exhibiting all possible variable usage patterns within P. It proposes an effective realization of SPE for systematic, rigorous compiler testing by leveraging three important observations: (1) Programs with different variable usage patterns exhibit diverse control- and data-dependence information, and help exploit different compiler optimizations and stress-test compilers; (2) most real compiler bugs were revealed by small tests (i.e., small-sized P) --- this "small-scope" observation opens up SPE for practical compiler validation; and (3) SPE is exhaustive w.r.t. a given syntactic skeleton and variable set, and thus can offer a level of guarantee that is absent from all existing compiler testing techniques. The key challenge of SPE is how to eliminate the enormous amount of equivalent programs w.r.t. $\alpha$-conversion. Our main technical contribution is a novel algorithm for computing the canonical (and smallest) set of all non-$\alpha$-equivalent programs. We have realized our SPE technique and evaluated it using syntactic skeletons derived from GCC's testsuite. Our evaluation results on testing GCC and Clang are extremely promising. In less than six months, our approach has led to 217 confirmed bug reports, 104 of which have already been fixed, and the majority are long latent bugs despite the extensive prior efforts of automatically testing both compilers (e.g., Csmith and EMI). The results also show that our algorithm for enumerating non-$\alpha$-equivalent programs provides six orders of magnitude reduction, enabling processing the GCC test-suite in under a month.</description><identifier>DOI: 10.48550/arxiv.1610.03148</identifier><language>eng</language><subject>Computer Science - Programming Languages</subject><creationdate>2016-10</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1610.03148$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1610.03148$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhang, Qirun</creatorcontrib><creatorcontrib>Sun, Chengnian</creatorcontrib><creatorcontrib>Su, Zhendong</creatorcontrib><title>Skeletal Program Enumeration for Rigorous Compiler Testing</title><description>A program can be viewed as a syntactic structure P (syntactic skeleton) parameterized by a collection of the identifiers V (variable names). This paper introduces the skeletal program enumeration (SPE) problem: Given a fixed syntactic skeleton P and a set of variables V , enumerate a set of programs P exhibiting all possible variable usage patterns within P. It proposes an effective realization of SPE for systematic, rigorous compiler testing by leveraging three important observations: (1) Programs with different variable usage patterns exhibit diverse control- and data-dependence information, and help exploit different compiler optimizations and stress-test compilers; (2) most real compiler bugs were revealed by small tests (i.e., small-sized P) --- this "small-scope" observation opens up SPE for practical compiler validation; and (3) SPE is exhaustive w.r.t. a given syntactic skeleton and variable set, and thus can offer a level of guarantee that is absent from all existing compiler testing techniques. The key challenge of SPE is how to eliminate the enormous amount of equivalent programs w.r.t. $\alpha$-conversion. Our main technical contribution is a novel algorithm for computing the canonical (and smallest) set of all non-$\alpha$-equivalent programs. We have realized our SPE technique and evaluated it using syntactic skeletons derived from GCC's testsuite. Our evaluation results on testing GCC and Clang are extremely promising. In less than six months, our approach has led to 217 confirmed bug reports, 104 of which have already been fixed, and the majority are long latent bugs despite the extensive prior efforts of automatically testing both compilers (e.g., Csmith and EMI). The results also show that our algorithm for enumerating non-$\alpha$-equivalent programs provides six orders of magnitude reduction, enabling processing the GCC test-suite in under a month.</description><subject>Computer Science - Programming Languages</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj81Kw0AURmfjQmofwJXzAqnzk5lM3UlotVBQNPtwh94bBpNMuUlF315bXR34FofvCHGr1aoMzql74K_0udL-d1BWl-FaPLx_YI8z9PKVc8cwyM14GpBhTnmUlFm-pS5zPk2yzsMx9ciywWlOY3cjrgj6CZf_XIhmu2nq52L_8rSrH_cF-CoUdu21PQQNBshE8hDAAQVUYE2EKjoPEQl9sDoiaCzXRh3AYUUVOSqNXYi7P-3lfHvkNAB_t-eI9hJhfwCgM0La</recordid><startdate>20161010</startdate><enddate>20161010</enddate><creator>Zhang, Qirun</creator><creator>Sun, Chengnian</creator><creator>Su, Zhendong</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20161010</creationdate><title>Skeletal Program Enumeration for Rigorous Compiler Testing</title><author>Zhang, Qirun ; Sun, Chengnian ; Su, Zhendong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a678-39613d81a2af2bf6a8a5af8e0a32ba7b56abefe6831bea1e4920da5e7f7f5f423</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Computer Science - Programming Languages</topic><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Qirun</creatorcontrib><creatorcontrib>Sun, Chengnian</creatorcontrib><creatorcontrib>Su, Zhendong</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhang, Qirun</au><au>Sun, Chengnian</au><au>Su, Zhendong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Skeletal Program Enumeration for Rigorous Compiler Testing</atitle><date>2016-10-10</date><risdate>2016</risdate><abstract>A program can be viewed as a syntactic structure P (syntactic skeleton) parameterized by a collection of the identifiers V (variable names). This paper introduces the skeletal program enumeration (SPE) problem: Given a fixed syntactic skeleton P and a set of variables V , enumerate a set of programs P exhibiting all possible variable usage patterns within P. It proposes an effective realization of SPE for systematic, rigorous compiler testing by leveraging three important observations: (1) Programs with different variable usage patterns exhibit diverse control- and data-dependence information, and help exploit different compiler optimizations and stress-test compilers; (2) most real compiler bugs were revealed by small tests (i.e., small-sized P) --- this "small-scope" observation opens up SPE for practical compiler validation; and (3) SPE is exhaustive w.r.t. a given syntactic skeleton and variable set, and thus can offer a level of guarantee that is absent from all existing compiler testing techniques. The key challenge of SPE is how to eliminate the enormous amount of equivalent programs w.r.t. $\alpha$-conversion. Our main technical contribution is a novel algorithm for computing the canonical (and smallest) set of all non-$\alpha$-equivalent programs. We have realized our SPE technique and evaluated it using syntactic skeletons derived from GCC's testsuite. Our evaluation results on testing GCC and Clang are extremely promising. In less than six months, our approach has led to 217 confirmed bug reports, 104 of which have already been fixed, and the majority are long latent bugs despite the extensive prior efforts of automatically testing both compilers (e.g., Csmith and EMI). The results also show that our algorithm for enumerating non-$\alpha$-equivalent programs provides six orders of magnitude reduction, enabling processing the GCC test-suite in under a month.</abstract><doi>10.48550/arxiv.1610.03148</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1610.03148
ispartof
issn
language eng
recordid cdi_arxiv_primary_1610_03148
source arXiv.org
subjects Computer Science - Programming Languages
title Skeletal Program Enumeration for Rigorous Compiler Testing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T02%3A15%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Skeletal%20Program%20Enumeration%20for%20Rigorous%20Compiler%20Testing&rft.au=Zhang,%20Qirun&rft.date=2016-10-10&rft_id=info:doi/10.48550/arxiv.1610.03148&rft_dat=%3Carxiv_GOX%3E1610_03148%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true