Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics

Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2021-09
Hauptverfasser: Chung, Jaewon, Varjavand, Bijan, Arroyo, Jesus, Alyakin, Anton, Agterberg, Joshua, Tang, Minh, Vogelstein, Joshua T, Priebe, Carey E
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Chung, Jaewon
Varjavand, Bijan
Arroyo, Jesus
Alyakin, Anton
Agterberg, Joshua
Tang, Minh
Vogelstein, Joshua T
Priebe, Carey E
description Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.
doi_str_mv 10.48550/arxiv.1911.02741
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_1911_02741</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2313450730</sourcerecordid><originalsourceid>FETCH-LOGICAL-a951-8b14af5f53966a638b233645d360363faf544a27f8da3a4098cc820f63bb66a03</originalsourceid><addsrcrecordid>eNo9kEtLAzEUhYMgWKo_wJUB11PznpllKVqFSgUHt8Odl02ZZmKSaXXrLze24urCvedczvkQuqZkJjIpyR24T72f0ZzSGWGpoGdowjinSSYYu0BX3m8JIUylTEo-Qd9v0OsGF4cheYWd7Vu8dGA3uGh90OYd7zXgtQ16Bz0uHBhvBxfwixtqN_rQegymwc9jH7Sv4d-9GJxrewh6MPigwwbPre11fVx4rE0UGNPWYdjp2l-i8w563179zSkqHu6LxWOyWi-fFvNVArmM8SsqoJOd5LlSoHhWxVZKyIYrwhXv4k0IYGmXNcBBkDyr64yRTvGqigbCp-jm9PYIqLQudnJf5S-o8ggqKm5PCuuGjzECKLfD6EzMVDJOuZAk5YT_AAFSbOE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2313450730</pqid></control><display><type>article</type><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</creator><creatorcontrib>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</creatorcontrib><description>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.1911.02741</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Brain ; Fruit flies ; Graphs ; Model testing ; Statistical methods ; Statistical tests ; Statistics - Methodology</subject><ispartof>arXiv.org, 2021-09</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.1002/sta4.429$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.1911.02741$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Chung, Jaewon</creatorcontrib><creatorcontrib>Varjavand, Bijan</creatorcontrib><creatorcontrib>Arroyo, Jesus</creatorcontrib><creatorcontrib>Alyakin, Anton</creatorcontrib><creatorcontrib>Agterberg, Joshua</creatorcontrib><creatorcontrib>Tang, Minh</creatorcontrib><creatorcontrib>Vogelstein, Joshua T</creatorcontrib><creatorcontrib>Priebe, Carey E</creatorcontrib><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><title>arXiv.org</title><description>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</description><subject>Brain</subject><subject>Fruit flies</subject><subject>Graphs</subject><subject>Model testing</subject><subject>Statistical methods</subject><subject>Statistical tests</subject><subject>Statistics - Methodology</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNo9kEtLAzEUhYMgWKo_wJUB11PznpllKVqFSgUHt8Odl02ZZmKSaXXrLze24urCvedczvkQuqZkJjIpyR24T72f0ZzSGWGpoGdowjinSSYYu0BX3m8JIUylTEo-Qd9v0OsGF4cheYWd7Vu8dGA3uGh90OYd7zXgtQ16Bz0uHBhvBxfwixtqN_rQegymwc9jH7Sv4d-9GJxrewh6MPigwwbPre11fVx4rE0UGNPWYdjp2l-i8w563179zSkqHu6LxWOyWi-fFvNVArmM8SsqoJOd5LlSoHhWxVZKyIYrwhXv4k0IYGmXNcBBkDyr64yRTvGqigbCp-jm9PYIqLQudnJf5S-o8ggqKm5PCuuGjzECKLfD6EzMVDJOuZAk5YT_AAFSbOE</recordid><startdate>20210913</startdate><enddate>20210913</enddate><creator>Chung, Jaewon</creator><creator>Varjavand, Bijan</creator><creator>Arroyo, Jesus</creator><creator>Alyakin, Anton</creator><creator>Agterberg, Joshua</creator><creator>Tang, Minh</creator><creator>Vogelstein, Joshua T</creator><creator>Priebe, Carey E</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20210913</creationdate><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><author>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a951-8b14af5f53966a638b233645d360363faf544a27f8da3a4098cc820f63bb66a03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Brain</topic><topic>Fruit flies</topic><topic>Graphs</topic><topic>Model testing</topic><topic>Statistical methods</topic><topic>Statistical tests</topic><topic>Statistics - Methodology</topic><toplevel>online_resources</toplevel><creatorcontrib>Chung, Jaewon</creatorcontrib><creatorcontrib>Varjavand, Bijan</creatorcontrib><creatorcontrib>Arroyo, Jesus</creatorcontrib><creatorcontrib>Alyakin, Anton</creatorcontrib><creatorcontrib>Agterberg, Joshua</creatorcontrib><creatorcontrib>Tang, Minh</creatorcontrib><creatorcontrib>Vogelstein, Joshua T</creatorcontrib><creatorcontrib>Priebe, Carey E</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chung, Jaewon</au><au>Varjavand, Bijan</au><au>Arroyo, Jesus</au><au>Alyakin, Anton</au><au>Agterberg, Joshua</au><au>Tang, Minh</au><au>Vogelstein, Joshua T</au><au>Priebe, Carey E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</atitle><jtitle>arXiv.org</jtitle><date>2021-09-13</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.1911.02741</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-09
issn 2331-8422
language eng
recordid cdi_arxiv_primary_1911_02741
source arXiv.org; Free E- Journals
subjects Brain
Fruit flies
Graphs
Model testing
Statistical methods
Statistical tests
Statistics - Methodology
title Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T16%3A20%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Valid%20Two-Sample%20Graph%20Testing%20via%20Optimal%20Transport%20Procrustes%20and%20Multiscale%20Graph%20Correlation%20with%20Applications%20in%20Connectomics&rft.jtitle=arXiv.org&rft.au=Chung,%20Jaewon&rft.date=2021-09-13&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.1911.02741&rft_dat=%3Cproquest_arxiv%3E2313450730%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2313450730&rft_id=info:pmid/&rfr_iscdi=true