Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics
Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2021-09 |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Chung, Jaewon Varjavand, Bijan Arroyo, Jesus Alyakin, Anton Agterberg, Joshua Tang, Minh Vogelstein, Joshua T Priebe, Carey E |
description | Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed. |
doi_str_mv | 10.48550/arxiv.1911.02741 |
format | Article |
fullrecord | <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_1911_02741</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2313450730</sourcerecordid><originalsourceid>FETCH-LOGICAL-a951-8b14af5f53966a638b233645d360363faf544a27f8da3a4098cc820f63bb66a03</originalsourceid><addsrcrecordid>eNo9kEtLAzEUhYMgWKo_wJUB11PznpllKVqFSgUHt8Odl02ZZmKSaXXrLze24urCvedczvkQuqZkJjIpyR24T72f0ZzSGWGpoGdowjinSSYYu0BX3m8JIUylTEo-Qd9v0OsGF4cheYWd7Vu8dGA3uGh90OYd7zXgtQ16Bz0uHBhvBxfwixtqN_rQegymwc9jH7Sv4d-9GJxrewh6MPigwwbPre11fVx4rE0UGNPWYdjp2l-i8w563179zSkqHu6LxWOyWi-fFvNVArmM8SsqoJOd5LlSoHhWxVZKyIYrwhXv4k0IYGmXNcBBkDyr64yRTvGqigbCp-jm9PYIqLQudnJf5S-o8ggqKm5PCuuGjzECKLfD6EzMVDJOuZAk5YT_AAFSbOE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2313450730</pqid></control><display><type>article</type><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</creator><creatorcontrib>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</creatorcontrib><description>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.1911.02741</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Brain ; Fruit flies ; Graphs ; Model testing ; Statistical methods ; Statistical tests ; Statistics - Methodology</subject><ispartof>arXiv.org, 2021-09</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.1002/sta4.429$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.1911.02741$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Chung, Jaewon</creatorcontrib><creatorcontrib>Varjavand, Bijan</creatorcontrib><creatorcontrib>Arroyo, Jesus</creatorcontrib><creatorcontrib>Alyakin, Anton</creatorcontrib><creatorcontrib>Agterberg, Joshua</creatorcontrib><creatorcontrib>Tang, Minh</creatorcontrib><creatorcontrib>Vogelstein, Joshua T</creatorcontrib><creatorcontrib>Priebe, Carey E</creatorcontrib><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><title>arXiv.org</title><description>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</description><subject>Brain</subject><subject>Fruit flies</subject><subject>Graphs</subject><subject>Model testing</subject><subject>Statistical methods</subject><subject>Statistical tests</subject><subject>Statistics - Methodology</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNo9kEtLAzEUhYMgWKo_wJUB11PznpllKVqFSgUHt8Odl02ZZmKSaXXrLze24urCvedczvkQuqZkJjIpyR24T72f0ZzSGWGpoGdowjinSSYYu0BX3m8JIUylTEo-Qd9v0OsGF4cheYWd7Vu8dGA3uGh90OYd7zXgtQ16Bz0uHBhvBxfwixtqN_rQegymwc9jH7Sv4d-9GJxrewh6MPigwwbPre11fVx4rE0UGNPWYdjp2l-i8w563179zSkqHu6LxWOyWi-fFvNVArmM8SsqoJOd5LlSoHhWxVZKyIYrwhXv4k0IYGmXNcBBkDyr64yRTvGqigbCp-jm9PYIqLQudnJf5S-o8ggqKm5PCuuGjzECKLfD6EzMVDJOuZAk5YT_AAFSbOE</recordid><startdate>20210913</startdate><enddate>20210913</enddate><creator>Chung, Jaewon</creator><creator>Varjavand, Bijan</creator><creator>Arroyo, Jesus</creator><creator>Alyakin, Anton</creator><creator>Agterberg, Joshua</creator><creator>Tang, Minh</creator><creator>Vogelstein, Joshua T</creator><creator>Priebe, Carey E</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20210913</creationdate><title>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</title><author>Chung, Jaewon ; Varjavand, Bijan ; Arroyo, Jesus ; Alyakin, Anton ; Agterberg, Joshua ; Tang, Minh ; Vogelstein, Joshua T ; Priebe, Carey E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a951-8b14af5f53966a638b233645d360363faf544a27f8da3a4098cc820f63bb66a03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Brain</topic><topic>Fruit flies</topic><topic>Graphs</topic><topic>Model testing</topic><topic>Statistical methods</topic><topic>Statistical tests</topic><topic>Statistics - Methodology</topic><toplevel>online_resources</toplevel><creatorcontrib>Chung, Jaewon</creatorcontrib><creatorcontrib>Varjavand, Bijan</creatorcontrib><creatorcontrib>Arroyo, Jesus</creatorcontrib><creatorcontrib>Alyakin, Anton</creatorcontrib><creatorcontrib>Agterberg, Joshua</creatorcontrib><creatorcontrib>Tang, Minh</creatorcontrib><creatorcontrib>Vogelstein, Joshua T</creatorcontrib><creatorcontrib>Priebe, Carey E</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chung, Jaewon</au><au>Varjavand, Bijan</au><au>Arroyo, Jesus</au><au>Alyakin, Anton</au><au>Agterberg, Joshua</au><au>Tang, Minh</au><au>Vogelstein, Joshua T</au><au>Priebe, Carey E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics</atitle><jtitle>arXiv.org</jtitle><date>2021-09-13</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>Testing whether two graphs come from the same distribution is of interest in many real world scenarios, including brain network analysis. Under the random dot product graph model, the nonparametric hypothesis testing frame-work consists of embedding the graphs using the adjacency spectral embedding (ASE), followed by aligning the embeddings using the median flip heuristic, and finally applying the nonparametric maximum mean discrepancy(MMD) test to obtain a p-value. Using synthetic data generated from Drosophila brain networks, we show that the median flip heuristic results in an invalid test, and demonstrate that optimal transport Procrustes (OTP) for alignment resolves the invalidity. We further demonstrate that substituting the MMD test with multiscale graph correlation(MGC) test leads to a more powerful test both in synthetic and in simulated data. Lastly, we apply this powerful test to the right and left hemispheres of the larval Drosophila mushroom body brain networks, and conclude that there is not sufficient evidence to reject the null hypothesis that the two hemispheres are equally distributed.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.1911.02741</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_arxiv_primary_1911_02741 |
source | arXiv.org; Free E- Journals |
subjects | Brain Fruit flies Graphs Model testing Statistical methods Statistical tests Statistics - Methodology |
title | Valid Two-Sample Graph Testing via Optimal Transport Procrustes and Multiscale Graph Correlation with Applications in Connectomics |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T16%3A20%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Valid%20Two-Sample%20Graph%20Testing%20via%20Optimal%20Transport%20Procrustes%20and%20Multiscale%20Graph%20Correlation%20with%20Applications%20in%20Connectomics&rft.jtitle=arXiv.org&rft.au=Chung,%20Jaewon&rft.date=2021-09-13&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.1911.02741&rft_dat=%3Cproquest_arxiv%3E2313450730%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2313450730&rft_id=info:pmid/&rfr_iscdi=true |