Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata

SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on the SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons be...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sudigyo, D., Budiarto, A., Pardamean, B.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title
container_volume 2594
creator Sudigyo, D.
Budiarto, A.
Pardamean, B.
description SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on the SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons between selected countries. Therefore, the researchers conducted statistical analysis and data visualization using the Python programming language to describe and investigate SARS CoV-2 Indonesia and Malaysia from the GISAID database. SARS CoV-2 metadata from Indonesia (N=117) and Malaysia (N=250), which were gathered during 2020, were compared. This comparison was aimed to investigate the discrepancies of COVID-19 cases in closely related populations. Firstly, data visualization was conducted using the Python Matplotlib library to create bar charts for clades and mutation comparison. Additionally, a series of boxplots were generated to show age discrepancies stratified by gender. Furthermore, the statistical tests showed that only the dominant Malaysian (G and O) clades were found to be significantly different compared to Indonesian cases (p-value=0.016). The proportion of two major mutations (G614D and NSP12 P323L) were also significantly different in the two countries caused by the dominant clade differences (p-value=0.007). Lastly, the differences in the age distribution of COVID-19 cases between the two countries were significant only in the male group (p-value=0.017).
doi_str_mv 10.1063/5.0109186
format Conference Proceeding
fullrecord <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_2803642815</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2803642815</sourcerecordid><originalsourceid>FETCH-LOGICAL-p168t-3844183451ea45340b6258c09c00abcf176afdfb7041db2ee80dddfa118aac623</originalsourceid><addsrcrecordid>eNp90E1Lw0AQBuBFFKzVg_8g4E1IndmvbI6l-FGoCFZBT8skm0BKmsTstlB_vWkrePM0M_DMwLyMXSNMELS4UxNASNHoEzZCpTBONOpTNgJIZcyl-DhnF96vAHiaJGbEPpeBQuVDlVMdUUP1zld-aFzkKFC0rfyG6up7MG0TtWU0b1zbFL6ig3mmmnb7YTl9XUazdhvzaF0E2u9esrOSal9c_dYxe3-4f5s9xYuXx_lsuog71CbEwkiJRkiFBUklJGSaK5NDmgNQlpeYaCpdmSUg0WW8KAw450pCNES55mLMbo53u7792hQ-2FW76YdPvOUGhJbcoBrU7VH5vAqHb2zXV2vqdxbB7qOzyv5G9x_etv0ftJ0rxQ_2IW7i</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>2803642815</pqid></control><display><type>conference_proceeding</type><title>Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata</title><source>AIP Journals Complete</source><creator>Sudigyo, D. ; Budiarto, A. ; Pardamean, B.</creator><contributor>Sitepu, Elioenai ; Noerlina ; Harito, Christian</contributor><creatorcontrib>Sudigyo, D. ; Budiarto, A. ; Pardamean, B. ; Sitepu, Elioenai ; Noerlina ; Harito, Christian</creatorcontrib><description>SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on the SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons between selected countries. Therefore, the researchers conducted statistical analysis and data visualization using the Python programming language to describe and investigate SARS CoV-2 Indonesia and Malaysia from the GISAID database. SARS CoV-2 metadata from Indonesia (N=117) and Malaysia (N=250), which were gathered during 2020, were compared. This comparison was aimed to investigate the discrepancies of COVID-19 cases in closely related populations. Firstly, data visualization was conducted using the Python Matplotlib library to create bar charts for clades and mutation comparison. Additionally, a series of boxplots were generated to show age discrepancies stratified by gender. Furthermore, the statistical tests showed that only the dominant Malaysian (G and O) clades were found to be significantly different compared to Indonesian cases (p-value=0.016). The proportion of two major mutations (G614D and NSP12 P323L) were also significantly different in the two countries caused by the dominant clade differences (p-value=0.007). Lastly, the differences in the age distribution of COVID-19 cases between the two countries were significant only in the male group (p-value=0.017).</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0109186</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Coronaviruses ; COVID-19 ; Data analysis ; Demographics ; Metadata ; Mutation ; Programming languages ; Scientific visualization ; Statistical analysis ; Statistical tests ; Visualization</subject><ispartof>AIP conference proceedings, 2023, Vol.2594 (1)</ispartof><rights>Author(s)</rights><rights>2023 Author(s). Published by AIP Publishing.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/acp/article-lookup/doi/10.1063/5.0109186$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>309,310,314,776,780,785,786,790,4498,23909,23910,25118,27901,27902,76353</link.rule.ids></links><search><contributor>Sitepu, Elioenai</contributor><contributor>Noerlina</contributor><contributor>Harito, Christian</contributor><creatorcontrib>Sudigyo, D.</creatorcontrib><creatorcontrib>Budiarto, A.</creatorcontrib><creatorcontrib>Pardamean, B.</creatorcontrib><title>Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata</title><title>AIP conference proceedings</title><description>SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on the SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons between selected countries. Therefore, the researchers conducted statistical analysis and data visualization using the Python programming language to describe and investigate SARS CoV-2 Indonesia and Malaysia from the GISAID database. SARS CoV-2 metadata from Indonesia (N=117) and Malaysia (N=250), which were gathered during 2020, were compared. This comparison was aimed to investigate the discrepancies of COVID-19 cases in closely related populations. Firstly, data visualization was conducted using the Python Matplotlib library to create bar charts for clades and mutation comparison. Additionally, a series of boxplots were generated to show age discrepancies stratified by gender. Furthermore, the statistical tests showed that only the dominant Malaysian (G and O) clades were found to be significantly different compared to Indonesian cases (p-value=0.016). The proportion of two major mutations (G614D and NSP12 P323L) were also significantly different in the two countries caused by the dominant clade differences (p-value=0.007). Lastly, the differences in the age distribution of COVID-19 cases between the two countries were significant only in the male group (p-value=0.017).</description><subject>Coronaviruses</subject><subject>COVID-19</subject><subject>Data analysis</subject><subject>Demographics</subject><subject>Metadata</subject><subject>Mutation</subject><subject>Programming languages</subject><subject>Scientific visualization</subject><subject>Statistical analysis</subject><subject>Statistical tests</subject><subject>Visualization</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNp90E1Lw0AQBuBFFKzVg_8g4E1IndmvbI6l-FGoCFZBT8skm0BKmsTstlB_vWkrePM0M_DMwLyMXSNMELS4UxNASNHoEzZCpTBONOpTNgJIZcyl-DhnF96vAHiaJGbEPpeBQuVDlVMdUUP1zld-aFzkKFC0rfyG6up7MG0TtWU0b1zbFL6ig3mmmnb7YTl9XUazdhvzaF0E2u9esrOSal9c_dYxe3-4f5s9xYuXx_lsuog71CbEwkiJRkiFBUklJGSaK5NDmgNQlpeYaCpdmSUg0WW8KAw450pCNES55mLMbo53u7792hQ-2FW76YdPvOUGhJbcoBrU7VH5vAqHb2zXV2vqdxbB7qOzyv5G9x_etv0ftJ0rxQ_2IW7i</recordid><startdate>20230420</startdate><enddate>20230420</enddate><creator>Sudigyo, D.</creator><creator>Budiarto, A.</creator><creator>Pardamean, B.</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20230420</creationdate><title>Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata</title><author>Sudigyo, D. ; Budiarto, A. ; Pardamean, B.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p168t-3844183451ea45340b6258c09c00abcf176afdfb7041db2ee80dddfa118aac623</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Coronaviruses</topic><topic>COVID-19</topic><topic>Data analysis</topic><topic>Demographics</topic><topic>Metadata</topic><topic>Mutation</topic><topic>Programming languages</topic><topic>Scientific visualization</topic><topic>Statistical analysis</topic><topic>Statistical tests</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sudigyo, D.</creatorcontrib><creatorcontrib>Budiarto, A.</creatorcontrib><creatorcontrib>Pardamean, B.</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sudigyo, D.</au><au>Budiarto, A.</au><au>Pardamean, B.</au><au>Sitepu, Elioenai</au><au>Noerlina</au><au>Harito, Christian</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata</atitle><btitle>AIP conference proceedings</btitle><date>2023-04-20</date><risdate>2023</risdate><volume>2594</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on the SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons between selected countries. Therefore, the researchers conducted statistical analysis and data visualization using the Python programming language to describe and investigate SARS CoV-2 Indonesia and Malaysia from the GISAID database. SARS CoV-2 metadata from Indonesia (N=117) and Malaysia (N=250), which were gathered during 2020, were compared. This comparison was aimed to investigate the discrepancies of COVID-19 cases in closely related populations. Firstly, data visualization was conducted using the Python Matplotlib library to create bar charts for clades and mutation comparison. Additionally, a series of boxplots were generated to show age discrepancies stratified by gender. Furthermore, the statistical tests showed that only the dominant Malaysian (G and O) clades were found to be significantly different compared to Indonesian cases (p-value=0.016). The proportion of two major mutations (G614D and NSP12 P323L) were also significantly different in the two countries caused by the dominant clade differences (p-value=0.007). Lastly, the differences in the age distribution of COVID-19 cases between the two countries were significant only in the male group (p-value=0.017).</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0109186</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0094-243X
ispartof AIP conference proceedings, 2023, Vol.2594 (1)
issn 0094-243X
1551-7616
language eng
recordid cdi_proquest_journals_2803642815
source AIP Journals Complete
subjects Coronaviruses
COVID-19
Data analysis
Demographics
Metadata
Mutation
Programming languages
Scientific visualization
Statistical analysis
Statistical tests
Visualization
title Statistical analysis and data visualization of Indonesia and Malaysia SARS Cov-2 metadata
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T00%3A38%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Statistical%20analysis%20and%20data%20visualization%20of%20Indonesia%20and%20Malaysia%20SARS%20Cov-2%20metadata&rft.btitle=AIP%20conference%20proceedings&rft.au=Sudigyo,%20D.&rft.date=2023-04-20&rft.volume=2594&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0109186&rft_dat=%3Cproquest_scita%3E2803642815%3C/proquest_scita%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2803642815&rft_id=info:pmid/&rfr_iscdi=true