Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept

Wing shape variation has been shown to be useful for delineating forensically important fly species in two Diptera families: Calliphoridae and Sarcophagidae. Compared to DNA-based identification, the cost of geometric morphometric data acquisition and analysis is relatively much lower because the to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Khang, Tsung Fei, Mohd Puaad, Nur Ayuni Dayana, Teh, Ser Huy, Mohamed, Zulqarnain
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Khang, Tsung Fei
Mohd Puaad, Nur Ayuni Dayana
Teh, Ser Huy
Mohamed, Zulqarnain
description Wing shape variation has been shown to be useful for delineating forensically important fly species in two Diptera families: Calliphoridae and Sarcophagidae. Compared to DNA-based identification, the cost of geometric morphometric data acquisition and analysis is relatively much lower because the tools required are basic, and stable softwares are available. However, to date, an explicit demonstration of using wing geometric morphometric data for species identity prediction in these two families remains lacking. Here, geometric morphometric data from 19 homologous landmarks on the left wing of males from seven species of Calliphoridae (n=55), and eight species of Sarcophagidae (n=40) were obtained and processed using Generalized Procrustes Analysis. Allometric effect was removed by regressing centroid size (in log10) against the Procrustes coordinates. Subsequently, principal component analysis of the allometry-adjusted Procrustes variables was done, with the first 15 principal components used to train a random forests model for species prediction. Using a real test sample consisting of 33 male fly specimens collected around a human corpse at a crime scene, the estimated percentage of concordance between species identities predicted using the random forests model and those inferred using DNA-based identification was about 80.6% (approximate 95% confidence interval = [68.9%, 92.2%]). In contrast, baseline concordance using naive majority class prediction was 36.4%. The results provide proof of concept that geometric morphometric data has good potential to complement morphological and DNA-based identification of blow flies and flesh flies in forensic work. 
doi_str_mv 10.5061/dryad.95x69p8hf
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5061_dryad_95x69p8hf</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5061_dryad_95x69p8hf</sourcerecordid><originalsourceid>FETCH-LOGICAL-c86f-93af284df5d8548db2c16db1f455dfb7f0beb94d7849d9f752032174e042bcda3</originalsourceid><addsrcrecordid>eNo9kE1LAzEQhvfiQapnrznqoe1-JPvhTepXoSBo70uSmXQDu5uQRHT_nT_NbFuFgWGYZ953eJPkJktXLC2zNbiJw6ph32Vj605dJj_vfAQzEGUc-uDnTqxD0DLo8UC8RanREw04Bh0mYtQRHb2WvO8nogdrXOBjIKI3X0T1M337qG1Ax-_JJkLadsZp4HhHoldE0Hd_4Ad30tiOH077Tz-bHtAMGJyWZDAuHp8H4CEqWmfiD7GkGSXacJVcKN57vD73RbJ_ftpvXpe7t5ft5mG3lHWplk3BVV5TUAxqRmsQucxKEJmijIESlUoFioZCVdMGGlWxPC3yrKKY0lxI4MUiWZ9k5y-kDthapwfupjZL2zna9hht-x9t8QsABXso</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept</title><source>DataCite</source><creator>Khang, Tsung Fei ; Mohd Puaad, Nur Ayuni Dayana ; Teh, Ser Huy ; Mohamed, Zulqarnain</creator><creatorcontrib>Khang, Tsung Fei ; Mohd Puaad, Nur Ayuni Dayana ; Teh, Ser Huy ; Mohamed, Zulqarnain</creatorcontrib><description>Wing shape variation has been shown to be useful for delineating forensically important fly species in two Diptera families: Calliphoridae and Sarcophagidae. Compared to DNA-based identification, the cost of geometric morphometric data acquisition and analysis is relatively much lower because the tools required are basic, and stable softwares are available. However, to date, an explicit demonstration of using wing geometric morphometric data for species identity prediction in these two families remains lacking. Here, geometric morphometric data from 19 homologous landmarks on the left wing of males from seven species of Calliphoridae (n=55), and eight species of Sarcophagidae (n=40) were obtained and processed using Generalized Procrustes Analysis. Allometric effect was removed by regressing centroid size (in log10) against the Procrustes coordinates. Subsequently, principal component analysis of the allometry-adjusted Procrustes variables was done, with the first 15 principal components used to train a random forests model for species prediction. Using a real test sample consisting of 33 male fly specimens collected around a human corpse at a crime scene, the estimated percentage of concordance between species identities predicted using the random forests model and those inferred using DNA-based identification was about 80.6% (approximate 95% confidence interval = [68.9%, 92.2%]). In contrast, baseline concordance using naive majority class prediction was 36.4%. The results provide proof of concept that geometric morphometric data has good potential to complement morphological and DNA-based identification of blow flies and flesh flies in forensic work. </description><identifier>DOI: 10.5061/dryad.95x69p8hf</identifier><language>eng</language><publisher>Dryad</publisher><subject>Calliphoridae ; Forensic Entomology ; FOS: Biological sciences ; GUIDE ; Sarcophagidae ; species identity prediction ; wing shape</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c86f-93af284df5d8548db2c16db1f455dfb7f0beb94d7849d9f752032174e042bcda3</citedby><orcidid>0000-0003-4433-9738</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,1895</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5061/dryad.95x69p8hf$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Khang, Tsung Fei</creatorcontrib><creatorcontrib>Mohd Puaad, Nur Ayuni Dayana</creatorcontrib><creatorcontrib>Teh, Ser Huy</creatorcontrib><creatorcontrib>Mohamed, Zulqarnain</creatorcontrib><title>Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept</title><description>Wing shape variation has been shown to be useful for delineating forensically important fly species in two Diptera families: Calliphoridae and Sarcophagidae. Compared to DNA-based identification, the cost of geometric morphometric data acquisition and analysis is relatively much lower because the tools required are basic, and stable softwares are available. However, to date, an explicit demonstration of using wing geometric morphometric data for species identity prediction in these two families remains lacking. Here, geometric morphometric data from 19 homologous landmarks on the left wing of males from seven species of Calliphoridae (n=55), and eight species of Sarcophagidae (n=40) were obtained and processed using Generalized Procrustes Analysis. Allometric effect was removed by regressing centroid size (in log10) against the Procrustes coordinates. Subsequently, principal component analysis of the allometry-adjusted Procrustes variables was done, with the first 15 principal components used to train a random forests model for species prediction. Using a real test sample consisting of 33 male fly specimens collected around a human corpse at a crime scene, the estimated percentage of concordance between species identities predicted using the random forests model and those inferred using DNA-based identification was about 80.6% (approximate 95% confidence interval = [68.9%, 92.2%]). In contrast, baseline concordance using naive majority class prediction was 36.4%. The results provide proof of concept that geometric morphometric data has good potential to complement morphological and DNA-based identification of blow flies and flesh flies in forensic work. </description><subject>Calliphoridae</subject><subject>Forensic Entomology</subject><subject>FOS: Biological sciences</subject><subject>GUIDE</subject><subject>Sarcophagidae</subject><subject>species identity prediction</subject><subject>wing shape</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2020</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNo9kE1LAzEQhvfiQapnrznqoe1-JPvhTepXoSBo70uSmXQDu5uQRHT_nT_NbFuFgWGYZ953eJPkJktXLC2zNbiJw6ph32Vj605dJj_vfAQzEGUc-uDnTqxD0DLo8UC8RanREw04Bh0mYtQRHb2WvO8nogdrXOBjIKI3X0T1M337qG1Ax-_JJkLadsZp4HhHoldE0Hd_4Ad30tiOH077Tz-bHtAMGJyWZDAuHp8H4CEqWmfiD7GkGSXacJVcKN57vD73RbJ_ftpvXpe7t5ft5mG3lHWplk3BVV5TUAxqRmsQucxKEJmijIESlUoFioZCVdMGGlWxPC3yrKKY0lxI4MUiWZ9k5y-kDthapwfupjZL2zna9hht-x9t8QsABXso</recordid><startdate>20201021</startdate><enddate>20201021</enddate><creator>Khang, Tsung Fei</creator><creator>Mohd Puaad, Nur Ayuni Dayana</creator><creator>Teh, Ser Huy</creator><creator>Mohamed, Zulqarnain</creator><general>Dryad</general><scope>DYCCY</scope><scope>PQ8</scope><orcidid>https://orcid.org/0000-0003-4433-9738</orcidid></search><sort><creationdate>20201021</creationdate><title>Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept</title><author>Khang, Tsung Fei ; Mohd Puaad, Nur Ayuni Dayana ; Teh, Ser Huy ; Mohamed, Zulqarnain</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c86f-93af284df5d8548db2c16db1f455dfb7f0beb94d7849d9f752032174e042bcda3</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Calliphoridae</topic><topic>Forensic Entomology</topic><topic>FOS: Biological sciences</topic><topic>GUIDE</topic><topic>Sarcophagidae</topic><topic>species identity prediction</topic><topic>wing shape</topic><toplevel>online_resources</toplevel><creatorcontrib>Khang, Tsung Fei</creatorcontrib><creatorcontrib>Mohd Puaad, Nur Ayuni Dayana</creatorcontrib><creatorcontrib>Teh, Ser Huy</creatorcontrib><creatorcontrib>Mohamed, Zulqarnain</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Khang, Tsung Fei</au><au>Mohd Puaad, Nur Ayuni Dayana</au><au>Teh, Ser Huy</au><au>Mohamed, Zulqarnain</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept</title><date>2020-10-21</date><risdate>2020</risdate><abstract>Wing shape variation has been shown to be useful for delineating forensically important fly species in two Diptera families: Calliphoridae and Sarcophagidae. Compared to DNA-based identification, the cost of geometric morphometric data acquisition and analysis is relatively much lower because the tools required are basic, and stable softwares are available. However, to date, an explicit demonstration of using wing geometric morphometric data for species identity prediction in these two families remains lacking. Here, geometric morphometric data from 19 homologous landmarks on the left wing of males from seven species of Calliphoridae (n=55), and eight species of Sarcophagidae (n=40) were obtained and processed using Generalized Procrustes Analysis. Allometric effect was removed by regressing centroid size (in log10) against the Procrustes coordinates. Subsequently, principal component analysis of the allometry-adjusted Procrustes variables was done, with the first 15 principal components used to train a random forests model for species prediction. Using a real test sample consisting of 33 male fly specimens collected around a human corpse at a crime scene, the estimated percentage of concordance between species identities predicted using the random forests model and those inferred using DNA-based identification was about 80.6% (approximate 95% confidence interval = [68.9%, 92.2%]). In contrast, baseline concordance using naive majority class prediction was 36.4%. The results provide proof of concept that geometric morphometric data has good potential to complement morphological and DNA-based identification of blow flies and flesh flies in forensic work. </abstract><pub>Dryad</pub><doi>10.5061/dryad.95x69p8hf</doi><orcidid>https://orcid.org/0000-0003-4433-9738</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5061/dryad.95x69p8hf
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5061_dryad_95x69p8hf
source DataCite
subjects Calliphoridae
Forensic Entomology
FOS: Biological sciences
GUIDE
Sarcophagidae
species identity prediction
wing shape
title Random forests for predicting species identity of forensically important blow flies (Diptera: Calliphoridae) and flesh flies (Sarcophagidae) using geometric morphometric data: proof of concept
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T10%3A47%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Khang,%20Tsung%20Fei&rft.date=2020-10-21&rft_id=info:doi/10.5061/dryad.95x69p8hf&rft_dat=%3Cdatacite_PQ8%3E10_5061_dryad_95x69p8hf%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true