Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing

Tobacco smoking is a frequent habit sustained by > 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we ai...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Forensic science international : genetics 2023-07, Vol.65, p.102878-102878, Article 102878
Hauptverfasser: Vidaki, Athina, Planterose Jiménez, Benjamin, Poggiali, Brando, Kalamara, Vivian, van der Gaag, Kristiaan J., Maas, Silvana C.E., Ghanbari, Mohsen, Sijen, Titia, Kayser, Manfred
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 102878
container_issue
container_start_page 102878
container_title Forensic science international : genetics
container_volume 65
creator Vidaki, Athina
Planterose Jiménez, Benjamin
Poggiali, Brando
Kalamara, Vivian
van der Gaag, Kristiaan J.
Maas, Silvana C.E.
Ghanbari, Mohsen
Sijen, Titia
Kayser, Manfred
description Tobacco smoking is a frequent habit sustained by > 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we aimed to implement previously published smoking habit classification models based on blood DNA methylation at 13 CpGs. First, we developed a matching lab tool based on bisulfite conversion and multiplex PCR followed by amplification-free library preparation and targeted paired-end massively parallel sequencing (MPS). Analysis of six technical duplicates revealed high reproducibility of methylation measurements (Pearson correlation of 0.983). Artificially methylated standards uncovered marker-specific amplification bias, which we corrected via bi-exponential models. We then applied our MPS tool to 232 blood samples from Europeans of a wide age range, of which 90 were current, 71 former and 71 never smokers. On average, we obtained 189,000 reads/sample and 15,000 reads/CpG, without marker drop-out. Methylation distributions per smoking category roughly corresponded to previous microarray analysis, showcasing large inter-individual variation but with technology-driven bias. Methylation at 11 out of 13 smoking-CpGs correlated with daily cigarettes in current smokers, while solely one was weakly correlated with time since cessation in former smokers. Interestingly, eight smoking-CpGs correlated with age, and one displayed weak but significant sex-associated methylation differences. Using bias-uncorrected MPS data, smoking habits were relatively accurately predicted using both two- (current/non-current) and three- (never/former/current) category model, but bias correction resulted in worse prediction performance for both models. Finally, to account for technology-driven variation, we built new, joint models with inter-technology corrections, which resulted in improved prediction results for both models, with or without PCR bias correction (e.g. MPS cross-validation F1-score > 0.8; 2-categories). Overall, our novel assay takes us one step closer towards the forensic application of viable smoking habit prediction from blood traces. However, future research is needed towards forensically validating the assay, especially in terms of sensitivity. We also need to further shed light on the employed biomarkers, particularly on the mechanistics, tissue specificity and putative conf
doi_str_mv 10.1016/j.fsigen.2023.102878
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2807928113</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1872497323000534</els_id><sourcerecordid>2807928113</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-ce2fb64e20da8fd73de0e5f80d1a426f064ed826a34cfad13e88f7a3fe5e47a63</originalsourceid><addsrcrecordid>eNp9kMtuFDEQRS0EIiHwBwh5yaYHP_rh2SBFCS8pgk1YW9V2eeLB3R5cPZHm73HSgSWrKlXdW1d1GHsrxUYK2X_YbwLFHc4bJZSuI2UG84ydy1oaoVX__LFXTbsd9Bl7RbQXotsOsnvJzvQgZa_a7pyVWyg7XNDz6--XfMLl7pRgiXnmMEM6UaTaeH4o6KN7nOfAacq_4rzjdzDGhXic-Zhy9nwEqoeqZgKieI_pxA9QICVMnPD3EWdXba_ZiwCJ8M1TvWA_P3-6vfra3Pz48u3q8qZxrTBL41CFsW9RCQ8m-EF7FNgFI7yEVvVB1J03qgfdugBeajQmDKADdtgO0OsL9n69eyi5ZtNip0gOU4IZ85GsMmLYKiOlrtJ2lbqSiQoGeyhxgnKyUtgH2nZvV9r2gbZdaVfbu6eE4zih_2f6i7cKPq4CrH_eRyyWXKwUKsyCbrE-x_8n_AFotJUK</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2807928113</pqid></control><display><type>article</type><title>Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><creator>Vidaki, Athina ; Planterose Jiménez, Benjamin ; Poggiali, Brando ; Kalamara, Vivian ; van der Gaag, Kristiaan J. ; Maas, Silvana C.E. ; Ghanbari, Mohsen ; Sijen, Titia ; Kayser, Manfred</creator><creatorcontrib>Vidaki, Athina ; Planterose Jiménez, Benjamin ; Poggiali, Brando ; Kalamara, Vivian ; van der Gaag, Kristiaan J. ; Maas, Silvana C.E. ; Ghanbari, Mohsen ; Sijen, Titia ; Kayser, Manfred ; B.I.O.S. Consortium</creatorcontrib><description>Tobacco smoking is a frequent habit sustained by &gt; 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we aimed to implement previously published smoking habit classification models based on blood DNA methylation at 13 CpGs. First, we developed a matching lab tool based on bisulfite conversion and multiplex PCR followed by amplification-free library preparation and targeted paired-end massively parallel sequencing (MPS). Analysis of six technical duplicates revealed high reproducibility of methylation measurements (Pearson correlation of 0.983). Artificially methylated standards uncovered marker-specific amplification bias, which we corrected via bi-exponential models. We then applied our MPS tool to 232 blood samples from Europeans of a wide age range, of which 90 were current, 71 former and 71 never smokers. On average, we obtained 189,000 reads/sample and 15,000 reads/CpG, without marker drop-out. Methylation distributions per smoking category roughly corresponded to previous microarray analysis, showcasing large inter-individual variation but with technology-driven bias. Methylation at 11 out of 13 smoking-CpGs correlated with daily cigarettes in current smokers, while solely one was weakly correlated with time since cessation in former smokers. Interestingly, eight smoking-CpGs correlated with age, and one displayed weak but significant sex-associated methylation differences. Using bias-uncorrected MPS data, smoking habits were relatively accurately predicted using both two- (current/non-current) and three- (never/former/current) category model, but bias correction resulted in worse prediction performance for both models. Finally, to account for technology-driven variation, we built new, joint models with inter-technology corrections, which resulted in improved prediction results for both models, with or without PCR bias correction (e.g. MPS cross-validation F1-score &gt; 0.8; 2-categories). Overall, our novel assay takes us one step closer towards the forensic application of viable smoking habit prediction from blood traces. However, future research is needed towards forensically validating the assay, especially in terms of sensitivity. We also need to further shed light on the employed biomarkers, particularly on the mechanistics, tissue specificity and putative confounders of smoking epigenetic signatures. •We developed a novel massively parallel sequencing assay based on 13 smoking-CpGs.•Non-linear methylation quantification was corrected via bi-exponential regression fits.•We detected phenotype-specific methylation signatures in 90 current, 71 former and 71 never smokers.•Most smoking-CpGs show statistically significant association with age.•We correctly predicted smoking habits using existing microarray and new joint models.</description><identifier>ISSN: 1872-4973</identifier><identifier>EISSN: 1878-0326</identifier><identifier>DOI: 10.1016/j.fsigen.2023.102878</identifier><identifier>PMID: 37116245</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Blood ; CpG Islands - genetics ; DNA Methylation ; Forensic epigenetics ; High-Throughput Nucleotide Sequencing ; Humans ; Lifestyle prediction ; Massively parallel sequencing ; Polymerase Chain Reaction ; Reproducibility of Results ; Smoking ; Smoking - genetics</subject><ispartof>Forensic science international : genetics, 2023-07, Vol.65, p.102878-102878, Article 102878</ispartof><rights>2023 The Authors</rights><rights>Copyright © 2023 The Authors. Published by Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-ce2fb64e20da8fd73de0e5f80d1a426f064ed826a34cfad13e88f7a3fe5e47a63</citedby><cites>FETCH-LOGICAL-c408t-ce2fb64e20da8fd73de0e5f80d1a426f064ed826a34cfad13e88f7a3fe5e47a63</cites><orcidid>0000-0002-5470-245X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1872497323000534$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3537,27903,27904,65309</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37116245$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Vidaki, Athina</creatorcontrib><creatorcontrib>Planterose Jiménez, Benjamin</creatorcontrib><creatorcontrib>Poggiali, Brando</creatorcontrib><creatorcontrib>Kalamara, Vivian</creatorcontrib><creatorcontrib>van der Gaag, Kristiaan J.</creatorcontrib><creatorcontrib>Maas, Silvana C.E.</creatorcontrib><creatorcontrib>Ghanbari, Mohsen</creatorcontrib><creatorcontrib>Sijen, Titia</creatorcontrib><creatorcontrib>Kayser, Manfred</creatorcontrib><creatorcontrib>B.I.O.S. Consortium</creatorcontrib><title>Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing</title><title>Forensic science international : genetics</title><addtitle>Forensic Sci Int Genet</addtitle><description>Tobacco smoking is a frequent habit sustained by &gt; 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we aimed to implement previously published smoking habit classification models based on blood DNA methylation at 13 CpGs. First, we developed a matching lab tool based on bisulfite conversion and multiplex PCR followed by amplification-free library preparation and targeted paired-end massively parallel sequencing (MPS). Analysis of six technical duplicates revealed high reproducibility of methylation measurements (Pearson correlation of 0.983). Artificially methylated standards uncovered marker-specific amplification bias, which we corrected via bi-exponential models. We then applied our MPS tool to 232 blood samples from Europeans of a wide age range, of which 90 were current, 71 former and 71 never smokers. On average, we obtained 189,000 reads/sample and 15,000 reads/CpG, without marker drop-out. Methylation distributions per smoking category roughly corresponded to previous microarray analysis, showcasing large inter-individual variation but with technology-driven bias. Methylation at 11 out of 13 smoking-CpGs correlated with daily cigarettes in current smokers, while solely one was weakly correlated with time since cessation in former smokers. Interestingly, eight smoking-CpGs correlated with age, and one displayed weak but significant sex-associated methylation differences. Using bias-uncorrected MPS data, smoking habits were relatively accurately predicted using both two- (current/non-current) and three- (never/former/current) category model, but bias correction resulted in worse prediction performance for both models. Finally, to account for technology-driven variation, we built new, joint models with inter-technology corrections, which resulted in improved prediction results for both models, with or without PCR bias correction (e.g. MPS cross-validation F1-score &gt; 0.8; 2-categories). Overall, our novel assay takes us one step closer towards the forensic application of viable smoking habit prediction from blood traces. However, future research is needed towards forensically validating the assay, especially in terms of sensitivity. We also need to further shed light on the employed biomarkers, particularly on the mechanistics, tissue specificity and putative confounders of smoking epigenetic signatures. •We developed a novel massively parallel sequencing assay based on 13 smoking-CpGs.•Non-linear methylation quantification was corrected via bi-exponential regression fits.•We detected phenotype-specific methylation signatures in 90 current, 71 former and 71 never smokers.•Most smoking-CpGs show statistically significant association with age.•We correctly predicted smoking habits using existing microarray and new joint models.</description><subject>Blood</subject><subject>CpG Islands - genetics</subject><subject>DNA Methylation</subject><subject>Forensic epigenetics</subject><subject>High-Throughput Nucleotide Sequencing</subject><subject>Humans</subject><subject>Lifestyle prediction</subject><subject>Massively parallel sequencing</subject><subject>Polymerase Chain Reaction</subject><subject>Reproducibility of Results</subject><subject>Smoking</subject><subject>Smoking - genetics</subject><issn>1872-4973</issn><issn>1878-0326</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kMtuFDEQRS0EIiHwBwh5yaYHP_rh2SBFCS8pgk1YW9V2eeLB3R5cPZHm73HSgSWrKlXdW1d1GHsrxUYK2X_YbwLFHc4bJZSuI2UG84ydy1oaoVX__LFXTbsd9Bl7RbQXotsOsnvJzvQgZa_a7pyVWyg7XNDz6--XfMLl7pRgiXnmMEM6UaTaeH4o6KN7nOfAacq_4rzjdzDGhXic-Zhy9nwEqoeqZgKieI_pxA9QICVMnPD3EWdXba_ZiwCJ8M1TvWA_P3-6vfra3Pz48u3q8qZxrTBL41CFsW9RCQ8m-EF7FNgFI7yEVvVB1J03qgfdugBeajQmDKADdtgO0OsL9n69eyi5ZtNip0gOU4IZ85GsMmLYKiOlrtJ2lbqSiQoGeyhxgnKyUtgH2nZvV9r2gbZdaVfbu6eE4zih_2f6i7cKPq4CrH_eRyyWXKwUKsyCbrE-x_8n_AFotJUK</recordid><startdate>202307</startdate><enddate>202307</enddate><creator>Vidaki, Athina</creator><creator>Planterose Jiménez, Benjamin</creator><creator>Poggiali, Brando</creator><creator>Kalamara, Vivian</creator><creator>van der Gaag, Kristiaan J.</creator><creator>Maas, Silvana C.E.</creator><creator>Ghanbari, Mohsen</creator><creator>Sijen, Titia</creator><creator>Kayser, Manfred</creator><general>Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-5470-245X</orcidid></search><sort><creationdate>202307</creationdate><title>Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing</title><author>Vidaki, Athina ; Planterose Jiménez, Benjamin ; Poggiali, Brando ; Kalamara, Vivian ; van der Gaag, Kristiaan J. ; Maas, Silvana C.E. ; Ghanbari, Mohsen ; Sijen, Titia ; Kayser, Manfred</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-ce2fb64e20da8fd73de0e5f80d1a426f064ed826a34cfad13e88f7a3fe5e47a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Blood</topic><topic>CpG Islands - genetics</topic><topic>DNA Methylation</topic><topic>Forensic epigenetics</topic><topic>High-Throughput Nucleotide Sequencing</topic><topic>Humans</topic><topic>Lifestyle prediction</topic><topic>Massively parallel sequencing</topic><topic>Polymerase Chain Reaction</topic><topic>Reproducibility of Results</topic><topic>Smoking</topic><topic>Smoking - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vidaki, Athina</creatorcontrib><creatorcontrib>Planterose Jiménez, Benjamin</creatorcontrib><creatorcontrib>Poggiali, Brando</creatorcontrib><creatorcontrib>Kalamara, Vivian</creatorcontrib><creatorcontrib>van der Gaag, Kristiaan J.</creatorcontrib><creatorcontrib>Maas, Silvana C.E.</creatorcontrib><creatorcontrib>Ghanbari, Mohsen</creatorcontrib><creatorcontrib>Sijen, Titia</creatorcontrib><creatorcontrib>Kayser, Manfred</creatorcontrib><creatorcontrib>B.I.O.S. Consortium</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Forensic science international : genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vidaki, Athina</au><au>Planterose Jiménez, Benjamin</au><au>Poggiali, Brando</au><au>Kalamara, Vivian</au><au>van der Gaag, Kristiaan J.</au><au>Maas, Silvana C.E.</au><au>Ghanbari, Mohsen</au><au>Sijen, Titia</au><au>Kayser, Manfred</au><aucorp>B.I.O.S. Consortium</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing</atitle><jtitle>Forensic science international : genetics</jtitle><addtitle>Forensic Sci Int Genet</addtitle><date>2023-07</date><risdate>2023</risdate><volume>65</volume><spage>102878</spage><epage>102878</epage><pages>102878-102878</pages><artnum>102878</artnum><issn>1872-4973</issn><eissn>1878-0326</eissn><abstract>Tobacco smoking is a frequent habit sustained by &gt; 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we aimed to implement previously published smoking habit classification models based on blood DNA methylation at 13 CpGs. First, we developed a matching lab tool based on bisulfite conversion and multiplex PCR followed by amplification-free library preparation and targeted paired-end massively parallel sequencing (MPS). Analysis of six technical duplicates revealed high reproducibility of methylation measurements (Pearson correlation of 0.983). Artificially methylated standards uncovered marker-specific amplification bias, which we corrected via bi-exponential models. We then applied our MPS tool to 232 blood samples from Europeans of a wide age range, of which 90 were current, 71 former and 71 never smokers. On average, we obtained 189,000 reads/sample and 15,000 reads/CpG, without marker drop-out. Methylation distributions per smoking category roughly corresponded to previous microarray analysis, showcasing large inter-individual variation but with technology-driven bias. Methylation at 11 out of 13 smoking-CpGs correlated with daily cigarettes in current smokers, while solely one was weakly correlated with time since cessation in former smokers. Interestingly, eight smoking-CpGs correlated with age, and one displayed weak but significant sex-associated methylation differences. Using bias-uncorrected MPS data, smoking habits were relatively accurately predicted using both two- (current/non-current) and three- (never/former/current) category model, but bias correction resulted in worse prediction performance for both models. Finally, to account for technology-driven variation, we built new, joint models with inter-technology corrections, which resulted in improved prediction results for both models, with or without PCR bias correction (e.g. MPS cross-validation F1-score &gt; 0.8; 2-categories). Overall, our novel assay takes us one step closer towards the forensic application of viable smoking habit prediction from blood traces. However, future research is needed towards forensically validating the assay, especially in terms of sensitivity. We also need to further shed light on the employed biomarkers, particularly on the mechanistics, tissue specificity and putative confounders of smoking epigenetic signatures. •We developed a novel massively parallel sequencing assay based on 13 smoking-CpGs.•Non-linear methylation quantification was corrected via bi-exponential regression fits.•We detected phenotype-specific methylation signatures in 90 current, 71 former and 71 never smokers.•Most smoking-CpGs show statistically significant association with age.•We correctly predicted smoking habits using existing microarray and new joint models.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>37116245</pmid><doi>10.1016/j.fsigen.2023.102878</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-5470-245X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1872-4973
ispartof Forensic science international : genetics, 2023-07, Vol.65, p.102878-102878, Article 102878
issn 1872-4973
1878-0326
language eng
recordid cdi_proquest_miscellaneous_2807928113
source MEDLINE; Elsevier ScienceDirect Journals
subjects Blood
CpG Islands - genetics
DNA Methylation
Forensic epigenetics
High-Throughput Nucleotide Sequencing
Humans
Lifestyle prediction
Massively parallel sequencing
Polymerase Chain Reaction
Reproducibility of Results
Smoking
Smoking - genetics
title Targeted DNA methylation analysis and prediction of smoking habits in blood based on massively parallel sequencing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T23%3A25%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Targeted%20DNA%20methylation%20analysis%20and%20prediction%20of%20smoking%20habits%20in%20blood%20based%20on%20massively%20parallel%20sequencing&rft.jtitle=Forensic%20science%20international%20:%20genetics&rft.au=Vidaki,%20Athina&rft.aucorp=B.I.O.S.%20Consortium&rft.date=2023-07&rft.volume=65&rft.spage=102878&rft.epage=102878&rft.pages=102878-102878&rft.artnum=102878&rft.issn=1872-4973&rft.eissn=1878-0326&rft_id=info:doi/10.1016/j.fsigen.2023.102878&rft_dat=%3Cproquest_cross%3E2807928113%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2807928113&rft_id=info:pmid/37116245&rft_els_id=S1872497323000534&rfr_iscdi=true