Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians

Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (avera...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PloS one 2013-04, Vol.8 (4), p.e59494
Hauptverfasser: Shen, Hui, Li, Jian, Zhang, Jigang, Xu, Chao, Jiang, Yan, Wu, Zikai, Zhao, Fuping, Liao, Li, Chen, Jun, Lin, Yong, Tian, Qing, Papasian, Christopher J, Deng, Hong-Wen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 4
container_start_page e59494
container_title PloS one
container_volume 8
creator Shen, Hui
Li, Jian
Zhang, Jigang
Xu, Chao
Jiang, Yan
Wu, Zikai
Zhao, Fuping
Liao, Li
Chen, Jun
Lin, Yong
Tian, Qing
Papasian, Christopher J
Deng, Hong-Wen
description Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely "knock out" the corresponding genes. Across all the 44 genomes, a total of 182 genes were "knocked-out" in at least one individual genome, among which 46 genes were "knocked out" in over 30% of our samples, suggesting that a number of genes are commonly "knocked-out" in general populations. Gene ontology analysis suggested that these commonly "knocked-out" genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.
doi_str_mv 10.1371/journal.pone.0059494
format Article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1330900781</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A478419776</galeid><doaj_id>oai_doaj_org_article_365afc58c8654b5dba660759284a3a85</doaj_id><sourcerecordid>A478419776</sourcerecordid><originalsourceid>FETCH-LOGICAL-c692t-6bbb2414ddd3a0e05d63aff65a574fc080231487125f52bd902b578aa705ed8f3</originalsourceid><addsrcrecordid>eNqNk22L1DAQx4so3nn6DUQLguiLXdOkSdo3wrH4sHBw4NPbME2TNkubrEm7uvfpTd3esSv3QgJJmPzmP5lJJkmeZ2iZEZ6927jRW-iWW2fVEiFa5mX-IDnPSoIXDCPy8Gh_ljwJYRMhUjD2ODnDhHKOGDtPblau33rVKhvMTqWyBQ9yUN7cwGCcTZ1O27EHmzbKul6lO_DmcFLt09Y0bSrdTnloVPqrdZ1azFxQP0dlpbHNJKGdH_ZxHn26glFCMGDD0-SRhi6oZ_N6kXz_-OHb6vPi6vrTenV5tZCsxMOCVVWF8yyv65oAUojWjIDWjALluZaoQJhkecEzTDXFVV0iXFFeAHBEVV1ocpG8POhuOxfEXLYgMkJQiRAvskisD0TtYCO23vTg98KBEX8NzjcC_GBkpwSJcbWkhSwYzStaV8AY4rTERQ4EChq13s_RxqpXtVR28NCdiJ6eWNOKxu2iclZgzqPAm1nAu1jDMIjeBKm6Dqxy43RvzDimBOcRffUPen92M9VATMBY7WJcOYmKy5wXeVZyziK1vIeKo1a9kfGPaRPtJw5vTxwiM6jfQwNjCGL99cv_s9c_TtnXR2yroBva4Lpx-nThFMwPoPQuBK_0XZEzJKYWua2GmFpEzC0S3V4cP9Cd021PkD9O1A16</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1330900781</pqid></control><display><type>article</type><title>Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians</title><source>MEDLINE</source><source>Public Library of Science</source><source>PubMed Central</source><source>Directory of Open Access Journals</source><source>Free Full-Text Journals in Chemistry</source><source>EZB Electronic Journals Library</source><creator>Shen, Hui ; Li, Jian ; Zhang, Jigang ; Xu, Chao ; Jiang, Yan ; Wu, Zikai ; Zhao, Fuping ; Liao, Li ; Chen, Jun ; Lin, Yong ; Tian, Qing ; Papasian, Christopher J ; Deng, Hong-Wen</creator><creatorcontrib>Shen, Hui ; Li, Jian ; Zhang, Jigang ; Xu, Chao ; Jiang, Yan ; Wu, Zikai ; Zhao, Fuping ; Liao, Li ; Chen, Jun ; Lin, Yong ; Tian, Qing ; Papasian, Christopher J ; Deng, Hong-Wen</creatorcontrib><description>Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely "knock out" the corresponding genes. Across all the 44 genomes, a total of 182 genes were "knocked-out" in at least one individual genome, among which 46 genes were "knocked out" in over 30% of our samples, suggesting that a number of genes are commonly "knocked-out" in general populations. Gene ontology analysis suggested that these commonly "knocked-out" genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0059494</identifier><identifier>PMID: 23577066</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Adult ; Adults ; Analysis ; Antigen processing ; Bioinformatics ; Biological activity ; Biological effects ; Biology ; Chromosomes, Human, Y - genetics ; Computer Science ; Consortia ; Deoxyribonucleic acid ; Disease - ethnology ; Disease - genetics ; DNA ; DNA Copy Number Variations - genetics ; DNA sequencing ; DNA, Mitochondrial - genetics ; Female ; Gene frequency ; Gene sequencing ; Genes ; Genetic aspects ; Genetic diversity ; Genetic research ; Genetic testing ; Genome, Human - genetics ; Genomes ; Genomics ; Health care ; Humans ; Immune response ; Immune system ; INDEL Mutation - genetics ; Male ; Medicine ; Mitochondrial DNA ; Mutation Rate ; Polymorphism, Single Nucleotide - genetics ; Public health ; Science ; Sequence Analysis, DNA - methods ; Single nucleotide polymorphisms ; Single-nucleotide polymorphism ; Studies ; White People - genetics</subject><ispartof>PloS one, 2013-04, Vol.8 (4), p.e59494</ispartof><rights>COPYRIGHT 2013 Public Library of Science</rights><rights>2013 Shen et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2013 Shen et al 2013 Shen et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c692t-6bbb2414ddd3a0e05d63aff65a574fc080231487125f52bd902b578aa705ed8f3</citedby><cites>FETCH-LOGICAL-c692t-6bbb2414ddd3a0e05d63aff65a574fc080231487125f52bd902b578aa705ed8f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618277/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618277/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2096,2915,23845,27901,27902,53766,53768,79342,79343</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/23577066$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Shen, Hui</creatorcontrib><creatorcontrib>Li, Jian</creatorcontrib><creatorcontrib>Zhang, Jigang</creatorcontrib><creatorcontrib>Xu, Chao</creatorcontrib><creatorcontrib>Jiang, Yan</creatorcontrib><creatorcontrib>Wu, Zikai</creatorcontrib><creatorcontrib>Zhao, Fuping</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Chen, Jun</creatorcontrib><creatorcontrib>Lin, Yong</creatorcontrib><creatorcontrib>Tian, Qing</creatorcontrib><creatorcontrib>Papasian, Christopher J</creatorcontrib><creatorcontrib>Deng, Hong-Wen</creatorcontrib><title>Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely "knock out" the corresponding genes. Across all the 44 genomes, a total of 182 genes were "knocked-out" in at least one individual genome, among which 46 genes were "knocked out" in over 30% of our samples, suggesting that a number of genes are commonly "knocked-out" in general populations. Gene ontology analysis suggested that these commonly "knocked-out" genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.</description><subject>Adult</subject><subject>Adults</subject><subject>Analysis</subject><subject>Antigen processing</subject><subject>Bioinformatics</subject><subject>Biological activity</subject><subject>Biological effects</subject><subject>Biology</subject><subject>Chromosomes, Human, Y - genetics</subject><subject>Computer Science</subject><subject>Consortia</subject><subject>Deoxyribonucleic acid</subject><subject>Disease - ethnology</subject><subject>Disease - genetics</subject><subject>DNA</subject><subject>DNA Copy Number Variations - genetics</subject><subject>DNA sequencing</subject><subject>DNA, Mitochondrial - genetics</subject><subject>Female</subject><subject>Gene frequency</subject><subject>Gene sequencing</subject><subject>Genes</subject><subject>Genetic aspects</subject><subject>Genetic diversity</subject><subject>Genetic research</subject><subject>Genetic testing</subject><subject>Genome, Human - genetics</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Health care</subject><subject>Humans</subject><subject>Immune response</subject><subject>Immune system</subject><subject>INDEL Mutation - genetics</subject><subject>Male</subject><subject>Medicine</subject><subject>Mitochondrial DNA</subject><subject>Mutation Rate</subject><subject>Polymorphism, Single Nucleotide - genetics</subject><subject>Public health</subject><subject>Science</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Single nucleotide polymorphisms</subject><subject>Single-nucleotide polymorphism</subject><subject>Studies</subject><subject>White People - genetics</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><sourceid>DOA</sourceid><recordid>eNqNk22L1DAQx4so3nn6DUQLguiLXdOkSdo3wrH4sHBw4NPbME2TNkubrEm7uvfpTd3esSv3QgJJmPzmP5lJJkmeZ2iZEZ6927jRW-iWW2fVEiFa5mX-IDnPSoIXDCPy8Gh_ljwJYRMhUjD2ODnDhHKOGDtPblau33rVKhvMTqWyBQ9yUN7cwGCcTZ1O27EHmzbKul6lO_DmcFLt09Y0bSrdTnloVPqrdZ1azFxQP0dlpbHNJKGdH_ZxHn26glFCMGDD0-SRhi6oZ_N6kXz_-OHb6vPi6vrTenV5tZCsxMOCVVWF8yyv65oAUojWjIDWjALluZaoQJhkecEzTDXFVV0iXFFeAHBEVV1ocpG8POhuOxfEXLYgMkJQiRAvskisD0TtYCO23vTg98KBEX8NzjcC_GBkpwSJcbWkhSwYzStaV8AY4rTERQ4EChq13s_RxqpXtVR28NCdiJ6eWNOKxu2iclZgzqPAm1nAu1jDMIjeBKm6Dqxy43RvzDimBOcRffUPen92M9VATMBY7WJcOYmKy5wXeVZyziK1vIeKo1a9kfGPaRPtJw5vTxwiM6jfQwNjCGL99cv_s9c_TtnXR2yroBva4Lpx-nThFMwPoPQuBK_0XZEzJKYWua2GmFpEzC0S3V4cP9Cd021PkD9O1A16</recordid><startdate>20130405</startdate><enddate>20130405</enddate><creator>Shen, Hui</creator><creator>Li, Jian</creator><creator>Zhang, Jigang</creator><creator>Xu, Chao</creator><creator>Jiang, Yan</creator><creator>Wu, Zikai</creator><creator>Zhao, Fuping</creator><creator>Liao, Li</creator><creator>Chen, Jun</creator><creator>Lin, Yong</creator><creator>Tian, Qing</creator><creator>Papasian, Christopher J</creator><creator>Deng, Hong-Wen</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20130405</creationdate><title>Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians</title><author>Shen, Hui ; Li, Jian ; Zhang, Jigang ; Xu, Chao ; Jiang, Yan ; Wu, Zikai ; Zhao, Fuping ; Liao, Li ; Chen, Jun ; Lin, Yong ; Tian, Qing ; Papasian, Christopher J ; Deng, Hong-Wen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c692t-6bbb2414ddd3a0e05d63aff65a574fc080231487125f52bd902b578aa705ed8f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Adult</topic><topic>Adults</topic><topic>Analysis</topic><topic>Antigen processing</topic><topic>Bioinformatics</topic><topic>Biological activity</topic><topic>Biological effects</topic><topic>Biology</topic><topic>Chromosomes, Human, Y - genetics</topic><topic>Computer Science</topic><topic>Consortia</topic><topic>Deoxyribonucleic acid</topic><topic>Disease - ethnology</topic><topic>Disease - genetics</topic><topic>DNA</topic><topic>DNA Copy Number Variations - genetics</topic><topic>DNA sequencing</topic><topic>DNA, Mitochondrial - genetics</topic><topic>Female</topic><topic>Gene frequency</topic><topic>Gene sequencing</topic><topic>Genes</topic><topic>Genetic aspects</topic><topic>Genetic diversity</topic><topic>Genetic research</topic><topic>Genetic testing</topic><topic>Genome, Human - genetics</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Health care</topic><topic>Humans</topic><topic>Immune response</topic><topic>Immune system</topic><topic>INDEL Mutation - genetics</topic><topic>Male</topic><topic>Medicine</topic><topic>Mitochondrial DNA</topic><topic>Mutation Rate</topic><topic>Polymorphism, Single Nucleotide - genetics</topic><topic>Public health</topic><topic>Science</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Single nucleotide polymorphisms</topic><topic>Single-nucleotide polymorphism</topic><topic>Studies</topic><topic>White People - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shen, Hui</creatorcontrib><creatorcontrib>Li, Jian</creatorcontrib><creatorcontrib>Zhang, Jigang</creatorcontrib><creatorcontrib>Xu, Chao</creatorcontrib><creatorcontrib>Jiang, Yan</creatorcontrib><creatorcontrib>Wu, Zikai</creatorcontrib><creatorcontrib>Zhao, Fuping</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Chen, Jun</creatorcontrib><creatorcontrib>Lin, Yong</creatorcontrib><creatorcontrib>Tian, Qing</creatorcontrib><creatorcontrib>Papasian, Christopher J</creatorcontrib><creatorcontrib>Deng, Hong-Wen</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing &amp; Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest - Health &amp; Medical Complete保健、医学与药学数据库</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials Science Collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shen, Hui</au><au>Li, Jian</au><au>Zhang, Jigang</au><au>Xu, Chao</au><au>Jiang, Yan</au><au>Wu, Zikai</au><au>Zhao, Fuping</au><au>Liao, Li</au><au>Chen, Jun</au><au>Lin, Yong</au><au>Tian, Qing</au><au>Papasian, Christopher J</au><au>Deng, Hong-Wen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2013-04-05</date><risdate>2013</risdate><volume>8</volume><issue>4</issue><spage>e59494</spage><pages>e59494-</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely "knock out" the corresponding genes. Across all the 44 genomes, a total of 182 genes were "knocked-out" in at least one individual genome, among which 46 genes were "knocked out" in over 30% of our samples, suggesting that a number of genes are commonly "knocked-out" in general populations. Gene ontology analysis suggested that these commonly "knocked-out" genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>23577066</pmid><doi>10.1371/journal.pone.0059494</doi><tpages>e59494</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2013-04, Vol.8 (4), p.e59494
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_1330900781
source MEDLINE; Public Library of Science; PubMed Central; Directory of Open Access Journals; Free Full-Text Journals in Chemistry; EZB Electronic Journals Library
subjects Adult
Adults
Analysis
Antigen processing
Bioinformatics
Biological activity
Biological effects
Biology
Chromosomes, Human, Y - genetics
Computer Science
Consortia
Deoxyribonucleic acid
Disease - ethnology
Disease - genetics
DNA
DNA Copy Number Variations - genetics
DNA sequencing
DNA, Mitochondrial - genetics
Female
Gene frequency
Gene sequencing
Genes
Genetic aspects
Genetic diversity
Genetic research
Genetic testing
Genome, Human - genetics
Genomes
Genomics
Health care
Humans
Immune response
Immune system
INDEL Mutation - genetics
Male
Medicine
Mitochondrial DNA
Mutation Rate
Polymorphism, Single Nucleotide - genetics
Public health
Science
Sequence Analysis, DNA - methods
Single nucleotide polymorphisms
Single-nucleotide polymorphism
Studies
White People - genetics
title Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T15%3A15%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comprehensive%20characterization%20of%20human%20genome%20variation%20by%20high%20coverage%20whole-genome%20sequencing%20of%20forty%20four%20Caucasians&rft.jtitle=PloS%20one&rft.au=Shen,%20Hui&rft.date=2013-04-05&rft.volume=8&rft.issue=4&rft.spage=e59494&rft.pages=e59494-&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0059494&rft_dat=%3Cgale_plos_%3EA478419776%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1330900781&rft_id=info:pmid/23577066&rft_galeid=A478419776&rft_doaj_id=oai_doaj_org_article_365afc58c8654b5dba660759284a3a85&rfr_iscdi=true