Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome

Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants...

Bibliographic details
Published in: BMC genomics 2014-05, Vol.15 (1), p.387-387, Article 387
Main authors: O'Bleness, Majesta, Searles, Veronica B, Dickens, C Michael, Astling, David, Albracht, Derek, Mak, Angel C Y, Lai, Yvonne Y Y, Lin, Chin, Chu, Catherine, Graves, Tina, Kwok, Pui-Yan, Wilson, Richard K, Sikela, James M
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 387
container_issue 1
container_start_page 387
container_title BMC genomics
container_volume 15
creator O'Bleness, Majesta
Searles, Veronica B
Dickens, C Michael
Astling, David
Albracht, Derek
Mak, Angel C Y
Lai, Yvonne Y Y
Lin, Chin
Chu, Catherine
Graves, Tina
Kwok, Pui-Yan
Wilson, Richard K
Sikela, James M
description Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region. We found and addressed several inaccuracies in the GRCh37 sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion. Through the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. 
The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.
doi_str_mv 10.1186/1471-2164-15-387
format Article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4053653</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A539565852</galeid><sourcerecordid>A539565852</sourcerecordid><originalsourceid>FETCH-LOGICAL-b684t-a9737c8ac950cbbdec58edc1782d6b9999660ea352a21436053507667d6040493</originalsourceid><addsrcrecordid>eNqNkk1v1DAQhiMEoqVw54QscSmHFH8nuSBVCwuVKiEBPXGwHGeSuErsrZ0g-u9xtGVpUJGwD7Zmnnk1emey7CXBZ4SU8i3hBckpkTwnImdl8Sg7PoQe3_sfZc9ivMaYFCUVT7MjystSYCqOs-9b62zsoUERbmZwBpB2DdIxwlgPt8i3aOoBvb_aEkpxHqzpEbmhBAXorHdojtZ1SKNe7wZvG9TPo3aoA-dHeJ49afUQ4cXde5JdbT9823zKLz9_vNicX-a1LPmU66pghSm1qQQ2dd2AESU0Zum1kXWVjpQYNBNUU8KZxIIJXEhZNBJzzCt2kr3b6-7mekyV4KagB7ULdtThVnlt1TrjbK86_0PxJCUFSwKbvUBt_T8E1hnjR7WYqxZzFREqeZ9UTu_aCD5ZGSc12mhgGLQDP8eEcUypSEP4D5RxXElGl95e_4Ve-zm45OdCSc4KLsQfqtMDKOtan_o0i6g6F6wSUpSCJursASrdBkZrvIPWpviq4M2qIDET_Jw6PceoLr5-WbN4z5rgYwzQHvwjWC3b-pBjr-4P7lDwez3ZL4im3-8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1536437455</pqid></control><display><type>article</type><title>Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>SpringerLink Journals (MCLS)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>O'Bleness, Majesta ; Searles, Veronica B ; Dickens, C Michael ; Astling, David ; Albracht, Derek ; Mak, Angel C Y ; Lai, Yvonne Y Y ; Lin, Chin ; Chu, Catherine ; Graves, Tina ; Kwok, Pui-Yan ; Wilson, Richard K ; Sikela, James M</creator><creatorcontrib>O'Bleness, Majesta ; Searles, Veronica B ; Dickens, C Michael ; Astling, David ; Albracht, Derek ; Mak, Angel C Y ; Lai, Yvonne Y Y ; Lin, Chin ; Chu, Catherine 
; Graves, Tina ; Kwok, Pui-Yan ; Wilson, Richard K ; Sikela, James M</creatorcontrib><description>Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region. We found and addressed several inaccuracies in the GRCh37 sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion. 
Through the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.</description><identifier>ISSN: 1471-2164</identifier><identifier>EISSN: 1471-2164</identifier><identifier>DOI: 10.1186/1471-2164-15-387</identifier><identifier>PMID: 24885025</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Academic libraries ; Biological Evolution ; Carrier Proteins - genetics ; Chromosomes, Human, Pair 1 ; Colleges &amp; universities ; Comparative Genomic Hybridization ; Data analysis ; Deoxyribonucleic acid ; Disease ; Disease susceptibility ; DNA ; DNA Copy Number Variations ; Genes ; Genetic Linkage ; Genetic testing ; Genetics ; Genome, Human ; Genomes ; Genomics ; Haploidy ; Humans ; Methods ; Protein Structure, Tertiary - genetics ; Segmental Duplications, Genomic</subject><ispartof>BMC genomics, 2014-05, Vol.15 (1), p.387-387, Article 387</ispartof><rights>COPYRIGHT 2014 BioMed Central Ltd.</rights><rights>2014 O'Bleness et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. 
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.</rights><rights>O’Bleness et al.; licensee BioMed Central Ltd. 2014</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b684t-a9737c8ac950cbbdec58edc1782d6b9999660ea352a21436053507667d6040493</citedby><cites>FETCH-LOGICAL-b684t-a9737c8ac950cbbdec58edc1782d6b9999660ea352a21436053507667d6040493</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053653/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053653/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24885025$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>O'Bleness, Majesta</creatorcontrib><creatorcontrib>Searles, Veronica B</creatorcontrib><creatorcontrib>Dickens, C Michael</creatorcontrib><creatorcontrib>Astling, David</creatorcontrib><creatorcontrib>Albracht, Derek</creatorcontrib><creatorcontrib>Mak, Angel C Y</creatorcontrib><creatorcontrib>Lai, Yvonne Y Y</creatorcontrib><creatorcontrib>Lin, Chin</creatorcontrib><creatorcontrib>Chu, Catherine</creatorcontrib><creatorcontrib>Graves, Tina</creatorcontrib><creatorcontrib>Kwok, Pui-Yan</creatorcontrib><creatorcontrib>Wilson, Richard K</creatorcontrib><creatorcontrib>Sikela, James M</creatorcontrib><title>Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome</title><title>BMC genomics</title><addtitle>BMC 
Genomics</addtitle><description>Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region. We found and addressed several inaccuracies in the GRCh37 sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion. Through the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. 
The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.</description><subject>Academic libraries</subject><subject>Biological Evolution</subject><subject>Carrier Proteins - genetics</subject><subject>Chromosomes, Human, Pair 1</subject><subject>Colleges &amp; universities</subject><subject>Comparative Genomic Hybridization</subject><subject>Data analysis</subject><subject>Deoxyribonucleic acid</subject><subject>Disease</subject><subject>Disease susceptibility</subject><subject>DNA</subject><subject>DNA Copy Number Variations</subject><subject>Genes</subject><subject>Genetic Linkage</subject><subject>Genetic testing</subject><subject>Genetics</subject><subject>Genome, Human</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Haploidy</subject><subject>Humans</subject><subject>Methods</subject><subject>Protein Structure, Tertiary - genetics</subject><subject>Segmental Duplications, 
Genomic</subject><issn>1471-2164</issn><issn>1471-2164</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNqNkk1v1DAQhiMEoqVw54QscSmHFH8nuSBVCwuVKiEBPXGwHGeSuErsrZ0g-u9xtGVpUJGwD7Zmnnk1emey7CXBZ4SU8i3hBckpkTwnImdl8Sg7PoQe3_sfZc9ivMaYFCUVT7MjystSYCqOs-9b62zsoUERbmZwBpB2DdIxwlgPt8i3aOoBvb_aEkpxHqzpEbmhBAXorHdojtZ1SKNe7wZvG9TPo3aoA-dHeJ49afUQ4cXde5JdbT9823zKLz9_vNicX-a1LPmU66pghSm1qQQ2dd2AESU0Zum1kXWVjpQYNBNUU8KZxIIJXEhZNBJzzCt2kr3b6-7mekyV4KagB7ULdtThVnlt1TrjbK86_0PxJCUFSwKbvUBt_T8E1hnjR7WYqxZzFREqeZ9UTu_aCD5ZGSc12mhgGLQDP8eEcUypSEP4D5RxXElGl95e_4Ve-zm45OdCSc4KLsQfqtMDKOtan_o0i6g6F6wSUpSCJursASrdBkZrvIPWpviq4M2qIDET_Jw6PceoLr5-WbN4z5rgYwzQHvwjWC3b-pBjr-4P7lDwez3ZL4im3-8</recordid><startdate>20140520</startdate><enddate>20140520</enddate><creator>O'Bleness, Majesta</creator><creator>Searles, Veronica B</creator><creator>Dickens, C Michael</creator><creator>Astling, David</creator><creator>Albracht, Derek</creator><creator>Mak, Angel C Y</creator><creator>Lai, Yvonne Y Y</creator><creator>Lin, Chin</creator><creator>Chu, Catherine</creator><creator>Graves, Tina</creator><creator>Kwok, Pui-Yan</creator><creator>Wilson, Richard K</creator><creator>Sikela, James M</creator><general>BioMed Central Ltd</general><general>BioMed 
Central</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7SS</scope><scope>7TK</scope><scope>7U7</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20140520</creationdate><title>Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome</title><author>O'Bleness, Majesta ; Searles, Veronica B ; Dickens, C Michael ; Astling, David ; Albracht, Derek ; Mak, Angel C Y ; Lai, Yvonne Y Y ; Lin, Chin ; Chu, Catherine ; Graves, Tina ; Kwok, Pui-Yan ; Wilson, Richard K ; Sikela, James M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b684t-a9737c8ac950cbbdec58edc1782d6b9999660ea352a21436053507667d6040493</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Academic libraries</topic><topic>Biological Evolution</topic><topic>Carrier Proteins - genetics</topic><topic>Chromosomes, Human, Pair 1</topic><topic>Colleges &amp; universities</topic><topic>Comparative Genomic Hybridization</topic><topic>Data analysis</topic><topic>Deoxyribonucleic 
acid</topic><topic>Disease</topic><topic>Disease susceptibility</topic><topic>DNA</topic><topic>DNA Copy Number Variations</topic><topic>Genes</topic><topic>Genetic Linkage</topic><topic>Genetic testing</topic><topic>Genetics</topic><topic>Genome, Human</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Haploidy</topic><topic>Humans</topic><topic>Methods</topic><topic>Protein Structure, Tertiary - genetics</topic><topic>Segmental Duplications, Genomic</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>O'Bleness, Majesta</creatorcontrib><creatorcontrib>Searles, Veronica B</creatorcontrib><creatorcontrib>Dickens, C Michael</creatorcontrib><creatorcontrib>Astling, David</creatorcontrib><creatorcontrib>Albracht, Derek</creatorcontrib><creatorcontrib>Mak, Angel C Y</creatorcontrib><creatorcontrib>Lai, Yvonne Y Y</creatorcontrib><creatorcontrib>Lin, Chin</creatorcontrib><creatorcontrib>Chu, Catherine</creatorcontrib><creatorcontrib>Graves, Tina</creatorcontrib><creatorcontrib>Kwok, Pui-Yan</creatorcontrib><creatorcontrib>Wilson, Richard K</creatorcontrib><creatorcontrib>Sikela, James M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma 
Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC 
genomics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>O'Bleness, Majesta</au><au>Searles, Veronica B</au><au>Dickens, C Michael</au><au>Astling, David</au><au>Albracht, Derek</au><au>Mak, Angel C Y</au><au>Lai, Yvonne Y Y</au><au>Lin, Chin</au><au>Chu, Catherine</au><au>Graves, Tina</au><au>Kwok, Pui-Yan</au><au>Wilson, Richard K</au><au>Sikela, James M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome</atitle><jtitle>BMC genomics</jtitle><addtitle>BMC Genomics</addtitle><date>2014-05-20</date><risdate>2014</risdate><volume>15</volume><issue>1</issue><spage>387</spage><epage>387</epage><pages>387-387</pages><artnum>387</artnum><issn>1471-2164</issn><eissn>1471-2164</eissn><abstract>Although the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region. We found and addressed several inaccuracies in the GRCh37 sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. 
The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion. Through the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>24885025</pmid><doi>10.1186/1471-2164-15-387</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2164
ispartof BMC genomics, 2014-05, Vol.15 (1), p.387-387, Article 387
issn 1471-2164
1471-2164
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4053653
source MEDLINE; DOAJ Directory of Open Access Journals; PubMed Central Open Access; Springer Nature OA Free Journals; SpringerLink Journals (MCLS); EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Academic libraries
Biological Evolution
Carrier Proteins - genetics
Chromosomes, Human, Pair 1
Colleges & universities
Comparative Genomic Hybridization
Data analysis
Deoxyribonucleic acid
Disease
Disease susceptibility
DNA
DNA Copy Number Variations
Genes
Genetic Linkage
Genetic testing
Genetics
Genome, Human
Genomes
Genomics
Haploidy
Humans
Methods
Protein Structure, Tertiary - genetics
Segmental Duplications, Genomic
title Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-31T16%3A22%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Finished%20sequence%20and%20assembly%20of%20the%20DUF1220-rich%201q21%20region%20using%20a%20haploid%20human%20genome&rft.jtitle=BMC%20genomics&rft.au=O'Bleness,%20Majesta&rft.date=2014-05-20&rft.volume=15&rft.issue=1&rft.spage=387&rft.epage=387&rft.pages=387-387&rft.artnum=387&rft.issn=1471-2164&rft.eissn=1471-2164&rft_id=info:doi/10.1186/1471-2164-15-387&rft_dat=%3Cgale_pubme%3EA539565852%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1536437455&rft_id=info:pmid/24885025&rft_galeid=A539565852&rfr_iscdi=true