DNA sequence features underlying large-scale duplications and deletions in human

Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characte...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of applied genetics 2022-09, Vol.63 (3), p.527-533
Hauptverfasser: Kołomański, Mateusz, Szyda, Joanna, Frąszczak, Magdalena, Mielczarek, Magda
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 533
container_issue 3
container_start_page 527
container_title Journal of applied genetics
container_volume 63
creator Kołomański, Mateusz
Szyda, Joanna
Frąszczak, Magdalena
Mielczarek, Magda
description Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.
doi_str_mv 10.1007/s13353-022-00704-0
format Article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9365719</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A713610586</galeid><sourcerecordid>A713610586</sourcerecordid><originalsourceid>FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</originalsourceid><addsrcrecordid>eNp9UU1v1TAQtBCIPlr-AAcUibPL2o4_ckF6KuVDqqCH3i3H2aSuEudhJ0j99zWklFZCaA-r9c6MdjyEvGFwygD0-8yEkIIC57SMUFN4RnacNUCFMeI52TEuasoaI47Iq5xvAISpNX9JjoSUDYCRO3L58du-yvhjxeix6tEta8JcrbHDNN6GOFSjSwPS7N2IVbcexuDdEuaYKxe7qsMRtynE6nqdXDwhL3o3Znx934_J1afzq7Mv9OL7569n-wvqa84XqoxnTHLFfGM6VLoGw6AHAYo3qtV969vWcVVLwwGZ0RpbbWqJuuVNo704Jh822cPaTth5jEtyoz2kMLl0a2cX7NNNDNd2mH_aRiipWVME3t0LpLm4z4u9mdcUy8mW6_Kdkun6EWoo9m2I_VzE_BSyt3vNhGIgjSqo03-gSnU4BT9H7EN5f0LgG8GnOeeE_cPhDOyvbO2WrS3Z2t_ZWiikt48tP1D-hFkAYgPksooDpr-W_iN7B3S9rbw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2700751749</pqid></control><display><type>article</type><title>DNA sequence features underlying large-scale duplications and deletions in human</title><source>SpringerNature Journals</source><creator>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</creator><creatorcontrib>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</creatorcontrib><description>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</description><identifier>ISSN: 1234-1983</identifier><identifier>EISSN: 2190-3883</identifier><identifier>DOI: 10.1007/s13353-022-00704-0</identifier><identifier>PMID: 35590085</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Animal Genetics and Genomics ; Biological activity ; Biomedical and Life Sciences ; Cell adhesion ; Complexity ; Copy number ; Cytoskeleton ; Deoxyribonucleic acid ; DNA ; DNA sequencing ; Genomes ; Genomics ; Human Genetics ; Human Genetics • Original Paper ; Immune response ; Introns ; Life Sciences ; Microbial Genetics and Genomics ; Nucleotide sequence ; Nucleotide sequencing ; Phenotypes ; Plant Genetics and Genomics ; Reproduction (copying) ; Synaptic transmission</subject><ispartof>Journal of applied genetics, 2022-09, Vol.63 (3), p.527-533</ispartof><rights>The Author(s) 2022</rights><rights>2022. The Author(s).</rights><rights>COPYRIGHT 2022 Springer</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</cites><orcidid>0000-0001-9688-0193 ; 0000-0002-8012-4980 ; 0000-0002-1086-9119 ; 0000-0001-7424-3919</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s13353-022-00704-0$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s13353-022-00704-0$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>230,315,781,785,886,27928,27929,41492,42561,51323</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35590085$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kołomański, Mateusz</creatorcontrib><creatorcontrib>Szyda, Joanna</creatorcontrib><creatorcontrib>Frąszczak, Magdalena</creatorcontrib><creatorcontrib>Mielczarek, Magda</creatorcontrib><title>DNA sequence features underlying large-scale duplications and deletions in human</title><title>Journal of applied genetics</title><addtitle>J Appl Genetics</addtitle><addtitle>J Appl Genet</addtitle><description>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</description><subject>Animal Genetics and Genomics</subject><subject>Biological activity</subject><subject>Biomedical and Life Sciences</subject><subject>Cell adhesion</subject><subject>Complexity</subject><subject>Copy number</subject><subject>Cytoskeleton</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Human Genetics</subject><subject>Human Genetics • Original Paper</subject><subject>Immune response</subject><subject>Introns</subject><subject>Life Sciences</subject><subject>Microbial Genetics and Genomics</subject><subject>Nucleotide sequence</subject><subject>Nucleotide sequencing</subject><subject>Phenotypes</subject><subject>Plant Genetics and Genomics</subject><subject>Reproduction (copying)</subject><subject>Synaptic transmission</subject><issn>1234-1983</issn><issn>2190-3883</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9UU1v1TAQtBCIPlr-AAcUibPL2o4_ckF6KuVDqqCH3i3H2aSuEudhJ0j99zWklFZCaA-r9c6MdjyEvGFwygD0-8yEkIIC57SMUFN4RnacNUCFMeI52TEuasoaI47Iq5xvAISpNX9JjoSUDYCRO3L58du-yvhjxeix6tEta8JcrbHDNN6GOFSjSwPS7N2IVbcexuDdEuaYKxe7qsMRtynE6nqdXDwhL3o3Znx934_J1afzq7Mv9OL7569n-wvqa84XqoxnTHLFfGM6VLoGw6AHAYo3qtV969vWcVVLwwGZ0RpbbWqJuuVNo704Jh822cPaTth5jEtyoz2kMLl0a2cX7NNNDNd2mH_aRiipWVME3t0LpLm4z4u9mdcUy8mW6_Kdkun6EWoo9m2I_VzE_BSyt3vNhGIgjSqo03-gSnU4BT9H7EN5f0LgG8GnOeeE_cPhDOyvbO2WrS3Z2t_ZWiikt48tP1D-hFkAYgPksooDpr-W_iN7B3S9rbw</recordid><startdate>20220901</startdate><enddate>20220901</enddate><creator>Kołomański, Mateusz</creator><creator>Szyda, Joanna</creator><creator>Frąszczak, Magdalena</creator><creator>Mielczarek, Magda</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-9688-0193</orcidid><orcidid>https://orcid.org/0000-0002-8012-4980</orcidid><orcidid>https://orcid.org/0000-0002-1086-9119</orcidid><orcidid>https://orcid.org/0000-0001-7424-3919</orcidid></search><sort><creationdate>20220901</creationdate><title>DNA sequence features underlying large-scale duplications and deletions in human</title><author>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Animal Genetics and Genomics</topic><topic>Biological activity</topic><topic>Biomedical and Life Sciences</topic><topic>Cell adhesion</topic><topic>Complexity</topic><topic>Copy number</topic><topic>Cytoskeleton</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Human Genetics</topic><topic>Human Genetics • Original Paper</topic><topic>Immune response</topic><topic>Introns</topic><topic>Life Sciences</topic><topic>Microbial Genetics and Genomics</topic><topic>Nucleotide sequence</topic><topic>Nucleotide sequencing</topic><topic>Phenotypes</topic><topic>Plant Genetics and Genomics</topic><topic>Reproduction (copying)</topic><topic>Synaptic transmission</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kołomański, Mateusz</creatorcontrib><creatorcontrib>Szyda, Joanna</creatorcontrib><creatorcontrib>Frąszczak, Magdalena</creatorcontrib><creatorcontrib>Mielczarek, Magda</creatorcontrib><collection>Springer Nature OA/Free Journals</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of applied genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kołomański, Mateusz</au><au>Szyda, Joanna</au><au>Frąszczak, Magdalena</au><au>Mielczarek, Magda</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DNA sequence features underlying large-scale duplications and deletions in human</atitle><jtitle>Journal of applied genetics</jtitle><stitle>J Appl Genetics</stitle><addtitle>J Appl Genet</addtitle><date>2022-09-01</date><risdate>2022</risdate><volume>63</volume><issue>3</issue><spage>527</spage><epage>533</epage><pages>527-533</pages><issn>1234-1983</issn><eissn>2190-3883</eissn><abstract>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><pmid>35590085</pmid><doi>10.1007/s13353-022-00704-0</doi><tpages>7</tpages><orcidid>https://orcid.org/0000-0001-9688-0193</orcidid><orcidid>https://orcid.org/0000-0002-8012-4980</orcidid><orcidid>https://orcid.org/0000-0002-1086-9119</orcidid><orcidid>https://orcid.org/0000-0001-7424-3919</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1234-1983
ispartof Journal of applied genetics, 2022-09, Vol.63 (3), p.527-533
issn 1234-1983
2190-3883
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9365719
source SpringerNature Journals
subjects Animal Genetics and Genomics
Biological activity
Biomedical and Life Sciences
Cell adhesion
Complexity
Copy number
Cytoskeleton
Deoxyribonucleic acid
DNA
DNA sequencing
Genomes
Genomics
Human Genetics
Human Genetics • Original Paper
Immune response
Introns
Life Sciences
Microbial Genetics and Genomics
Nucleotide sequence
Nucleotide sequencing
Phenotypes
Plant Genetics and Genomics
Reproduction (copying)
Synaptic transmission
title DNA sequence features underlying large-scale duplications and deletions in human
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T00%3A00%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DNA%20sequence%20features%20underlying%20large-scale%20duplications%20and%20deletions%20in%20human&rft.jtitle=Journal%20of%20applied%20genetics&rft.au=Ko%C5%82oma%C5%84ski,%20Mateusz&rft.date=2022-09-01&rft.volume=63&rft.issue=3&rft.spage=527&rft.epage=533&rft.pages=527-533&rft.issn=1234-1983&rft.eissn=2190-3883&rft_id=info:doi/10.1007/s13353-022-00704-0&rft_dat=%3Cgale_pubme%3EA713610586%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2700751749&rft_id=info:pmid/35590085&rft_galeid=A713610586&rfr_iscdi=true