DNA sequence features underlying large-scale duplications and deletions in human
Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the 1000 Genomes Project to characterise genomic regions vulnerable to CNV formation and to identify sequence features characte...
Gespeichert in:
Veröffentlicht in: | Journal of applied genetics 2022-09, Vol.63 (3), p.527-533 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 533 |
---|---|
container_issue | 3 |
container_start_page | 527 |
container_title | Journal of applied genetics |
container_volume | 63 |
creator | Kołomański, Mateusz Szyda, Joanna Frąszczak, Magdalena Mielczarek, Magda |
description | Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the
1000 Genomes Project
to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles. |
doi_str_mv | 10.1007/s13353-022-00704-0 |
format | Article |
fullrecord | <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9365719</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A713610586</galeid><sourcerecordid>A713610586</sourcerecordid><originalsourceid>FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</originalsourceid><addsrcrecordid>eNp9UU1v1TAQtBCIPlr-AAcUibPL2o4_ckF6KuVDqqCH3i3H2aSuEudhJ0j99zWklFZCaA-r9c6MdjyEvGFwygD0-8yEkIIC57SMUFN4RnacNUCFMeI52TEuasoaI47Iq5xvAISpNX9JjoSUDYCRO3L58du-yvhjxeix6tEta8JcrbHDNN6GOFSjSwPS7N2IVbcexuDdEuaYKxe7qsMRtynE6nqdXDwhL3o3Znx934_J1afzq7Mv9OL7569n-wvqa84XqoxnTHLFfGM6VLoGw6AHAYo3qtV969vWcVVLwwGZ0RpbbWqJuuVNo704Jh822cPaTth5jEtyoz2kMLl0a2cX7NNNDNd2mH_aRiipWVME3t0LpLm4z4u9mdcUy8mW6_Kdkun6EWoo9m2I_VzE_BSyt3vNhGIgjSqo03-gSnU4BT9H7EN5f0LgG8GnOeeE_cPhDOyvbO2WrS3Z2t_ZWiikt48tP1D-hFkAYgPksooDpr-W_iN7B3S9rbw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2700751749</pqid></control><display><type>article</type><title>DNA sequence features underlying large-scale duplications and deletions in human</title><source>SpringerNature Journals</source><creator>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</creator><creatorcontrib>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</creatorcontrib><description>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the
1000 Genomes Project
to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</description><identifier>ISSN: 1234-1983</identifier><identifier>EISSN: 2190-3883</identifier><identifier>DOI: 10.1007/s13353-022-00704-0</identifier><identifier>PMID: 35590085</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Animal Genetics and Genomics ; Biological activity ; Biomedical and Life Sciences ; Cell adhesion ; Complexity ; Copy number ; Cytoskeleton ; Deoxyribonucleic acid ; DNA ; DNA sequencing ; Genomes ; Genomics ; Human Genetics ; Human Genetics • Original Paper ; Immune response ; Introns ; Life Sciences ; Microbial Genetics and Genomics ; Nucleotide sequence ; Nucleotide sequencing ; Phenotypes ; Plant Genetics and Genomics ; Reproduction (copying) ; Synaptic transmission</subject><ispartof>Journal of applied genetics, 2022-09, Vol.63 (3), p.527-533</ispartof><rights>The Author(s) 2022</rights><rights>2022. The Author(s).</rights><rights>COPYRIGHT 2022 Springer</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</cites><orcidid>0000-0001-9688-0193 ; 0000-0002-8012-4980 ; 0000-0002-1086-9119 ; 0000-0001-7424-3919</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s13353-022-00704-0$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s13353-022-00704-0$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>230,315,781,785,886,27928,27929,41492,42561,51323</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35590085$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kołomański, Mateusz</creatorcontrib><creatorcontrib>Szyda, Joanna</creatorcontrib><creatorcontrib>Frąszczak, Magdalena</creatorcontrib><creatorcontrib>Mielczarek, Magda</creatorcontrib><title>DNA sequence features underlying large-scale duplications and deletions in human</title><title>Journal of applied genetics</title><addtitle>J Appl Genetics</addtitle><addtitle>J Appl Genet</addtitle><description>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the
1000 Genomes Project
to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</description><subject>Animal Genetics and Genomics</subject><subject>Biological activity</subject><subject>Biomedical and Life Sciences</subject><subject>Cell adhesion</subject><subject>Complexity</subject><subject>Copy number</subject><subject>Cytoskeleton</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Human Genetics</subject><subject>Human Genetics • Original Paper</subject><subject>Immune response</subject><subject>Introns</subject><subject>Life Sciences</subject><subject>Microbial Genetics and Genomics</subject><subject>Nucleotide sequence</subject><subject>Nucleotide sequencing</subject><subject>Phenotypes</subject><subject>Plant Genetics and Genomics</subject><subject>Reproduction (copying)</subject><subject>Synaptic transmission</subject><issn>1234-1983</issn><issn>2190-3883</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9UU1v1TAQtBCIPlr-AAcUibPL2o4_ckF6KuVDqqCH3i3H2aSuEudhJ0j99zWklFZCaA-r9c6MdjyEvGFwygD0-8yEkIIC57SMUFN4RnacNUCFMeI52TEuasoaI47Iq5xvAISpNX9JjoSUDYCRO3L58du-yvhjxeix6tEta8JcrbHDNN6GOFSjSwPS7N2IVbcexuDdEuaYKxe7qsMRtynE6nqdXDwhL3o3Znx934_J1afzq7Mv9OL7569n-wvqa84XqoxnTHLFfGM6VLoGw6AHAYo3qtV969vWcVVLwwGZ0RpbbWqJuuVNo704Jh822cPaTth5jEtyoz2kMLl0a2cX7NNNDNd2mH_aRiipWVME3t0LpLm4z4u9mdcUy8mW6_Kdkun6EWoo9m2I_VzE_BSyt3vNhGIgjSqo03-gSnU4BT9H7EN5f0LgG8GnOeeE_cPhDOyvbO2WrS3Z2t_ZWiikt48tP1D-hFkAYgPksooDpr-W_iN7B3S9rbw</recordid><startdate>20220901</startdate><enddate>20220901</enddate><creator>Kołomański, Mateusz</creator><creator>Szyda, Joanna</creator><creator>Frąszczak, Magdalena</creator><creator>Mielczarek, Magda</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-9688-0193</orcidid><orcidid>https://orcid.org/0000-0002-8012-4980</orcidid><orcidid>https://orcid.org/0000-0002-1086-9119</orcidid><orcidid>https://orcid.org/0000-0001-7424-3919</orcidid></search><sort><creationdate>20220901</creationdate><title>DNA sequence features underlying large-scale duplications and deletions in human</title><author>Kołomański, Mateusz ; Szyda, Joanna ; Frąszczak, Magdalena ; Mielczarek, Magda</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c422t-68c115261c98de6740810f0306296b7fbcbba2645820e1877eb7845e7b2997c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Animal Genetics and Genomics</topic><topic>Biological activity</topic><topic>Biomedical and Life Sciences</topic><topic>Cell adhesion</topic><topic>Complexity</topic><topic>Copy number</topic><topic>Cytoskeleton</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Human Genetics</topic><topic>Human Genetics • Original Paper</topic><topic>Immune response</topic><topic>Introns</topic><topic>Life Sciences</topic><topic>Microbial Genetics and Genomics</topic><topic>Nucleotide sequence</topic><topic>Nucleotide sequencing</topic><topic>Phenotypes</topic><topic>Plant Genetics and Genomics</topic><topic>Reproduction (copying)</topic><topic>Synaptic transmission</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kołomański, Mateusz</creatorcontrib><creatorcontrib>Szyda, Joanna</creatorcontrib><creatorcontrib>Frąszczak, Magdalena</creatorcontrib><creatorcontrib>Mielczarek, Magda</creatorcontrib><collection>Springer Nature OA/Free Journals</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of applied genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kołomański, Mateusz</au><au>Szyda, Joanna</au><au>Frąszczak, Magdalena</au><au>Mielczarek, Magda</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DNA sequence features underlying large-scale duplications and deletions in human</atitle><jtitle>Journal of applied genetics</jtitle><stitle>J Appl Genetics</stitle><addtitle>J Appl Genet</addtitle><date>2022-09-01</date><risdate>2022</risdate><volume>63</volume><issue>3</issue><spage>527</spage><epage>533</epage><pages>527-533</pages><issn>1234-1983</issn><eissn>2190-3883</eissn><abstract>Copy number variants (CNVs) may cover up to 12% of the whole genome and have substantial impact on phenotypes. We used 5867 duplications and 33,181 deletions available from the
1000 Genomes Project
to characterise genomic regions vulnerable to CNV formation and to identify sequence features characteristic for those regions. The GC content for deletions was lower and for duplications was higher than for randomly selected regions. In regions flanking deletions and downstream of duplications, content was higher than in the random sequences, but upstream of duplication content was lower. In duplications and downstream of deletion regions, the percentage of low-complexity sequences was not different from the randomised data. In deletions and upstream of CNVs, it was higher, while for downstream of duplications, it was lower as compared to random sequences. The majority of CNVs intersected with genic regions — mainly with introns. GC content may be associated with CNV formation and CNVs, especially duplications are initiated in low-complexity regions. Moreover, CNVs located or overlapped with introns indicate their role in shaping intron variability. Genic CNV regions were enriched in many essential biological processes such as cell adhesion, synaptic transmission, transport, cytoskeleton organization, immune response and metabolic mechanisms, which indicates that these large-scaled variants play important biological roles.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><pmid>35590085</pmid><doi>10.1007/s13353-022-00704-0</doi><tpages>7</tpages><orcidid>https://orcid.org/0000-0001-9688-0193</orcidid><orcidid>https://orcid.org/0000-0002-8012-4980</orcidid><orcidid>https://orcid.org/0000-0002-1086-9119</orcidid><orcidid>https://orcid.org/0000-0001-7424-3919</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1234-1983 |
ispartof | Journal of applied genetics, 2022-09, Vol.63 (3), p.527-533 |
issn | 1234-1983 2190-3883 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9365719 |
source | SpringerNature Journals |
subjects | Animal Genetics and Genomics Biological activity Biomedical and Life Sciences Cell adhesion Complexity Copy number Cytoskeleton Deoxyribonucleic acid DNA DNA sequencing Genomes Genomics Human Genetics Human Genetics • Original Paper Immune response Introns Life Sciences Microbial Genetics and Genomics Nucleotide sequence Nucleotide sequencing Phenotypes Plant Genetics and Genomics Reproduction (copying) Synaptic transmission |
title | DNA sequence features underlying large-scale duplications and deletions in human |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T00%3A00%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DNA%20sequence%20features%20underlying%20large-scale%20duplications%20and%20deletions%20in%20human&rft.jtitle=Journal%20of%20applied%20genetics&rft.au=Ko%C5%82oma%C5%84ski,%20Mateusz&rft.date=2022-09-01&rft.volume=63&rft.issue=3&rft.spage=527&rft.epage=533&rft.pages=527-533&rft.issn=1234-1983&rft.eissn=2190-3883&rft_id=info:doi/10.1007/s13353-022-00704-0&rft_dat=%3Cgale_pubme%3EA713610586%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2700751749&rft_id=info:pmid/35590085&rft_galeid=A713610586&rfr_iscdi=true |