SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements

Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2010-04, Vol.26 (7), p.867-872
Hauptverfasser: Munshaw, Supriya, Kepler, Thomas B.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 872
container_issue 7
container_start_page 867
container_title Bioinformatics
container_volume 26
creator Munshaw, Supriya
Kepler, Thomas B.
description Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements. Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions. Availability: SoDA2 is available online at https://hippocrates.duhs.duke.edu/soda. Simulated sequences are available upon request. Contact: kepler@duke.edu
doi_str_mv 10.1093/bioinformatics/btq056
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2844993</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>746084868</sourcerecordid><originalsourceid>FETCH-LOGICAL-c576t-6ac6c6380e0a371cd0f15783c98be0d6acedf4a99b6e59ed1fc84df4a005fb4b3</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EomXLRwD5gnoKtWPHsTkgVeVPKm1VobYIuFgTx94aknhrJxX99nW1y0JPnGz5_WbG8x5Cryh5S4liR60PfnQhDjB5k47a6YZU4gnap1yQoiSVeprvTNQFl4TtoRcp_SSkopzz52ivJJTXjLB99P0ifDgu32HAje86O-IziL_CLT4Lne0xrNcxgLnGeRD2WZ688yZPDCMODvthmMew6kM7937E0UKMMK7skMF0gJ456JN9uT0X6OrTx8uTpliefz49OV4WpqrFVAgwwggmiSXAamo64mhVS2aUbC3psmw7x0GpVthK2Y46I_nDS97GtbxlC_R-03c9t4PtTJ4dodfr6AeIdzqA14-V0V_rVbjVpeRcKZYbHG4bxHAz2zTpwSdj-x5GG-ak6-yo5FLI_5OMMUJFNnaBqg1pYkgpWrf7DyX6IT_9OD-9yS_Xvf53mV3Vn8Ay8GYLQDLQu-y38ekvV2ZTKaeZKzacT5P9vdNzuFrUrK508-2H_qJ483XZlFqxeza-uwc</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>733301630</pqid></control><display><type>article</type><title>SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements</title><source>Open Access: PubMed Central</source><source>MEDLINE</source><source>Alma/SFX Local Collection</source><source>EZB Electronic Journals Library</source><source>Oxford Academic Journals (Open Access)</source><creator>Munshaw, Supriya ; Kepler, Thomas B.</creator><creatorcontrib>Munshaw, Supriya ; Kepler, Thomas B.</creatorcontrib><description>Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements. Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions. Availability: SoDA2 is available online at https://hippocrates.duhs.duke.edu/soda. Simulated sequences are available upon request. Contact: kepler@duke.edu</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btq056</identifier><identifier>PMID: 20147303</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Amino Acid Sequence ; B-Lymphocytes - immunology ; Base Sequence ; Biological and medical sciences ; Fundamental and applied biological sciences. Psychology ; Gene Rearrangement, B-Lymphocyte ; General aspects ; Genes, Immunoglobulin ; Immunoglobulins - genetics ; Markov Chains ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Molecular Sequence Data ; Original Papers ; Sequence Alignment ; Software</subject><ispartof>Bioinformatics, 2010-04, Vol.26 (7), p.867-872</ispartof><rights>2015 INIST-CNRS</rights><rights>The Author(s) 2010. Published by Oxford University Press. 2010</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c576t-6ac6c6380e0a371cd0f15783c98be0d6acedf4a99b6e59ed1fc84df4a005fb4b3</citedby><cites>FETCH-LOGICAL-c576t-6ac6c6380e0a371cd0f15783c98be0d6acedf4a99b6e59ed1fc84df4a005fb4b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844993/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844993/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=22576141$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20147303$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Munshaw, Supriya</creatorcontrib><creatorcontrib>Kepler, Thomas B.</creatorcontrib><title>SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements. Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions. Availability: SoDA2 is available online at https://hippocrates.duhs.duke.edu/soda. Simulated sequences are available upon request. Contact: kepler@duke.edu</description><subject>Amino Acid Sequence</subject><subject>B-Lymphocytes - immunology</subject><subject>Base Sequence</subject><subject>Biological and medical sciences</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Gene Rearrangement, B-Lymphocyte</subject><subject>General aspects</subject><subject>Genes, Immunoglobulin</subject><subject>Immunoglobulins - genetics</subject><subject>Markov Chains</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Molecular Sequence Data</subject><subject>Original Papers</subject><subject>Sequence Alignment</subject><subject>Software</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkU9v1DAQxS0EomXLRwD5gnoKtWPHsTkgVeVPKm1VobYIuFgTx94aknhrJxX99nW1y0JPnGz5_WbG8x5Cryh5S4liR60PfnQhDjB5k47a6YZU4gnap1yQoiSVeprvTNQFl4TtoRcp_SSkopzz52ivJJTXjLB99P0ifDgu32HAje86O-IziL_CLT4Lne0xrNcxgLnGeRD2WZ688yZPDCMODvthmMew6kM7937E0UKMMK7skMF0gJ456JN9uT0X6OrTx8uTpliefz49OV4WpqrFVAgwwggmiSXAamo64mhVS2aUbC3psmw7x0GpVthK2Y46I_nDS97GtbxlC_R-03c9t4PtTJ4dodfr6AeIdzqA14-V0V_rVbjVpeRcKZYbHG4bxHAz2zTpwSdj-x5GG-ak6-yo5FLI_5OMMUJFNnaBqg1pYkgpWrf7DyX6IT_9OD-9yS_Xvf53mV3Vn8Ay8GYLQDLQu-y38ekvV2ZTKaeZKzacT5P9vdNzuFrUrK508-2H_qJ483XZlFqxeza-uwc</recordid><startdate>20100401</startdate><enddate>20100401</enddate><creator>Munshaw, Supriya</creator><creator>Kepler, Thomas B.</creator><general>Oxford University Press</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7QO</scope><scope>7T5</scope><scope>8FD</scope><scope>FR3</scope><scope>H94</scope><scope>P64</scope><scope>5PM</scope></search><sort><creationdate>20100401</creationdate><title>SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements</title><author>Munshaw, Supriya ; Kepler, Thomas B.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c576t-6ac6c6380e0a371cd0f15783c98be0d6acedf4a99b6e59ed1fc84df4a005fb4b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Amino Acid Sequence</topic><topic>B-Lymphocytes - immunology</topic><topic>Base Sequence</topic><topic>Biological and medical sciences</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Gene Rearrangement, B-Lymphocyte</topic><topic>General aspects</topic><topic>Genes, Immunoglobulin</topic><topic>Immunoglobulins - genetics</topic><topic>Markov Chains</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Molecular Sequence Data</topic><topic>Original Papers</topic><topic>Sequence Alignment</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Munshaw, Supriya</creatorcontrib><creatorcontrib>Kepler, Thomas B.</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Immunology Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Munshaw, Supriya</au><au>Kepler, Thomas B.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2010-04-01</date><risdate>2010</risdate><volume>26</volume><issue>7</issue><spage>867</spage><epage>872</epage><pages>867-872</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements. Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions. Availability: SoDA2 is available online at https://hippocrates.duhs.duke.edu/soda. Simulated sequences are available upon request. Contact: kepler@duke.edu</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>20147303</pmid><doi>10.1093/bioinformatics/btq056</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2010-04, Vol.26 (7), p.867-872
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2844993
source Open Access: PubMed Central; MEDLINE; Alma/SFX Local Collection; EZB Electronic Journals Library; Oxford Academic Journals (Open Access)
subjects Amino Acid Sequence
B-Lymphocytes - immunology
Base Sequence
Biological and medical sciences
Fundamental and applied biological sciences. Psychology
Gene Rearrangement, B-Lymphocyte
General aspects
Genes, Immunoglobulin
Immunoglobulins - genetics
Markov Chains
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Molecular Sequence Data
Original Papers
Sequence Alignment
Software
title SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T09%3A02%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SoDA2:%20a%20Hidden%20Markov%20Model%20approach%20for%20identification%20of%20immunoglobulin%20rearrangements&rft.jtitle=Bioinformatics&rft.au=Munshaw,%20Supriya&rft.date=2010-04-01&rft.volume=26&rft.issue=7&rft.spage=867&rft.epage=872&rft.pages=867-872&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btq056&rft_dat=%3Cproquest_pubme%3E746084868%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=733301630&rft_id=info:pmid/20147303&rfr_iscdi=true