OMA standalone: orthology inference among public and custom genomes and transcriptomes

Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs-corresponding genes across multiple species-but this is far from trivial. The Orthologous MAtrix (OMA) dat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Genome research 2019-07, Vol.29 (7), p.1152-1163
Hauptverfasser: Altenhoff, Adrian M, Levy, Jeremy, Zarowiecki, Magdalena, Tomiczek, Bartłomiej, Warwick Vesztrocy, Alex, Dalquen, Daniel A, Müller, Steven, Telford, Maximilian J, Glover, Natasha M, Dylus, David, Dessimoz, Christophe
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1163
container_issue 7
container_start_page 1152
container_title Genome research
container_volume 29
creator Altenhoff, Adrian M
Levy, Jeremy
Zarowiecki, Magdalena
Tomiczek, Bartłomiej
Warwick Vesztrocy, Alex
Dalquen, Daniel A
Müller, Steven
Telford, Maximilian J
Glover, Natasha M
Dylus, David
Dessimoz, Christophe
description Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs-corresponding genes across multiple species-but this is far from trivial. The Orthologous MAtrix (OMA) database is a leading resource for identifying orthologs among publicly available, complete genomes. Here, we describe the OMA pipeline available as a standalone program for Linux and Mac. When run on a cluster, it has native support for the LSF, SGE, PBS Pro, and Slurm job schedulers and can scale up to thousands of parallel processes. Another key feature of OMA standalone is that users can combine their own data with existing public data by exporting genomes and precomputed alignments from the OMA database, which currently contains over 2100 complete genomes. We compare OMA standalone to other methods in the context of phylogenetic tree inference, by inferring a phylogeny of Lophotrochozoa, a challenging clade within the protostomes. We also discuss other potential applications of OMA standalone, including identifying gene families having undergone duplications/losses in specific clades, and identifying potential drug targets in nonmodel organisms. OMA standalone is available under the permissive open source Mozilla Public License Version 2.0.
doi_str_mv 10.1101/gr.243212.118
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6633268</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2259348314</sourcerecordid><originalsourceid>FETCH-LOGICAL-c481t-b04c19d2e70ef0f18d1787ca0f0df7b849d4983681accc4ad30adabaa9e8d05e3</originalsourceid><addsrcrecordid>eNpdkb1PwzAQxS0EonyNrCgSC0uKHTuJzYBUIb6kIhZgtVz7kqZK7GInSP3vcSlUwGTf3U9P7-4hdErwmBBMLms_zhjNSBZLvoMOSM5EmrNC7MY_5jwVOCcjdBjCAmNMGef7aERJRvMiZwfo7flpkoReWaNaZ-Eqcb6fu9bVq6SxFXiwGhLVOVsny2HWNjqJaKKH0LsuqcG6DsJXq_fKBu2bZb9uHaO9SrUBTr7fI_R6d_ty85BOn-8fbybTVDNO-nSGmSbCZFBiqHBFuCElL7XCFTZVOeNMGCY4LThRWmumDMXKqJlSArjBOdAjdL3RjeY6MBps9NHKpW865VfSqUb-ndhmLmv3IYuC0qzgUeDiW8C79wFCL7smaGhbZcENQWZZPCUuRLlGz_-hCzd4G9eLVC7iaSlhkUo3lPYuBA_V1gzBcp2YrL3cJBbLterZ7w229E9E9BP8nJOF</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2259348314</pqid></control><display><type>article</type><title>OMA standalone: orthology inference among public and custom genomes and transcriptomes</title><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Altenhoff, Adrian M ; Levy, Jeremy ; Zarowiecki, Magdalena ; Tomiczek, Bartłomiej ; Warwick Vesztrocy, Alex ; Dalquen, Daniel A ; Müller, Steven ; Telford, Maximilian J ; Glover, Natasha M ; Dylus, David ; Dessimoz, Christophe</creator><creatorcontrib>Altenhoff, Adrian M ; Levy, Jeremy ; Zarowiecki, Magdalena ; Tomiczek, Bartłomiej ; Warwick Vesztrocy, Alex ; Dalquen, Daniel A ; Müller, Steven ; Telford, Maximilian J ; Glover, Natasha M ; Dylus, David ; Dessimoz, Christophe</creatorcontrib><description>Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs-corresponding genes across multiple species-but this is far from trivial. The Orthologous MAtrix (OMA) database is a leading resource for identifying orthologs among publicly available, complete genomes. Here, we describe the OMA pipeline available as a standalone program for Linux and Mac. When run on a cluster, it has native support for the LSF, SGE, PBS Pro, and Slurm job schedulers and can scale up to thousands of parallel processes. Another key feature of OMA standalone is that users can combine their own data with existing public data by exporting genomes and precomputed alignments from the OMA database, which currently contains over 2100 complete genomes. We compare OMA standalone to other methods in the context of phylogenetic tree inference, by inferring a phylogeny of Lophotrochozoa, a challenging clade within the protostomes. We also discuss other potential applications of OMA standalone, including identifying gene families having undergone duplications/losses in specific clades, and identifying potential drug targets in nonmodel organisms. OMA standalone is available under the permissive open source Mozilla Public License Version 2.0.</description><identifier>ISSN: 1088-9051</identifier><identifier>EISSN: 1549-5469</identifier><identifier>DOI: 10.1101/gr.243212.118</identifier><identifier>PMID: 31235654</identifier><language>eng</language><publisher>United States: Cold Spring Harbor Laboratory Press</publisher><subject>Animals ; Databases, Genetic ; Gene families ; Genome ; Genomes ; Invertebrates - classification ; Invertebrates - genetics ; Method ; Orthology ; Phylogeny ; Software ; Transcriptome</subject><ispartof>Genome research, 2019-07, Vol.29 (7), p.1152-1163</ispartof><rights>2019 Altenhoff et al.; Published by Cold Spring Harbor Laboratory Press.</rights><rights>Copyright Cold Spring Harbor Laboratory Press Jul 2019</rights><rights>2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c481t-b04c19d2e70ef0f18d1787ca0f0df7b849d4983681accc4ad30adabaa9e8d05e3</citedby><cites>FETCH-LOGICAL-c481t-b04c19d2e70ef0f18d1787ca0f0df7b849d4983681accc4ad30adabaa9e8d05e3</cites><orcidid>0000-0002-2170-853X ; 0000-0001-6102-7731 ; 0000-0002-4074-4261 ; 0000-0001-7492-1273 ; 0000-0003-1811-4340 ; 0000-0002-3749-5620</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6633268/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6633268/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31235654$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Altenhoff, Adrian M</creatorcontrib><creatorcontrib>Levy, Jeremy</creatorcontrib><creatorcontrib>Zarowiecki, Magdalena</creatorcontrib><creatorcontrib>Tomiczek, Bartłomiej</creatorcontrib><creatorcontrib>Warwick Vesztrocy, Alex</creatorcontrib><creatorcontrib>Dalquen, Daniel A</creatorcontrib><creatorcontrib>Müller, Steven</creatorcontrib><creatorcontrib>Telford, Maximilian J</creatorcontrib><creatorcontrib>Glover, Natasha M</creatorcontrib><creatorcontrib>Dylus, David</creatorcontrib><creatorcontrib>Dessimoz, Christophe</creatorcontrib><title>OMA standalone: orthology inference among public and custom genomes and transcriptomes</title><title>Genome research</title><addtitle>Genome Res</addtitle><description>Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs-corresponding genes across multiple species-but this is far from trivial. The Orthologous MAtrix (OMA) database is a leading resource for identifying orthologs among publicly available, complete genomes. Here, we describe the OMA pipeline available as a standalone program for Linux and Mac. When run on a cluster, it has native support for the LSF, SGE, PBS Pro, and Slurm job schedulers and can scale up to thousands of parallel processes. Another key feature of OMA standalone is that users can combine their own data with existing public data by exporting genomes and precomputed alignments from the OMA database, which currently contains over 2100 complete genomes. We compare OMA standalone to other methods in the context of phylogenetic tree inference, by inferring a phylogeny of Lophotrochozoa, a challenging clade within the protostomes. We also discuss other potential applications of OMA standalone, including identifying gene families having undergone duplications/losses in specific clades, and identifying potential drug targets in nonmodel organisms. OMA standalone is available under the permissive open source Mozilla Public License Version 2.0.</description><subject>Animals</subject><subject>Databases, Genetic</subject><subject>Gene families</subject><subject>Genome</subject><subject>Genomes</subject><subject>Invertebrates - classification</subject><subject>Invertebrates - genetics</subject><subject>Method</subject><subject>Orthology</subject><subject>Phylogeny</subject><subject>Software</subject><subject>Transcriptome</subject><issn>1088-9051</issn><issn>1549-5469</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpdkb1PwzAQxS0EonyNrCgSC0uKHTuJzYBUIb6kIhZgtVz7kqZK7GInSP3vcSlUwGTf3U9P7-4hdErwmBBMLms_zhjNSBZLvoMOSM5EmrNC7MY_5jwVOCcjdBjCAmNMGef7aERJRvMiZwfo7flpkoReWaNaZ-Eqcb6fu9bVq6SxFXiwGhLVOVsny2HWNjqJaKKH0LsuqcG6DsJXq_fKBu2bZb9uHaO9SrUBTr7fI_R6d_ty85BOn-8fbybTVDNO-nSGmSbCZFBiqHBFuCElL7XCFTZVOeNMGCY4LThRWmumDMXKqJlSArjBOdAjdL3RjeY6MBps9NHKpW865VfSqUb-ndhmLmv3IYuC0qzgUeDiW8C79wFCL7smaGhbZcENQWZZPCUuRLlGz_-hCzd4G9eLVC7iaSlhkUo3lPYuBA_V1gzBcp2YrL3cJBbLterZ7w229E9E9BP8nJOF</recordid><startdate>20190701</startdate><enddate>20190701</enddate><creator>Altenhoff, Adrian M</creator><creator>Levy, Jeremy</creator><creator>Zarowiecki, Magdalena</creator><creator>Tomiczek, Bartłomiej</creator><creator>Warwick Vesztrocy, Alex</creator><creator>Dalquen, Daniel A</creator><creator>Müller, Steven</creator><creator>Telford, Maximilian J</creator><creator>Glover, Natasha M</creator><creator>Dylus, David</creator><creator>Dessimoz, Christophe</creator><general>Cold Spring Harbor Laboratory Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-2170-853X</orcidid><orcidid>https://orcid.org/0000-0001-6102-7731</orcidid><orcidid>https://orcid.org/0000-0002-4074-4261</orcidid><orcidid>https://orcid.org/0000-0001-7492-1273</orcidid><orcidid>https://orcid.org/0000-0003-1811-4340</orcidid><orcidid>https://orcid.org/0000-0002-3749-5620</orcidid></search><sort><creationdate>20190701</creationdate><title>OMA standalone: orthology inference among public and custom genomes and transcriptomes</title><author>Altenhoff, Adrian M ; Levy, Jeremy ; Zarowiecki, Magdalena ; Tomiczek, Bartłomiej ; Warwick Vesztrocy, Alex ; Dalquen, Daniel A ; Müller, Steven ; Telford, Maximilian J ; Glover, Natasha M ; Dylus, David ; Dessimoz, Christophe</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c481t-b04c19d2e70ef0f18d1787ca0f0df7b849d4983681accc4ad30adabaa9e8d05e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Animals</topic><topic>Databases, Genetic</topic><topic>Gene families</topic><topic>Genome</topic><topic>Genomes</topic><topic>Invertebrates - classification</topic><topic>Invertebrates - genetics</topic><topic>Method</topic><topic>Orthology</topic><topic>Phylogeny</topic><topic>Software</topic><topic>Transcriptome</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Altenhoff, Adrian M</creatorcontrib><creatorcontrib>Levy, Jeremy</creatorcontrib><creatorcontrib>Zarowiecki, Magdalena</creatorcontrib><creatorcontrib>Tomiczek, Bartłomiej</creatorcontrib><creatorcontrib>Warwick Vesztrocy, Alex</creatorcontrib><creatorcontrib>Dalquen, Daniel A</creatorcontrib><creatorcontrib>Müller, Steven</creatorcontrib><creatorcontrib>Telford, Maximilian J</creatorcontrib><creatorcontrib>Glover, Natasha M</creatorcontrib><creatorcontrib>Dylus, David</creatorcontrib><creatorcontrib>Dessimoz, Christophe</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Genome research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Altenhoff, Adrian M</au><au>Levy, Jeremy</au><au>Zarowiecki, Magdalena</au><au>Tomiczek, Bartłomiej</au><au>Warwick Vesztrocy, Alex</au><au>Dalquen, Daniel A</au><au>Müller, Steven</au><au>Telford, Maximilian J</au><au>Glover, Natasha M</au><au>Dylus, David</au><au>Dessimoz, Christophe</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>OMA standalone: orthology inference among public and custom genomes and transcriptomes</atitle><jtitle>Genome research</jtitle><addtitle>Genome Res</addtitle><date>2019-07-01</date><risdate>2019</risdate><volume>29</volume><issue>7</issue><spage>1152</spage><epage>1163</epage><pages>1152-1163</pages><issn>1088-9051</issn><eissn>1549-5469</eissn><abstract>Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs-corresponding genes across multiple species-but this is far from trivial. The Orthologous MAtrix (OMA) database is a leading resource for identifying orthologs among publicly available, complete genomes. Here, we describe the OMA pipeline available as a standalone program for Linux and Mac. When run on a cluster, it has native support for the LSF, SGE, PBS Pro, and Slurm job schedulers and can scale up to thousands of parallel processes. Another key feature of OMA standalone is that users can combine their own data with existing public data by exporting genomes and precomputed alignments from the OMA database, which currently contains over 2100 complete genomes. We compare OMA standalone to other methods in the context of phylogenetic tree inference, by inferring a phylogeny of Lophotrochozoa, a challenging clade within the protostomes. We also discuss other potential applications of OMA standalone, including identifying gene families having undergone duplications/losses in specific clades, and identifying potential drug targets in nonmodel organisms. OMA standalone is available under the permissive open source Mozilla Public License Version 2.0.</abstract><cop>United States</cop><pub>Cold Spring Harbor Laboratory Press</pub><pmid>31235654</pmid><doi>10.1101/gr.243212.118</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0002-2170-853X</orcidid><orcidid>https://orcid.org/0000-0001-6102-7731</orcidid><orcidid>https://orcid.org/0000-0002-4074-4261</orcidid><orcidid>https://orcid.org/0000-0001-7492-1273</orcidid><orcidid>https://orcid.org/0000-0003-1811-4340</orcidid><orcidid>https://orcid.org/0000-0002-3749-5620</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1088-9051
ispartof Genome research, 2019-07, Vol.29 (7), p.1152-1163
issn 1088-9051
1549-5469
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6633268
source MEDLINE; PubMed Central; Alma/SFX Local Collection
subjects Animals
Databases, Genetic
Gene families
Genome
Genomes
Invertebrates - classification
Invertebrates - genetics
Method
Orthology
Phylogeny
Software
Transcriptome
title OMA standalone: orthology inference among public and custom genomes and transcriptomes
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T18%3A28%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=OMA%20standalone:%20orthology%20inference%20among%20public%20and%20custom%20genomes%20and%20transcriptomes&rft.jtitle=Genome%20research&rft.au=Altenhoff,%20Adrian%20M&rft.date=2019-07-01&rft.volume=29&rft.issue=7&rft.spage=1152&rft.epage=1163&rft.pages=1152-1163&rft.issn=1088-9051&rft.eissn=1549-5469&rft_id=info:doi/10.1101/gr.243212.118&rft_dat=%3Cproquest_pubme%3E2259348314%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2259348314&rft_id=info:pmid/31235654&rfr_iscdi=true