Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences

The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data...

Bibliographic details
Published in: Protein science 2022-01, Vol.31 (1), p.141-146
Main authors: Olsen, Tobias H., Boyles, Fergus, Deane, Charlotte M.
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 146
container_issue 1
container_start_page 141
container_title Protein science
container_volume 31
creator Olsen, Tobias H.
Boyles, Fergus
Deane, Charlotte M.
description The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.
doi_str_mv 10.1002/pro.4205
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8740823</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2617285144</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4775-3989321ab9a819b37ae2e96e134249772b2ea2b75b058babfa1af768582d5c9b3</originalsourceid><addsrcrecordid>eNp1kV2L1DAUhoO4uOMq-Auk4I0Xds1Xm8QLYVj8goURP8C7cNKeapdO0k3a0fn3m3HGWVfwKu9J3jznTQ4hTxg9Z5Tyl2MM55LT6h5ZMFmbUpv6232yoKZmpRa1PiUPU7qilErGxQNyKmRdVUyIBfm1cgnjBtti6afehXZbfB6hwVfFsmj7DcaERQsTOMgidEUzIHhsXxTgfZhg2su2mCL4NOzqYvYj9DGL3f5RHuAJr2f0DaZH5KSDIeHjw3pGvr598-XifXm5evfhYnlZNlKpqhRGG8EZOAOaGScUIEdTIxOSS6MUdxyBO1U5WmkHrgMGnap1pXlbNfnCGXm9546zW2PboM9RBzvGfg1xawP09u6J73_Y72FjtZJUc5EBzw-AGHL2NNl1nxochvwPYU6W51aaMWZMtj77x3oV5ujz8yyvmeK6YlLeApsYUorYHcMwanfjzHWwu3Fm69O_wx-Nf-aXDeXe8LMfcPtfkP34afUbeAO6fKqh</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2617285144</pqid></control><display><type>article</type><title>Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences</title><source>Wiley Free Content</source><source>MEDLINE</source><source>Wiley Online Library Journals Frontfile Complete</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Olsen, Tobias H. ; Boyles, Fergus ; Deane, Charlotte M.</creator><creatorcontrib>Olsen, Tobias H. ; Boyles, Fergus ; Deane, Charlotte M.</creatorcontrib><description>The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. 
The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. 
The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.</description><identifier>ISSN: 0961-8368</identifier><identifier>EISSN: 1469-896X</identifier><identifier>DOI: 10.1002/pro.4205</identifier><identifier>PMID: 34655133</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley &amp; Sons, Inc</publisher><subject>Accessibility ; Amino Acid Sequence ; Amino acids ; Animals ; annotated antibody sequences ; Annotations ; Antibodies ; Antibodies - chemistry ; Antibodies - immunology ; antibody database ; antibody repertoire ; antibody sequence ; BCR‐seq ; Coronaviruses ; COVID-19 - immunology ; Databases, Protein ; Downloading ; Humans ; Immunoglobulin Heavy Chains - chemistry ; Immunoglobulin Heavy Chains - immunology ; Immunoglobulin Light Chains - chemistry ; Immunoglobulin Light Chains - immunology ; Immunoglobulin Variable Region - chemistry ; Immunoglobulin Variable Region - immunology ; Nucleotide sequence ; Nucleotides ; Observed Antibody Space (OAS) ; Receptors ; SARS-CoV-2 - immunology ; Severe acute respiratory syndrome ; Severe acute respiratory syndrome coronavirus 2 ; Tools for Protein Science ; Viral diseases</subject><ispartof>Protein science, 2022-01, Vol.31 (1), p.141-146</ispartof><rights>2021 The Authors. published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2021 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2021. This article is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4775-3989321ab9a819b37ae2e96e134249772b2ea2b75b058babfa1af768582d5c9b3</citedby><cites>FETCH-LOGICAL-c4775-3989321ab9a819b37ae2e96e134249772b2ea2b75b058babfa1af768582d5c9b3</cites><orcidid>0000-0002-6348-4650 ; 0000-0003-1388-2252</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740823/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740823/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,1411,1427,27901,27902,45550,45551,46384,46808,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34655133$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Olsen, Tobias H.</creatorcontrib><creatorcontrib>Boyles, Fergus</creatorcontrib><creatorcontrib>Deane, Charlotte M.</creatorcontrib><title>Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences</title><title>Protein science</title><addtitle>Protein Sci</addtitle><description>The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. 
However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.</description><subject>Accessibility</subject><subject>Amino Acid Sequence</subject><subject>Amino acids</subject><subject>Animals</subject><subject>annotated antibody sequences</subject><subject>Annotations</subject><subject>Antibodies</subject><subject>Antibodies - chemistry</subject><subject>Antibodies - immunology</subject><subject>antibody database</subject><subject>antibody repertoire</subject><subject>antibody sequence</subject><subject>BCR‐seq</subject><subject>Coronaviruses</subject><subject>COVID-19 - immunology</subject><subject>Databases, Protein</subject><subject>Downloading</subject><subject>Humans</subject><subject>Immunoglobulin Heavy Chains - chemistry</subject><subject>Immunoglobulin Heavy Chains - immunology</subject><subject>Immunoglobulin Light Chains - chemistry</subject><subject>Immunoglobulin Light Chains - immunology</subject><subject>Immunoglobulin Variable Region - chemistry</subject><subject>Immunoglobulin Variable Region - 
immunology</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Observed Antibody Space (OAS)</subject><subject>Receptors</subject><subject>SARS-CoV-2 - immunology</subject><subject>Severe acute respiratory syndrome</subject><subject>Severe acute respiratory syndrome coronavirus 2</subject><subject>Tools for Protein Science</subject><subject>Viral diseases</subject><issn>0961-8368</issn><issn>1469-896X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>EIF</sourceid><recordid>eNp1kV2L1DAUhoO4uOMq-Auk4I0Xds1Xm8QLYVj8goURP8C7cNKeapdO0k3a0fn3m3HGWVfwKu9J3jznTQ4hTxg9Z5Tyl2MM55LT6h5ZMFmbUpv6232yoKZmpRa1PiUPU7qilErGxQNyKmRdVUyIBfm1cgnjBtti6afehXZbfB6hwVfFsmj7DcaERQsTOMgidEUzIHhsXxTgfZhg2su2mCL4NOzqYvYj9DGL3f5RHuAJr2f0DaZH5KSDIeHjw3pGvr598-XifXm5evfhYnlZNlKpqhRGG8EZOAOaGScUIEdTIxOSS6MUdxyBO1U5WmkHrgMGnap1pXlbNfnCGXm9546zW2PboM9RBzvGfg1xawP09u6J73_Y72FjtZJUc5EBzw-AGHL2NNl1nxochvwPYU6W51aaMWZMtj77x3oV5ujz8yyvmeK6YlLeApsYUorYHcMwanfjzHWwu3Fm69O_wx-Nf-aXDeXe8LMfcPtfkP34afUbeAO6fKqh</recordid><startdate>202201</startdate><enddate>202201</enddate><creator>Olsen, Tobias H.</creator><creator>Boyles, Fergus</creator><creator>Deane, Charlotte M.</creator><general>John Wiley &amp; Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>24P</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7T5</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-6348-4650</orcidid><orcidid>https://orcid.org/0000-0003-1388-2252</orcidid></search><sort><creationdate>202201</creationdate><title>Observed Antibody Space: A diverse database of 
cleaned, annotated, and translated unpaired and paired antibody sequences</title><author>Olsen, Tobias H. ; Boyles, Fergus ; Deane, Charlotte M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4775-3989321ab9a819b37ae2e96e134249772b2ea2b75b058babfa1af768582d5c9b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accessibility</topic><topic>Amino Acid Sequence</topic><topic>Amino acids</topic><topic>Animals</topic><topic>annotated antibody sequences</topic><topic>Annotations</topic><topic>Antibodies</topic><topic>Antibodies - chemistry</topic><topic>Antibodies - immunology</topic><topic>antibody database</topic><topic>antibody repertoire</topic><topic>antibody sequence</topic><topic>BCR‐seq</topic><topic>Coronaviruses</topic><topic>COVID-19 - immunology</topic><topic>Databases, Protein</topic><topic>Downloading</topic><topic>Humans</topic><topic>Immunoglobulin Heavy Chains - chemistry</topic><topic>Immunoglobulin Heavy Chains - immunology</topic><topic>Immunoglobulin Light Chains - chemistry</topic><topic>Immunoglobulin Light Chains - immunology</topic><topic>Immunoglobulin Variable Region - chemistry</topic><topic>Immunoglobulin Variable Region - immunology</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Observed Antibody Space (OAS)</topic><topic>Receptors</topic><topic>SARS-CoV-2 - immunology</topic><topic>Severe acute respiratory syndrome</topic><topic>Severe acute respiratory syndrome coronavirus 2</topic><topic>Tools for Protein Science</topic><topic>Viral diseases</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Olsen, Tobias H.</creatorcontrib><creatorcontrib>Boyles, Fergus</creatorcontrib><creatorcontrib>Deane, Charlotte M.</creatorcontrib><collection>Wiley Online Library Open 
Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Immunology Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Protein science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Olsen, Tobias H.</au><au>Boyles, Fergus</au><au>Deane, Charlotte M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences</atitle><jtitle>Protein science</jtitle><addtitle>Protein Sci</addtitle><date>2022-01</date><risdate>2022</risdate><volume>31</volume><issue>1</issue><spage>141</spage><epage>146</epage><pages>141-146</pages><issn>0961-8368</issn><eissn>1469-896X</eissn><abstract>The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. 
However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.</abstract><cop>Hoboken, USA</cop><pub>John Wiley &amp; Sons, Inc</pub><pmid>34655133</pmid><doi>10.1002/pro.4205</doi><tpages>6</tpages><orcidid>https://orcid.org/0000-0002-6348-4650</orcidid><orcidid>https://orcid.org/0000-0003-1388-2252</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0961-8368
ispartof Protein science, 2022-01, Vol.31 (1), p.141-146
issn 0961-8368
1469-896X
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8740823
source Wiley Free Content; MEDLINE; Wiley Online Library Journals Frontfile Complete; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Free Full-Text Journals in Chemistry
subjects Accessibility
Amino Acid Sequence
Amino acids
Animals
annotated antibody sequences
Annotations
Antibodies
Antibodies - chemistry
Antibodies - immunology
antibody database
antibody repertoire
antibody sequence
BCR‐seq
Coronaviruses
COVID-19 - immunology
Databases, Protein
Downloading
Humans
Immunoglobulin Heavy Chains - chemistry
Immunoglobulin Heavy Chains - immunology
Immunoglobulin Light Chains - chemistry
Immunoglobulin Light Chains - immunology
Immunoglobulin Variable Region - chemistry
Immunoglobulin Variable Region - immunology
Nucleotide sequence
Nucleotides
Observed Antibody Space (OAS)
Receptors
SARS-CoV-2 - immunology
Severe acute respiratory syndrome
Severe acute respiratory syndrome coronavirus 2
Tools for Protein Science
Viral diseases
title Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T05%3A38%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Observed%20Antibody%20Space:%20A%20diverse%20database%20of%20cleaned,%20annotated,%20and%20translated%20unpaired%20and%20paired%20antibody%20sequences&rft.jtitle=Protein%20science&rft.au=Olsen,%20Tobias%20H.&rft.date=2022-01&rft.volume=31&rft.issue=1&rft.spage=141&rft.epage=146&rft.pages=141-146&rft.issn=0961-8368&rft.eissn=1469-896X&rft_id=info:doi/10.1002/pro.4205&rft_dat=%3Cproquest_pubme%3E2617285144%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2617285144&rft_id=info:pmid/34655133&rfr_iscdi=true