SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci

Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including dev...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Briefings in bioinformatics 2018-07, Vol.19 (4), p.636-643
Hauptverfasser: Hao, Yajing, Zhang, Lili, Niu, Yiwei, Cai, Tanxi, Luo, Jianjun, He, Shunmin, Zhang, Bao, Zhang, Dejiu, Qin, Yan, Yang, Fuquan, Chen, Runsheng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 643
container_issue 4
container_start_page 636
container_title Briefings in bioinformatics
container_volume 19
creator Hao, Yajing
Zhang, Lili
Niu, Yiwei
Cai, Tanxi
Luo, Jianjun
He, Shunmin
Zhang, Bao
Zhang, Dejiu
Qin, Yan
Yang, Fuquan
Chen, Runsheng
description Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.
doi_str_mv 10.1093/bib/bbx005
format Article
fullrecord <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_proquest_miscellaneous_1863219700</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bib/bbx005</oup_id><sourcerecordid>1863219700</sourcerecordid><originalsourceid>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</originalsourceid><addsrcrecordid>eNp9kF9LwzAUxYMobk5f_AASEEGEupsmaVrfxvAfDBXdg28lTVLpaJPZtOC-vRmdPvjg07333B-Hw0HolMA1gYxOi6qYFsUXAN9DY8KEiBhwtr_dExFxltAROvJ-BRCDSMkhGsUpoUIkYoze35qX1nU3WGItO1lIb7ArsW9kXeN1-JjKemysctpoXGywtNZ1sgtHkCr7EQSNrbPR7nx9muHaqeoYHZSy9uZkNydoeXe7nD9Ei-f7x_lsESnKeBdRFTPDqdAqprHhwAopgpClGRclSE50yoXUlBNICEilwbBSC6FJZmgMdIIuB9uQ9bM3vsubyitT19Ia1_ucpAmNSSZgi57_QVeub20Il8cUOGSCZmmgrgZKtc771pT5uq0a2W5yAvm27jzUnQ91B_hsZ9kXjdG_6E-_AbgYANev_zP6Bvq3hgs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2305097398</pqid></control><display><type>article</type><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><source>Oxford Journals Open Access Collection</source><creator>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</creator><creatorcontrib>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</creatorcontrib><description>Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</description><identifier>ISSN: 1467-5463</identifier><identifier>EISSN: 1477-4054</identifier><identifier>DOI: 10.1093/bib/bbx005</identifier><identifier>PMID: 28137767</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Amino acids ; Biotechnology ; Cell lines ; Codon ; Data bases ; Databases, Factual ; Deoxyribonucleic acid ; DNA ; DNA repair ; Genomes ; Humans ; Molecular Sequence Annotation ; Muscle contraction ; Muscles ; Muscular function ; Non-coding RNA ; Open reading frames ; Proteins ; Proteins - genetics ; Proteins - metabolism ; RNA - genetics ; RNA, Untranslated - genetics ; Software ; Species ; Tissues</subject><ispartof>Briefings in bioinformatics, 2018-07, Vol.19 (4), p.636-643</ispartof><rights>The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2017</rights><rights>The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</citedby><cites>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,1598,27901,27902</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bib/bbx005$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28137767$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hao, Yajing</creatorcontrib><creatorcontrib>Zhang, Lili</creatorcontrib><creatorcontrib>Niu, Yiwei</creatorcontrib><creatorcontrib>Cai, Tanxi</creatorcontrib><creatorcontrib>Luo, Jianjun</creatorcontrib><creatorcontrib>He, Shunmin</creatorcontrib><creatorcontrib>Zhang, Bao</creatorcontrib><creatorcontrib>Zhang, Dejiu</creatorcontrib><creatorcontrib>Qin, Yan</creatorcontrib><creatorcontrib>Yang, Fuquan</creatorcontrib><creatorcontrib>Chen, Runsheng</creatorcontrib><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><title>Briefings in bioinformatics</title><addtitle>Brief Bioinform</addtitle><description>Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</description><subject>Amino acids</subject><subject>Biotechnology</subject><subject>Cell lines</subject><subject>Codon</subject><subject>Data bases</subject><subject>Databases, Factual</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA repair</subject><subject>Genomes</subject><subject>Humans</subject><subject>Molecular Sequence Annotation</subject><subject>Muscle contraction</subject><subject>Muscles</subject><subject>Muscular function</subject><subject>Non-coding RNA</subject><subject>Open reading frames</subject><subject>Proteins</subject><subject>Proteins - genetics</subject><subject>Proteins - metabolism</subject><subject>RNA - genetics</subject><subject>RNA, Untranslated - genetics</subject><subject>Software</subject><subject>Species</subject><subject>Tissues</subject><issn>1467-5463</issn><issn>1477-4054</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kF9LwzAUxYMobk5f_AASEEGEupsmaVrfxvAfDBXdg28lTVLpaJPZtOC-vRmdPvjg07333B-Hw0HolMA1gYxOi6qYFsUXAN9DY8KEiBhwtr_dExFxltAROvJ-BRCDSMkhGsUpoUIkYoze35qX1nU3WGItO1lIb7ArsW9kXeN1-JjKemysctpoXGywtNZ1sgtHkCr7EQSNrbPR7nx9muHaqeoYHZSy9uZkNydoeXe7nD9Ei-f7x_lsESnKeBdRFTPDqdAqprHhwAopgpClGRclSE50yoXUlBNICEilwbBSC6FJZmgMdIIuB9uQ9bM3vsubyitT19Ia1_ucpAmNSSZgi57_QVeub20Il8cUOGSCZmmgrgZKtc771pT5uq0a2W5yAvm27jzUnQ91B_hsZ9kXjdG_6E-_AbgYANev_zP6Bvq3hgs</recordid><startdate>20180701</startdate><enddate>20180701</enddate><creator>Hao, Yajing</creator><creator>Zhang, Lili</creator><creator>Niu, Yiwei</creator><creator>Cai, Tanxi</creator><creator>Luo, Jianjun</creator><creator>He, Shunmin</creator><creator>Zhang, Bao</creator><creator>Zhang, Dejiu</creator><creator>Qin, Yan</creator><creator>Yang, Fuquan</creator><creator>Chen, Runsheng</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7SC</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>K9.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>20180701</creationdate><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><author>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Amino acids</topic><topic>Biotechnology</topic><topic>Cell lines</topic><topic>Codon</topic><topic>Data bases</topic><topic>Databases, Factual</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA repair</topic><topic>Genomes</topic><topic>Humans</topic><topic>Molecular Sequence Annotation</topic><topic>Muscle contraction</topic><topic>Muscles</topic><topic>Muscular function</topic><topic>Non-coding RNA</topic><topic>Open reading frames</topic><topic>Proteins</topic><topic>Proteins - genetics</topic><topic>Proteins - metabolism</topic><topic>RNA - genetics</topic><topic>RNA, Untranslated - genetics</topic><topic>Software</topic><topic>Species</topic><topic>Tissues</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hao, Yajing</creatorcontrib><creatorcontrib>Zhang, Lili</creatorcontrib><creatorcontrib>Niu, Yiwei</creatorcontrib><creatorcontrib>Cai, Tanxi</creatorcontrib><creatorcontrib>Luo, Jianjun</creatorcontrib><creatorcontrib>He, Shunmin</creatorcontrib><creatorcontrib>Zhang, Bao</creatorcontrib><creatorcontrib>Zhang, Dejiu</creatorcontrib><creatorcontrib>Qin, Yan</creatorcontrib><creatorcontrib>Yang, Fuquan</creatorcontrib><creatorcontrib>Chen, Runsheng</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Briefings in bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hao, Yajing</au><au>Zhang, Lili</au><au>Niu, Yiwei</au><au>Cai, Tanxi</au><au>Luo, Jianjun</au><au>He, Shunmin</au><au>Zhang, Bao</au><au>Zhang, Dejiu</au><au>Qin, Yan</au><au>Yang, Fuquan</au><au>Chen, Runsheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</atitle><jtitle>Briefings in bioinformatics</jtitle><addtitle>Brief Bioinform</addtitle><date>2018-07-01</date><risdate>2018</risdate><volume>19</volume><issue>4</issue><spage>636</spage><epage>643</epage><pages>636-643</pages><issn>1467-5463</issn><eissn>1477-4054</eissn><abstract>Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>28137767</pmid><doi>10.1093/bib/bbx005</doi><tpages>8</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1467-5463
ispartof Briefings in bioinformatics, 2018-07, Vol.19 (4), p.636-643
issn 1467-5463
1477-4054
language eng
recordid cdi_proquest_miscellaneous_1863219700
source Oxford Journals Open Access Collection
subjects Amino acids
Biotechnology
Cell lines
Codon
Data bases
Databases, Factual
Deoxyribonucleic acid
DNA
DNA repair
Genomes
Humans
Molecular Sequence Annotation
Muscle contraction
Muscles
Muscular function
Non-coding RNA
Open reading frames
Proteins
Proteins - genetics
Proteins - metabolism
RNA - genetics
RNA, Untranslated - genetics
Software
Species
Tissues
title SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T05%3A25%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SmProt:%20a%20database%20of%20small%20proteins%20encoded%20by%20annotated%20coding%20and%20non-coding%20RNA%20loci&rft.jtitle=Briefings%20in%20bioinformatics&rft.au=Hao,%20Yajing&rft.date=2018-07-01&rft.volume=19&rft.issue=4&rft.spage=636&rft.epage=643&rft.pages=636-643&rft.issn=1467-5463&rft.eissn=1477-4054&rft_id=info:doi/10.1093/bib/bbx005&rft_dat=%3Cproquest_TOX%3E1863219700%3C/proquest_TOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2305097398&rft_id=info:pmid/28137767&rft_oup_id=10.1093/bib/bbx005&rfr_iscdi=true