SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci
Abstract Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including dev...
Gespeichert in:
Veröffentlicht in: | Briefings in bioinformatics 2018-07, Vol.19 (4), p.636-643 |
---|---|
Hauptverfasser: | , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 643 |
---|---|
container_issue | 4 |
container_start_page | 636 |
container_title | Briefings in bioinformatics |
container_volume | 19 |
creator | Hao, Yajing Zhang, Lili Niu, Yiwei Cai, Tanxi Luo, Jianjun He, Shunmin Zhang, Bao Zhang, Dejiu Qin, Yan Yang, Fuquan Chen, Runsheng |
description | Abstract
Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins. |
doi_str_mv | 10.1093/bib/bbx005 |
format | Article |
fullrecord | <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_proquest_miscellaneous_1863219700</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bib/bbx005</oup_id><sourcerecordid>1863219700</sourcerecordid><originalsourceid>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</originalsourceid><addsrcrecordid>eNp9kF9LwzAUxYMobk5f_AASEEGEupsmaVrfxvAfDBXdg28lTVLpaJPZtOC-vRmdPvjg07333B-Hw0HolMA1gYxOi6qYFsUXAN9DY8KEiBhwtr_dExFxltAROvJ-BRCDSMkhGsUpoUIkYoze35qX1nU3WGItO1lIb7ArsW9kXeN1-JjKemysctpoXGywtNZ1sgtHkCr7EQSNrbPR7nx9muHaqeoYHZSy9uZkNydoeXe7nD9Ei-f7x_lsESnKeBdRFTPDqdAqprHhwAopgpClGRclSE50yoXUlBNICEilwbBSC6FJZmgMdIIuB9uQ9bM3vsubyitT19Ia1_ucpAmNSSZgi57_QVeub20Il8cUOGSCZmmgrgZKtc771pT5uq0a2W5yAvm27jzUnQ91B_hsZ9kXjdG_6E-_AbgYANev_zP6Bvq3hgs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2305097398</pqid></control><display><type>article</type><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><source>Oxford Journals Open Access Collection</source><creator>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</creator><creatorcontrib>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</creatorcontrib><description>Abstract
Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</description><identifier>ISSN: 1467-5463</identifier><identifier>EISSN: 1477-4054</identifier><identifier>DOI: 10.1093/bib/bbx005</identifier><identifier>PMID: 28137767</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Amino acids ; Biotechnology ; Cell lines ; Codon ; Data bases ; Databases, Factual ; Deoxyribonucleic acid ; DNA ; DNA repair ; Genomes ; Humans ; Molecular Sequence Annotation ; Muscle contraction ; Muscles ; Muscular function ; Non-coding RNA ; Open reading frames ; Proteins ; Proteins - genetics ; Proteins - metabolism ; RNA - genetics ; RNA, Untranslated - genetics ; Software ; Species ; Tissues</subject><ispartof>Briefings in bioinformatics, 2018-07, Vol.19 (4), p.636-643</ispartof><rights>The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2017</rights><rights>The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</citedby><cites>FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,1598,27901,27902</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bib/bbx005$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28137767$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hao, Yajing</creatorcontrib><creatorcontrib>Zhang, Lili</creatorcontrib><creatorcontrib>Niu, Yiwei</creatorcontrib><creatorcontrib>Cai, Tanxi</creatorcontrib><creatorcontrib>Luo, Jianjun</creatorcontrib><creatorcontrib>He, Shunmin</creatorcontrib><creatorcontrib>Zhang, Bao</creatorcontrib><creatorcontrib>Zhang, Dejiu</creatorcontrib><creatorcontrib>Qin, Yan</creatorcontrib><creatorcontrib>Yang, Fuquan</creatorcontrib><creatorcontrib>Chen, Runsheng</creatorcontrib><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><title>Briefings in bioinformatics</title><addtitle>Brief Bioinform</addtitle><description>Abstract
Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</description><subject>Amino acids</subject><subject>Biotechnology</subject><subject>Cell lines</subject><subject>Codon</subject><subject>Data bases</subject><subject>Databases, Factual</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA repair</subject><subject>Genomes</subject><subject>Humans</subject><subject>Molecular Sequence Annotation</subject><subject>Muscle contraction</subject><subject>Muscles</subject><subject>Muscular function</subject><subject>Non-coding RNA</subject><subject>Open reading frames</subject><subject>Proteins</subject><subject>Proteins - genetics</subject><subject>Proteins - metabolism</subject><subject>RNA - genetics</subject><subject>RNA, Untranslated - genetics</subject><subject>Software</subject><subject>Species</subject><subject>Tissues</subject><issn>1467-5463</issn><issn>1477-4054</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kF9LwzAUxYMobk5f_AASEEGEupsmaVrfxvAfDBXdg28lTVLpaJPZtOC-vRmdPvjg07333B-Hw0HolMA1gYxOi6qYFsUXAN9DY8KEiBhwtr_dExFxltAROvJ-BRCDSMkhGsUpoUIkYoze35qX1nU3WGItO1lIb7ArsW9kXeN1-JjKemysctpoXGywtNZ1sgtHkCr7EQSNrbPR7nx9muHaqeoYHZSy9uZkNydoeXe7nD9Ei-f7x_lsESnKeBdRFTPDqdAqprHhwAopgpClGRclSE50yoXUlBNICEilwbBSC6FJZmgMdIIuB9uQ9bM3vsubyitT19Ia1_ucpAmNSSZgi57_QVeub20Il8cUOGSCZmmgrgZKtc771pT5uq0a2W5yAvm27jzUnQ91B_hsZ9kXjdG_6E-_AbgYANev_zP6Bvq3hgs</recordid><startdate>20180701</startdate><enddate>20180701</enddate><creator>Hao, Yajing</creator><creator>Zhang, Lili</creator><creator>Niu, Yiwei</creator><creator>Cai, Tanxi</creator><creator>Luo, Jianjun</creator><creator>He, Shunmin</creator><creator>Zhang, Bao</creator><creator>Zhang, Dejiu</creator><creator>Qin, Yan</creator><creator>Yang, Fuquan</creator><creator>Chen, Runsheng</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7SC</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>K9.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>20180701</creationdate><title>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</title><author>Hao, Yajing ; Zhang, Lili ; Niu, Yiwei ; Cai, Tanxi ; Luo, Jianjun ; He, Shunmin ; Zhang, Bao ; Zhang, Dejiu ; Qin, Yan ; Yang, Fuquan ; Chen, Runsheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c345t-3c24e537dc232e504ba74e598957f0a51d857ad3510610acd0e4fd77d19e3203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Amino acids</topic><topic>Biotechnology</topic><topic>Cell lines</topic><topic>Codon</topic><topic>Data bases</topic><topic>Databases, Factual</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA repair</topic><topic>Genomes</topic><topic>Humans</topic><topic>Molecular Sequence Annotation</topic><topic>Muscle contraction</topic><topic>Muscles</topic><topic>Muscular function</topic><topic>Non-coding RNA</topic><topic>Open reading frames</topic><topic>Proteins</topic><topic>Proteins - genetics</topic><topic>Proteins - metabolism</topic><topic>RNA - genetics</topic><topic>RNA, Untranslated - genetics</topic><topic>Software</topic><topic>Species</topic><topic>Tissues</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hao, Yajing</creatorcontrib><creatorcontrib>Zhang, Lili</creatorcontrib><creatorcontrib>Niu, Yiwei</creatorcontrib><creatorcontrib>Cai, Tanxi</creatorcontrib><creatorcontrib>Luo, Jianjun</creatorcontrib><creatorcontrib>He, Shunmin</creatorcontrib><creatorcontrib>Zhang, Bao</creatorcontrib><creatorcontrib>Zhang, Dejiu</creatorcontrib><creatorcontrib>Qin, Yan</creatorcontrib><creatorcontrib>Yang, Fuquan</creatorcontrib><creatorcontrib>Chen, Runsheng</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Briefings in bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hao, Yajing</au><au>Zhang, Lili</au><au>Niu, Yiwei</au><au>Cai, Tanxi</au><au>Luo, Jianjun</au><au>He, Shunmin</au><au>Zhang, Bao</au><au>Zhang, Dejiu</au><au>Qin, Yan</au><au>Yang, Fuquan</au><au>Chen, Runsheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci</atitle><jtitle>Briefings in bioinformatics</jtitle><addtitle>Brief Bioinform</addtitle><date>2018-07-01</date><risdate>2018</risdate><volume>19</volume><issue>4</issue><spage>636</spage><epage>643</epage><pages>636-643</pages><issn>1467-5463</issn><eissn>1477-4054</eissn><abstract>Abstract
Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>28137767</pmid><doi>10.1093/bib/bbx005</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1467-5463 |
ispartof | Briefings in bioinformatics, 2018-07, Vol.19 (4), p.636-643 |
issn | 1467-5463 1477-4054 |
language | eng |
recordid | cdi_proquest_miscellaneous_1863219700 |
source | Oxford Journals Open Access Collection |
subjects | Amino acids Biotechnology Cell lines Codon Data bases Databases, Factual Deoxyribonucleic acid DNA DNA repair Genomes Humans Molecular Sequence Annotation Muscle contraction Muscles Muscular function Non-coding RNA Open reading frames Proteins Proteins - genetics Proteins - metabolism RNA - genetics RNA, Untranslated - genetics Software Species Tissues |
title | SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T05%3A25%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SmProt:%20a%20database%20of%20small%20proteins%20encoded%20by%20annotated%20coding%20and%20non-coding%20RNA%20loci&rft.jtitle=Briefings%20in%20bioinformatics&rft.au=Hao,%20Yajing&rft.date=2018-07-01&rft.volume=19&rft.issue=4&rft.spage=636&rft.epage=643&rft.pages=636-643&rft.issn=1467-5463&rft.eissn=1477-4054&rft_id=info:doi/10.1093/bib/bbx005&rft_dat=%3Cproquest_TOX%3E1863219700%3C/proquest_TOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2305097398&rft_id=info:pmid/28137767&rft_oup_id=10.1093/bib/bbx005&rfr_iscdi=true |