Gene Ontology based housekeeping gene selection for RNA-seq normalization

•A novel transcriptomic data normalization method based on housekeeping genes.•The housekeeping genes are selected by GO distance and stability analysis.•Normalization results showed the proposed method outperformed traditional approaches.•A web-based online system is available for 12 model species....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Methods (San Diego, Calif.) Calif.), 2014-06, Vol.67 (3), p.354-363
Hauptverfasser: Chen, Chien-Ming, Lu, Yu-Lun, Sio, Chi-Pong, Wu, Guan-Chung, Tzou, Wen-Shyong, Pai, Tun-Wen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 363
container_issue 3
container_start_page 354
container_title Methods (San Diego, Calif.)
container_volume 67
creator Chen, Chien-Ming
Lu, Yu-Lun
Sio, Chi-Pong
Wu, Guan-Chung
Tzou, Wen-Shyong
Pai, Tun-Wen
description •A novel transcriptomic data normalization method based on housekeeping genes.•The housekeeping genes are selected by GO distance and stability analysis.•Normalization results showed the proposed method outperformed traditional approaches.•A web-based online system is available for 12 model species. RNA-seq analysis provides a powerful tool for revealing relationships between gene expression level and biological function of proteins. In order to identify differentially expressed genes among various RNA-seq datasets obtained from different experimental designs, an appropriate normalization method for calibrating multiple experimental datasets is the first challenging problem. We propose a novel method to facilitate biologists in selecting a set of suitable housekeeping genes for inter-sample normalization. The approach is achieved by adopting user defined experimentally related keywords, GO annotations, GO term distance matrices, orthologous housekeeping gene candidates, and stability ranking of housekeeping genes. By identifying the most distanced GO terms from query keywords and selecting housekeeping gene candidates with low coefficients of variation among different spatio-temporal datasets, the proposed method can automatically enumerate a set of functionally irrelevant housekeeping genes for pratical normalization. Novel and benchmark testing RNA-seq datasets were applied to demostrate that different selections of housekeeping gene lead to strong impact on differential gene expression analysis, and compared results have shown that our proposed method outperformed other traditional approaches in terms of both sensitivity and specificity. The proposed mechanism of selecting appropriate houskeeping genes for inter-dataset normalization is robust and accurate for differential expression analyses.
doi_str_mv 10.1016/j.ymeth.2014.01.019
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1544001150</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1046202314000309</els_id><sourcerecordid>1531956306</sourcerecordid><originalsourceid>FETCH-LOGICAL-c392t-65219015e37508a8d542ee015abba9cf991ec674506f95f5bb7081343b5203dc3</originalsourceid><addsrcrecordid>eNqNkE1P3DAQQC0EKhT6C5CqHLlkO2PHzvrQA0KFIiGQqnK2HGeyeEnixc5WWn59nS70WFUaydb4zYcfY-cICwRUX9aL3UDT04IDVgvAHPqAnSBoWWoUcDjfK1Vy4OKYfUxpDQDI6-UHdswrqRBVfcJub2ik4mGcQh9Wu6KxidriKWwTPRNt_LgqVjOQqCc3-TAWXYjFj_vLMtFLMYY42N6_2vnljB11tk_06e08ZY_X335efS_vHm5ury7vSic0n0olOWpASaKWsLTLVlacKCds01jtOq2RnKorCarTspNNU8MSRSUayUG0Tpyyi33fTQwvW0qTGXxy1Pd2pLy3QVlV-aMo4T9QgVoqASqjYo-6GFKK1JlN9IONO4NgZt1mbf7oNrNuA5hD56rPbwO2zUDt35p3vxn4ugcoG_nlKZrkPI2OWh-zUNMG_88BvwGhHJBX</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1531956306</pqid></control><display><type>article</type><title>Gene Ontology based housekeeping gene selection for RNA-seq normalization</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><creator>Chen, Chien-Ming ; Lu, Yu-Lun ; Sio, Chi-Pong ; Wu, Guan-Chung ; Tzou, Wen-Shyong ; Pai, Tun-Wen</creator><creatorcontrib>Chen, Chien-Ming ; Lu, Yu-Lun ; Sio, Chi-Pong ; Wu, Guan-Chung ; Tzou, Wen-Shyong ; Pai, Tun-Wen</creatorcontrib><description>•A novel transcriptomic data normalization method based on housekeeping genes.•The housekeeping genes are selected by GO distance and stability analysis.•Normalization results showed the proposed method outperformed traditional approaches.•A web-based online system is available for 12 model species. RNA-seq analysis provides a powerful tool for revealing relationships between gene expression level and biological function of proteins. In order to identify differentially expressed genes among various RNA-seq datasets obtained from different experimental designs, an appropriate normalization method for calibrating multiple experimental datasets is the first challenging problem. We propose a novel method to facilitate biologists in selecting a set of suitable housekeeping genes for inter-sample normalization. The approach is achieved by adopting user defined experimentally related keywords, GO annotations, GO term distance matrices, orthologous housekeeping gene candidates, and stability ranking of housekeeping genes. By identifying the most distanced GO terms from query keywords and selecting housekeeping gene candidates with low coefficients of variation among different spatio-temporal datasets, the proposed method can automatically enumerate a set of functionally irrelevant housekeeping genes for pratical normalization. Novel and benchmark testing RNA-seq datasets were applied to demostrate that different selections of housekeeping gene lead to strong impact on differential gene expression analysis, and compared results have shown that our proposed method outperformed other traditional approaches in terms of both sensitivity and specificity. The proposed mechanism of selecting appropriate houskeeping genes for inter-dataset normalization is robust and accurate for differential expression analyses.</description><identifier>ISSN: 1046-2023</identifier><identifier>EISSN: 1095-9130</identifier><identifier>DOI: 10.1016/j.ymeth.2014.01.019</identifier><identifier>PMID: 24561167</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Animals ; Comparative genomics ; Differential gene expression ; Female ; Fishes - genetics ; Fishes - physiology ; Gene Expression Profiling ; Gene Ontology ; Genes, Essential ; Genomics - methods ; Housekeeping gene ; Humans ; Male ; RNA-seq ; Sequence Analysis, RNA - methods</subject><ispartof>Methods (San Diego, Calif.), 2014-06, Vol.67 (3), p.354-363</ispartof><rights>2014 Elsevier Inc.</rights><rights>Copyright © 2014 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c392t-65219015e37508a8d542ee015abba9cf991ec674506f95f5bb7081343b5203dc3</citedby><cites>FETCH-LOGICAL-c392t-65219015e37508a8d542ee015abba9cf991ec674506f95f5bb7081343b5203dc3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1046202314000309$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3536,27903,27904,65309</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24561167$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Chen, Chien-Ming</creatorcontrib><creatorcontrib>Lu, Yu-Lun</creatorcontrib><creatorcontrib>Sio, Chi-Pong</creatorcontrib><creatorcontrib>Wu, Guan-Chung</creatorcontrib><creatorcontrib>Tzou, Wen-Shyong</creatorcontrib><creatorcontrib>Pai, Tun-Wen</creatorcontrib><title>Gene Ontology based housekeeping gene selection for RNA-seq normalization</title><title>Methods (San Diego, Calif.)</title><addtitle>Methods</addtitle><description>•A novel transcriptomic data normalization method based on housekeeping genes.•The housekeeping genes are selected by GO distance and stability analysis.•Normalization results showed the proposed method outperformed traditional approaches.•A web-based online system is available for 12 model species. RNA-seq analysis provides a powerful tool for revealing relationships between gene expression level and biological function of proteins. In order to identify differentially expressed genes among various RNA-seq datasets obtained from different experimental designs, an appropriate normalization method for calibrating multiple experimental datasets is the first challenging problem. We propose a novel method to facilitate biologists in selecting a set of suitable housekeeping genes for inter-sample normalization. The approach is achieved by adopting user defined experimentally related keywords, GO annotations, GO term distance matrices, orthologous housekeeping gene candidates, and stability ranking of housekeeping genes. By identifying the most distanced GO terms from query keywords and selecting housekeeping gene candidates with low coefficients of variation among different spatio-temporal datasets, the proposed method can automatically enumerate a set of functionally irrelevant housekeeping genes for pratical normalization. Novel and benchmark testing RNA-seq datasets were applied to demostrate that different selections of housekeeping gene lead to strong impact on differential gene expression analysis, and compared results have shown that our proposed method outperformed other traditional approaches in terms of both sensitivity and specificity. The proposed mechanism of selecting appropriate houskeeping genes for inter-dataset normalization is robust and accurate for differential expression analyses.</description><subject>Animals</subject><subject>Comparative genomics</subject><subject>Differential gene expression</subject><subject>Female</subject><subject>Fishes - genetics</subject><subject>Fishes - physiology</subject><subject>Gene Expression Profiling</subject><subject>Gene Ontology</subject><subject>Genes, Essential</subject><subject>Genomics - methods</subject><subject>Housekeeping gene</subject><subject>Humans</subject><subject>Male</subject><subject>RNA-seq</subject><subject>Sequence Analysis, RNA - methods</subject><issn>1046-2023</issn><issn>1095-9130</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkE1P3DAQQC0EKhT6C5CqHLlkO2PHzvrQA0KFIiGQqnK2HGeyeEnixc5WWn59nS70WFUaydb4zYcfY-cICwRUX9aL3UDT04IDVgvAHPqAnSBoWWoUcDjfK1Vy4OKYfUxpDQDI6-UHdswrqRBVfcJub2ik4mGcQh9Wu6KxidriKWwTPRNt_LgqVjOQqCc3-TAWXYjFj_vLMtFLMYY42N6_2vnljB11tk_06e08ZY_X335efS_vHm5ury7vSic0n0olOWpASaKWsLTLVlacKCds01jtOq2RnKorCarTspNNU8MSRSUayUG0Tpyyi33fTQwvW0qTGXxy1Pd2pLy3QVlV-aMo4T9QgVoqASqjYo-6GFKK1JlN9IONO4NgZt1mbf7oNrNuA5hD56rPbwO2zUDt35p3vxn4ugcoG_nlKZrkPI2OWh-zUNMG_88BvwGhHJBX</recordid><startdate>20140601</startdate><enddate>20140601</enddate><creator>Chen, Chien-Ming</creator><creator>Lu, Yu-Lun</creator><creator>Sio, Chi-Pong</creator><creator>Wu, Guan-Chung</creator><creator>Tzou, Wen-Shyong</creator><creator>Pai, Tun-Wen</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope></search><sort><creationdate>20140601</creationdate><title>Gene Ontology based housekeeping gene selection for RNA-seq normalization</title><author>Chen, Chien-Ming ; Lu, Yu-Lun ; Sio, Chi-Pong ; Wu, Guan-Chung ; Tzou, Wen-Shyong ; Pai, Tun-Wen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c392t-65219015e37508a8d542ee015abba9cf991ec674506f95f5bb7081343b5203dc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Animals</topic><topic>Comparative genomics</topic><topic>Differential gene expression</topic><topic>Female</topic><topic>Fishes - genetics</topic><topic>Fishes - physiology</topic><topic>Gene Expression Profiling</topic><topic>Gene Ontology</topic><topic>Genes, Essential</topic><topic>Genomics - methods</topic><topic>Housekeeping gene</topic><topic>Humans</topic><topic>Male</topic><topic>RNA-seq</topic><topic>Sequence Analysis, RNA - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Chien-Ming</creatorcontrib><creatorcontrib>Lu, Yu-Lun</creatorcontrib><creatorcontrib>Sio, Chi-Pong</creatorcontrib><creatorcontrib>Wu, Guan-Chung</creatorcontrib><creatorcontrib>Tzou, Wen-Shyong</creatorcontrib><creatorcontrib>Pai, Tun-Wen</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><jtitle>Methods (San Diego, Calif.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Chien-Ming</au><au>Lu, Yu-Lun</au><au>Sio, Chi-Pong</au><au>Wu, Guan-Chung</au><au>Tzou, Wen-Shyong</au><au>Pai, Tun-Wen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Gene Ontology based housekeeping gene selection for RNA-seq normalization</atitle><jtitle>Methods (San Diego, Calif.)</jtitle><addtitle>Methods</addtitle><date>2014-06-01</date><risdate>2014</risdate><volume>67</volume><issue>3</issue><spage>354</spage><epage>363</epage><pages>354-363</pages><issn>1046-2023</issn><eissn>1095-9130</eissn><abstract>•A novel transcriptomic data normalization method based on housekeeping genes.•The housekeeping genes are selected by GO distance and stability analysis.•Normalization results showed the proposed method outperformed traditional approaches.•A web-based online system is available for 12 model species. RNA-seq analysis provides a powerful tool for revealing relationships between gene expression level and biological function of proteins. In order to identify differentially expressed genes among various RNA-seq datasets obtained from different experimental designs, an appropriate normalization method for calibrating multiple experimental datasets is the first challenging problem. We propose a novel method to facilitate biologists in selecting a set of suitable housekeeping genes for inter-sample normalization. The approach is achieved by adopting user defined experimentally related keywords, GO annotations, GO term distance matrices, orthologous housekeeping gene candidates, and stability ranking of housekeeping genes. By identifying the most distanced GO terms from query keywords and selecting housekeeping gene candidates with low coefficients of variation among different spatio-temporal datasets, the proposed method can automatically enumerate a set of functionally irrelevant housekeeping genes for pratical normalization. Novel and benchmark testing RNA-seq datasets were applied to demostrate that different selections of housekeeping gene lead to strong impact on differential gene expression analysis, and compared results have shown that our proposed method outperformed other traditional approaches in terms of both sensitivity and specificity. The proposed mechanism of selecting appropriate houskeeping genes for inter-dataset normalization is robust and accurate for differential expression analyses.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>24561167</pmid><doi>10.1016/j.ymeth.2014.01.019</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1046-2023
ispartof Methods (San Diego, Calif.), 2014-06, Vol.67 (3), p.354-363
issn 1046-2023
1095-9130
language eng
recordid cdi_proquest_miscellaneous_1544001150
source MEDLINE; Elsevier ScienceDirect Journals
subjects Animals
Comparative genomics
Differential gene expression
Female
Fishes - genetics
Fishes - physiology
Gene Expression Profiling
Gene Ontology
Genes, Essential
Genomics - methods
Housekeeping gene
Humans
Male
RNA-seq
Sequence Analysis, RNA - methods
title Gene Ontology based housekeeping gene selection for RNA-seq normalization
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T14%3A33%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Gene%20Ontology%20based%20housekeeping%20gene%20selection%20for%20RNA-seq%20normalization&rft.jtitle=Methods%20(San%20Diego,%20Calif.)&rft.au=Chen,%20Chien-Ming&rft.date=2014-06-01&rft.volume=67&rft.issue=3&rft.spage=354&rft.epage=363&rft.pages=354-363&rft.issn=1046-2023&rft.eissn=1095-9130&rft_id=info:doi/10.1016/j.ymeth.2014.01.019&rft_dat=%3Cproquest_cross%3E1531956306%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1531956306&rft_id=info:pmid/24561167&rft_els_id=S1046202314000309&rfr_iscdi=true