Discovering disease-genes by topological features in human protein–protein interaction network
Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematica...
Gespeichert in:
Veröffentlicht in: | Bioinformatics 2006-11, Vol.22 (22), p.2800-2805 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2805 |
---|---|
container_issue | 22 |
container_start_page | 2800 |
container_title | Bioinformatics |
container_volume | 22 |
creator | Xu, Jianzhen Li, Yongjin |
description | Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematically examined and efficient classifiers have been constructed based on the identified common patterns. The availability of human genome-wide protein–protein interactions (PPIs) provides us with new opportunity for discovering hereditary disease-genes by topological features in PPIs network. Results: This analysis reveals that the hereditary disease-genes ascertained from OMIM in the literature-curated (LC) PPIs network are characterized by a larger degree, tendency to interact with other disease-genes, more common neighbors and quick communication to each other whereas those properties could not be detected from the network identified from high-throughput yeast two-hybrid mapping approach (EXP) and predicted interactions (PDT) PPIs network. KNN classifier based on those features was created and on average gained overall prediction accuracy of 0.76 in cross-validation test. Then the classifier was applied to 5262 genes on human genome and predicted 178 novel disease-genes. Some of the predictions have been validated by biological experiments. Contact:jianzxu@hotmail.com Supplementary information: Supplementary data are available at Bioinformatics online. |
doi_str_mv | 10.1093/bioinformatics/btl467 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_68113468</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1246646741</sourcerecordid><originalsourceid>FETCH-LOGICAL-c546t-66e3984d03a8f8edfd85ecadd23d3ac540f752490ceda282da6b4b530f3680893</originalsourceid><addsrcrecordid>eNqFkc1u1TAQhS0EoqXwCKAICXahdvwTewktcJGK2IBAbIxjTy5uE_tiO0B3vANvyJPg6kZUsGE1o5lvznh8ELpP8BOCFT0efPRhjGk2xdt8PJSJif4GOiRM4LbDXN2sORV9yySmB-hOzucYc8IYu40OiFCcEdofok-nPtv4FZIP28b5DCZDu4UAuRkumxJ3cYpbb83UjGDKkmrdh-bzMpvQ7FIs4MOvHz_XrLYKJGOLj6EJUL7FdHEX3RrNlOHeGo_QuxfP355s2rM3L1-dPD1rLWeitEIAVZI5TI0cJbjRSQ7WONdRR01l8NjzjilswZlOds6IgQ2c4pEKiaWiR-jxXre-5csCuei5XgbTZALEJWshCaFMyP-CRFGF-66v4MN_wPO4pFCPqIwUnHXqSo3vIZtizglGvUt-NulSE6yvjNJ_G6X3RtW5B6v4MszgrqdWZyrwaAVMrt8_JhOsz9ecpBhXmyvX7jmfC3z_0zfpQtc1PdebDx_1--41P1XPNprR37ePs0Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>198654298</pqid></control><display><type>article</type><title>Discovering disease-genes by topological features in human protein–protein interaction network</title><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Oxford Journals Open Access Collection</source><source>Alma/SFX Local Collection</source><creator>Xu, Jianzhen ; Li, Yongjin</creator><creatorcontrib>Xu, Jianzhen ; Li, Yongjin</creatorcontrib><description>Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematically examined and efficient classifiers have been constructed based on the identified common patterns. The availability of human genome-wide protein–protein interactions (PPIs) provides us with new opportunity for discovering hereditary disease-genes by topological features in PPIs network. Results: This analysis reveals that the hereditary disease-genes ascertained from OMIM in the literature-curated (LC) PPIs network are characterized by a larger degree, tendency to interact with other disease-genes, more common neighbors and quick communication to each other whereas those properties could not be detected from the network identified from high-throughput yeast two-hybrid mapping approach (EXP) and predicted interactions (PDT) PPIs network. KNN classifier based on those features was created and on average gained overall prediction accuracy of 0.76 in cross-validation test. Then the classifier was applied to 5262 genes on human genome and predicted 178 novel disease-genes. Some of the predictions have been validated by biological experiments. Contact:jianzxu@hotmail.com Supplementary information: Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btl467</identifier><identifier>PMID: 16954137</identifier><identifier>CODEN: BOINFP</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Biological and medical sciences ; Chromosome Mapping ; Cluster Analysis ; Computational Biology - methods ; Computer Simulation ; Databases, Genetic ; Databases, Protein ; Fundamental and applied biological sciences. Psychology ; General aspects ; Genetic Diseases, Inborn - genetics ; Genome, Human ; Humans ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Models, Statistical ; Phenotype ; Protein Interaction Mapping ; Proteins - genetics</subject><ispartof>Bioinformatics, 2006-11, Vol.22 (22), p.2800-2805</ispartof><rights>2007 INIST-CNRS</rights><rights>Copyright Oxford University Press(England) Nov 2006</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c546t-66e3984d03a8f8edfd85ecadd23d3ac540f752490ceda282da6b4b530f3680893</citedby><cites>FETCH-LOGICAL-c546t-66e3984d03a8f8edfd85ecadd23d3ac540f752490ceda282da6b4b530f3680893</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=18300205$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16954137$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Xu, Jianzhen</creatorcontrib><creatorcontrib>Li, Yongjin</creatorcontrib><title>Discovering disease-genes by topological features in human protein–protein interaction network</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematically examined and efficient classifiers have been constructed based on the identified common patterns. The availability of human genome-wide protein–protein interactions (PPIs) provides us with new opportunity for discovering hereditary disease-genes by topological features in PPIs network. Results: This analysis reveals that the hereditary disease-genes ascertained from OMIM in the literature-curated (LC) PPIs network are characterized by a larger degree, tendency to interact with other disease-genes, more common neighbors and quick communication to each other whereas those properties could not be detected from the network identified from high-throughput yeast two-hybrid mapping approach (EXP) and predicted interactions (PDT) PPIs network. KNN classifier based on those features was created and on average gained overall prediction accuracy of 0.76 in cross-validation test. Then the classifier was applied to 5262 genes on human genome and predicted 178 novel disease-genes. Some of the predictions have been validated by biological experiments. Contact:jianzxu@hotmail.com Supplementary information: Supplementary data are available at Bioinformatics online.</description><subject>Biological and medical sciences</subject><subject>Chromosome Mapping</subject><subject>Cluster Analysis</subject><subject>Computational Biology - methods</subject><subject>Computer Simulation</subject><subject>Databases, Genetic</subject><subject>Databases, Protein</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Genetic Diseases, Inborn - genetics</subject><subject>Genome, Human</subject><subject>Humans</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Models, Statistical</subject><subject>Phenotype</subject><subject>Protein Interaction Mapping</subject><subject>Proteins - genetics</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkc1u1TAQhS0EoqXwCKAICXahdvwTewktcJGK2IBAbIxjTy5uE_tiO0B3vANvyJPg6kZUsGE1o5lvznh8ELpP8BOCFT0efPRhjGk2xdt8PJSJif4GOiRM4LbDXN2sORV9yySmB-hOzucYc8IYu40OiFCcEdofok-nPtv4FZIP28b5DCZDu4UAuRkumxJ3cYpbb83UjGDKkmrdh-bzMpvQ7FIs4MOvHz_XrLYKJGOLj6EJUL7FdHEX3RrNlOHeGo_QuxfP355s2rM3L1-dPD1rLWeitEIAVZI5TI0cJbjRSQ7WONdRR01l8NjzjilswZlOds6IgQ2c4pEKiaWiR-jxXre-5csCuei5XgbTZALEJWshCaFMyP-CRFGF-66v4MN_wPO4pFCPqIwUnHXqSo3vIZtizglGvUt-NulSE6yvjNJ_G6X3RtW5B6v4MszgrqdWZyrwaAVMrt8_JhOsz9ecpBhXmyvX7jmfC3z_0zfpQtc1PdebDx_1--41P1XPNprR37ePs0Q</recordid><startdate>20061115</startdate><enddate>20061115</enddate><creator>Xu, Jianzhen</creator><creator>Li, Yongjin</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7TO</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>20061115</creationdate><title>Discovering disease-genes by topological features in human protein–protein interaction network</title><author>Xu, Jianzhen ; Li, Yongjin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c546t-66e3984d03a8f8edfd85ecadd23d3ac540f752490ceda282da6b4b530f3680893</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Biological and medical sciences</topic><topic>Chromosome Mapping</topic><topic>Cluster Analysis</topic><topic>Computational Biology - methods</topic><topic>Computer Simulation</topic><topic>Databases, Genetic</topic><topic>Databases, Protein</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Genetic Diseases, Inborn - genetics</topic><topic>Genome, Human</topic><topic>Humans</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Models, Statistical</topic><topic>Phenotype</topic><topic>Protein Interaction Mapping</topic><topic>Proteins - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Jianzhen</creatorcontrib><creatorcontrib>Li, Yongjin</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Jianzhen</au><au>Li, Yongjin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Discovering disease-genes by topological features in human protein–protein interaction network</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2006-11-15</date><risdate>2006</risdate><volume>22</volume><issue>22</issue><spage>2800</spage><epage>2805</epage><pages>2800-2805</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><coden>BOINFP</coden><abstract>Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematically examined and efficient classifiers have been constructed based on the identified common patterns. The availability of human genome-wide protein–protein interactions (PPIs) provides us with new opportunity for discovering hereditary disease-genes by topological features in PPIs network. Results: This analysis reveals that the hereditary disease-genes ascertained from OMIM in the literature-curated (LC) PPIs network are characterized by a larger degree, tendency to interact with other disease-genes, more common neighbors and quick communication to each other whereas those properties could not be detected from the network identified from high-throughput yeast two-hybrid mapping approach (EXP) and predicted interactions (PDT) PPIs network. KNN classifier based on those features was created and on average gained overall prediction accuracy of 0.76 in cross-validation test. Then the classifier was applied to 5262 genes on human genome and predicted 178 novel disease-genes. Some of the predictions have been validated by biological experiments. Contact:jianzxu@hotmail.com Supplementary information: Supplementary data are available at Bioinformatics online.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>16954137</pmid><doi>10.1093/bioinformatics/btl467</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2006-11, Vol.22 (22), p.2800-2805 |
issn | 1367-4803 1460-2059 1367-4811 |
language | eng |
recordid | cdi_proquest_miscellaneous_68113468 |
source | MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Oxford Journals Open Access Collection; Alma/SFX Local Collection |
subjects | Biological and medical sciences Chromosome Mapping Cluster Analysis Computational Biology - methods Computer Simulation Databases, Genetic Databases, Protein Fundamental and applied biological sciences. Psychology General aspects Genetic Diseases, Inborn - genetics Genome, Human Humans Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Models, Statistical Phenotype Protein Interaction Mapping Proteins - genetics |
title | Discovering disease-genes by topological features in human protein–protein interaction network |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T09%3A17%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Discovering%20disease-genes%20by%20topological%20features%20in%20human%20protein%E2%80%93protein%20interaction%20network&rft.jtitle=Bioinformatics&rft.au=Xu,%20Jianzhen&rft.date=2006-11-15&rft.volume=22&rft.issue=22&rft.spage=2800&rft.epage=2805&rft.pages=2800-2805&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/btl467&rft_dat=%3Cproquest_cross%3E1246646741%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=198654298&rft_id=info:pmid/16954137&rfr_iscdi=true |