Efficient Clustering for Gene Expression Data

In the past decade there have been advance in technologies, the amount of biological data such as DNA sequences and microarray data have been increased tremendously. To obtain knowledge from the data, explore relationships between genes, understanding severe diseases and development of drugs for pat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of computer applications 2012-01, Vol.47 (5), p.30-35
Hauptverfasser: SalomeJ, Jacinth, M Suresh, R
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 35
container_issue 5
container_start_page 30
container_title International journal of computer applications
container_volume 47
creator SalomeJ, Jacinth
M Suresh, R
description In the past decade there have been advance in technologies, the amount of biological data such as DNA sequences and microarray data have been increased tremendously. To obtain knowledge from the data, explore relationships between genes, understanding severe diseases and development of drugs for patterns from the databases of large size and high dimensionality. Information retrieval and data mining are powerful tools to extract information from the databases and/or information repositories. The integrative cluster analysis of both clinical and gene expression data has shown to be an effective alternative to overcome the abovementioned problems. In this paper, we focus on how to improve the searching and the clustering performance in genomic data from commonly used clustering techniques. In the proposed gene clustering technique, firstly, the high dimensionality of the microarray gene data is reduced using LPP. The LPP is chosen for the dimensionality reduction because of its ability of preserving locality of neighborhood relationship. Secondly, through performance experiments on real data sets, the proposed method fuzzy C-means is shown to achieve higher efficiency, clustering quality and automation than other clustering method.
doi_str_mv 10.5120/7186-9925
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1038297190</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1038297190</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1705-e516b97a146ccc45c9381881730715fbbb3076214fcb01e9feebf0145f77c8cc3</originalsourceid><addsrcrecordid>eNpd0MFKxDAQBuAgCi7rHnyDghc9VDNJ0yRHqd1VWPCi59KGiWTppjVpQd_elPUgzmX-w8cw_IRcA70XwOiDBFXmWjNxRlZUS5ErpeT5n3xJNjEeaBquWamLFclra51x6Kes6uc4YXD-I7NDyHboMau_xoAxusFnT-3UXpEL2_YRN797Td639Vv1nO9fdy_V4z43IKnIUUDZadlCURpjCmE0V6AUSE4lCNt1XQolg8KajgJqi9hZCoWwUhplDF-T29PdMQyfM8apObposO9bj8McG6BcMS1B00Rv_tHDMAefvkuKccYLXsqk7k7KhCHGgLYZgzu24TuhZumuWbprlu74D_w2XZA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1023234367</pqid></control><display><type>article</type><title>Efficient Clustering for Gene Expression Data</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>SalomeJ, Jacinth ; M Suresh, R</creator><creatorcontrib>SalomeJ, Jacinth ; M Suresh, R</creatorcontrib><description>In the past decade there have been advance in technologies, the amount of biological data such as DNA sequences and microarray data have been increased tremendously. To obtain knowledge from the data, explore relationships between genes, understanding severe diseases and development of drugs for patterns from the databases of large size and high dimensionality. Information retrieval and data mining are powerful tools to extract information from the databases and/or information repositories. The integrative cluster analysis of both clinical and gene expression data has shown to be an effective alternative to overcome the abovementioned problems. In this paper, we focus on how to improve the searching and the clustering performance in genomic data from commonly used clustering techniques. In the proposed gene clustering technique, firstly, the high dimensionality of the microarray gene data is reduced using LPP. The LPP is chosen for the dimensionality reduction because of its ability of preserving locality of neighborhood relationship. Secondly, through performance experiments on real data sets, the proposed method fuzzy C-means is shown to achieve higher efficiency, clustering quality and automation than other clustering method.</description><identifier>ISSN: 0975-8887</identifier><identifier>EISSN: 0975-8887</identifier><identifier>DOI: 10.5120/7186-9925</identifier><language>eng</language><publisher>New York: Foundation of Computer Science</publisher><subject>Automation ; Clustering ; Diseases ; Fuzzy ; Gene expression ; Genes ; Searching</subject><ispartof>International journal of computer applications, 2012-01, Vol.47 (5), p.30-35</ispartof><rights>Copyright Foundation of Computer Science 2012</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1705-e516b97a146ccc45c9381881730715fbbb3076214fcb01e9feebf0145f77c8cc3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>SalomeJ, Jacinth</creatorcontrib><creatorcontrib>M Suresh, R</creatorcontrib><title>Efficient Clustering for Gene Expression Data</title><title>International journal of computer applications</title><description>In the past decade there have been advance in technologies, the amount of biological data such as DNA sequences and microarray data have been increased tremendously. To obtain knowledge from the data, explore relationships between genes, understanding severe diseases and development of drugs for patterns from the databases of large size and high dimensionality. Information retrieval and data mining are powerful tools to extract information from the databases and/or information repositories. The integrative cluster analysis of both clinical and gene expression data has shown to be an effective alternative to overcome the abovementioned problems. In this paper, we focus on how to improve the searching and the clustering performance in genomic data from commonly used clustering techniques. In the proposed gene clustering technique, firstly, the high dimensionality of the microarray gene data is reduced using LPP. The LPP is chosen for the dimensionality reduction because of its ability of preserving locality of neighborhood relationship. Secondly, through performance experiments on real data sets, the proposed method fuzzy C-means is shown to achieve higher efficiency, clustering quality and automation than other clustering method.</description><subject>Automation</subject><subject>Clustering</subject><subject>Diseases</subject><subject>Fuzzy</subject><subject>Gene expression</subject><subject>Genes</subject><subject>Searching</subject><issn>0975-8887</issn><issn>0975-8887</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNpd0MFKxDAQBuAgCi7rHnyDghc9VDNJ0yRHqd1VWPCi59KGiWTppjVpQd_elPUgzmX-w8cw_IRcA70XwOiDBFXmWjNxRlZUS5ErpeT5n3xJNjEeaBquWamLFclra51x6Kes6uc4YXD-I7NDyHboMau_xoAxusFnT-3UXpEL2_YRN797Td639Vv1nO9fdy_V4z43IKnIUUDZadlCURpjCmE0V6AUSE4lCNt1XQolg8KajgJqi9hZCoWwUhplDF-T29PdMQyfM8apObposO9bj8McG6BcMS1B00Rv_tHDMAefvkuKccYLXsqk7k7KhCHGgLYZgzu24TuhZumuWbprlu74D_w2XZA</recordid><startdate>20120101</startdate><enddate>20120101</enddate><creator>SalomeJ, Jacinth</creator><creator>M Suresh, R</creator><general>Foundation of Computer Science</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120101</creationdate><title>Efficient Clustering for Gene Expression Data</title><author>SalomeJ, Jacinth ; M Suresh, R</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1705-e516b97a146ccc45c9381881730715fbbb3076214fcb01e9feebf0145f77c8cc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Automation</topic><topic>Clustering</topic><topic>Diseases</topic><topic>Fuzzy</topic><topic>Gene expression</topic><topic>Genes</topic><topic>Searching</topic><toplevel>online_resources</toplevel><creatorcontrib>SalomeJ, Jacinth</creatorcontrib><creatorcontrib>M Suresh, R</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal of computer applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>SalomeJ, Jacinth</au><au>M Suresh, R</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient Clustering for Gene Expression Data</atitle><jtitle>International journal of computer applications</jtitle><date>2012-01-01</date><risdate>2012</risdate><volume>47</volume><issue>5</issue><spage>30</spage><epage>35</epage><pages>30-35</pages><issn>0975-8887</issn><eissn>0975-8887</eissn><abstract>In the past decade there have been advance in technologies, the amount of biological data such as DNA sequences and microarray data have been increased tremendously. To obtain knowledge from the data, explore relationships between genes, understanding severe diseases and development of drugs for patterns from the databases of large size and high dimensionality. Information retrieval and data mining are powerful tools to extract information from the databases and/or information repositories. The integrative cluster analysis of both clinical and gene expression data has shown to be an effective alternative to overcome the abovementioned problems. In this paper, we focus on how to improve the searching and the clustering performance in genomic data from commonly used clustering techniques. In the proposed gene clustering technique, firstly, the high dimensionality of the microarray gene data is reduced using LPP. The LPP is chosen for the dimensionality reduction because of its ability of preserving locality of neighborhood relationship. Secondly, through performance experiments on real data sets, the proposed method fuzzy C-means is shown to achieve higher efficiency, clustering quality and automation than other clustering method.</abstract><cop>New York</cop><pub>Foundation of Computer Science</pub><doi>10.5120/7186-9925</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0975-8887
ispartof International journal of computer applications, 2012-01, Vol.47 (5), p.30-35
issn 0975-8887
0975-8887
language eng
recordid cdi_proquest_miscellaneous_1038297190
source EZB-FREE-00999 freely available EZB journals
subjects Automation
Clustering
Diseases
Fuzzy
Gene expression
Genes
Searching
title Efficient Clustering for Gene Expression Data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T10%3A29%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20Clustering%20for%20Gene%20Expression%20Data&rft.jtitle=International%20journal%20of%20computer%20applications&rft.au=SalomeJ,%20Jacinth&rft.date=2012-01-01&rft.volume=47&rft.issue=5&rft.spage=30&rft.epage=35&rft.pages=30-35&rft.issn=0975-8887&rft.eissn=0975-8887&rft_id=info:doi/10.5120/7186-9925&rft_dat=%3Cproquest_cross%3E1038297190%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1023234367&rft_id=info:pmid/&rfr_iscdi=true