Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis

We are designing new data mining techniques on boolean contexts to identify a priori interesting bi-sets, i.e., sets of objects (or transactions) and associated sets of attributes (or items). It improves the state of the art in many application domains where transactional/boolean data are to be mine...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Intelligent data analysis 2005, Vol.9 (1), p.59-82
Hauptverfasser: Besson, Jérémy, Robardet, Céline, Boulicaut, Jean-François, Rome, Sophie
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 82
container_issue 1
container_start_page 59
container_title Intelligent data analysis
container_volume 9
creator Besson, Jérémy
Robardet, Céline
Boulicaut, Jean-François
Rome, Sophie
description We are designing new data mining techniques on boolean contexts to identify a priori interesting bi-sets, i.e., sets of objects (or transactions) and associated sets of attributes (or items). It improves the state of the art in many application domains where transactional/boolean data are to be mined (e.g., basket analysis, WWW usage mining, gene expression data analysis). The so-called (formal) concepts are important special cases of a priori interesting bi-sets that associate closed sets on both dimensions thanks to the Galois operators. Concept mining in boolean data is tractable provided that at least one of the dimensions (number of objects or attributes) is small enough and the data is not too dense. The task is extremely hard otherwise. Furthermore, it is important to enable user-defined constraints on the desired bi-sets and use them during the extraction to increase both the efficiency and the a priori interestingness of the extracted patterns. It leads us to the design of a new algorithm, called D-Miner, for mining concepts under constraints. We provide an experimental validation on benchmark data sets. Moreover, we introduce an original data mining technique for microarray data analysis. Not only boolean expression properties of genes are recorded but also we add biological information about transcription factors. In such a context, D-Miner can be used for concept mining under constraints and outperforms the other studied algorithms. We show also that data enrichment is useful for evaluating the biological relevancy of the extracted concepts.
format Article
fullrecord <record><control><sourceid>hal</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_01535568v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>oai_HAL_hal_01535568v1</sourcerecordid><originalsourceid>FETCH-hal_primary_oai_HAL_hal_01535568v13</originalsourceid><addsrcrecordid>eNqVissKwjAQAHNQsD7-Ya8eCql92GupSg96E_FW1rbqSpqEbBD691bwBzwNzMxEBJHM8zDJtteZmDO_pJTJRiaBuJRGs3dI2oc35K6Fg3E9Khh901kPJ9KkH4C6BfIMhbWKGvRkNHgz1sYZdA4H2KFHKDSqgYmXYnpHxd3qx4VYH_bnsgqfqGrrqEc31Aapropj_XUySuM0zfJ3FP_zfgC3-ERk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis</title><source>EBSCOhost Business Source Complete</source><creator>Besson, Jérémy ; Robardet, Céline ; Boulicaut, Jean-François ; Rome, Sophie</creator><creatorcontrib>Besson, Jérémy ; Robardet, Céline ; Boulicaut, Jean-François ; Rome, Sophie</creatorcontrib><description>We are designing new data mining techniques on boolean contexts to identify a priori interesting bi-sets, i.e., sets of objects (or transactions) and associated sets of attributes (or items). It improves the state of the art in many application domains where transactional/boolean data are to be mined (e.g., basket analysis, WWW usage mining, gene expression data analysis). The so-called (formal) concepts are important special cases of a priori interesting bi-sets that associate closed sets on both dimensions thanks to the Galois operators. Concept mining in boolean data is tractable provided that at least one of the dimensions (number of objects or attributes) is small enough and the data is not too dense. The task is extremely hard otherwise. Furthermore, it is important to enable user-defined constraints on the desired bi-sets and use them during the extraction to increase both the efficiency and the a priori interestingness of the extracted patterns. It leads us to the design of a new algorithm, called D-Miner, for mining concepts under constraints. We provide an experimental validation on benchmark data sets. Moreover, we introduce an original data mining technique for microarray data analysis. Not only boolean expression properties of genes are recorded but also we add biological information about transcription factors. In such a context, D-Miner can be used for concept mining under constraints and outperforms the other studied algorithms. We show also that data enrichment is useful for evaluating the biological relevancy of the extracted concepts.</description><identifier>ISSN: 1088-467X</identifier><language>eng</language><publisher>IOS Press</publisher><subject>Artificial Intelligence ; Computer Science ; Data Structures and Algorithms ; Machine Learning</subject><ispartof>Intelligent data analysis, 2005, Vol.9 (1), p.59-82</ispartof><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-8583-9408 ; 0000-0002-8583-9408</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,4009</link.rule.ids><backlink>$$Uhttps://hal.science/hal-01535568$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Besson, Jérémy</creatorcontrib><creatorcontrib>Robardet, Céline</creatorcontrib><creatorcontrib>Boulicaut, Jean-François</creatorcontrib><creatorcontrib>Rome, Sophie</creatorcontrib><title>Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis</title><title>Intelligent data analysis</title><description>We are designing new data mining techniques on boolean contexts to identify a priori interesting bi-sets, i.e., sets of objects (or transactions) and associated sets of attributes (or items). It improves the state of the art in many application domains where transactional/boolean data are to be mined (e.g., basket analysis, WWW usage mining, gene expression data analysis). The so-called (formal) concepts are important special cases of a priori interesting bi-sets that associate closed sets on both dimensions thanks to the Galois operators. Concept mining in boolean data is tractable provided that at least one of the dimensions (number of objects or attributes) is small enough and the data is not too dense. The task is extremely hard otherwise. Furthermore, it is important to enable user-defined constraints on the desired bi-sets and use them during the extraction to increase both the efficiency and the a priori interestingness of the extracted patterns. It leads us to the design of a new algorithm, called D-Miner, for mining concepts under constraints. We provide an experimental validation on benchmark data sets. Moreover, we introduce an original data mining technique for microarray data analysis. Not only boolean expression properties of genes are recorded but also we add biological information about transcription factors. In such a context, D-Miner can be used for concept mining under constraints and outperforms the other studied algorithms. We show also that data enrichment is useful for evaluating the biological relevancy of the extracted concepts.</description><subject>Artificial Intelligence</subject><subject>Computer Science</subject><subject>Data Structures and Algorithms</subject><subject>Machine Learning</subject><issn>1088-467X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqVissKwjAQAHNQsD7-Ya8eCql92GupSg96E_FW1rbqSpqEbBD691bwBzwNzMxEBJHM8zDJtteZmDO_pJTJRiaBuJRGs3dI2oc35K6Fg3E9Khh901kPJ9KkH4C6BfIMhbWKGvRkNHgz1sYZdA4H2KFHKDSqgYmXYnpHxd3qx4VYH_bnsgqfqGrrqEc31Aapropj_XUySuM0zfJ3FP_zfgC3-ERk</recordid><startdate>2005</startdate><enddate>2005</enddate><creator>Besson, Jérémy</creator><creator>Robardet, Céline</creator><creator>Boulicaut, Jean-François</creator><creator>Rome, Sophie</creator><general>IOS Press</general><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0002-8583-9408</orcidid><orcidid>https://orcid.org/0000-0002-8583-9408</orcidid></search><sort><creationdate>2005</creationdate><title>Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis</title><author>Besson, Jérémy ; Robardet, Céline ; Boulicaut, Jean-François ; Rome, Sophie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-hal_primary_oai_HAL_hal_01535568v13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Artificial Intelligence</topic><topic>Computer Science</topic><topic>Data Structures and Algorithms</topic><topic>Machine Learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Besson, Jérémy</creatorcontrib><creatorcontrib>Robardet, Céline</creatorcontrib><creatorcontrib>Boulicaut, Jean-François</creatorcontrib><creatorcontrib>Rome, Sophie</creatorcontrib><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>Intelligent data analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Besson, Jérémy</au><au>Robardet, Céline</au><au>Boulicaut, Jean-François</au><au>Rome, Sophie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis</atitle><jtitle>Intelligent data analysis</jtitle><date>2005</date><risdate>2005</risdate><volume>9</volume><issue>1</issue><spage>59</spage><epage>82</epage><pages>59-82</pages><issn>1088-467X</issn><abstract>We are designing new data mining techniques on boolean contexts to identify a priori interesting bi-sets, i.e., sets of objects (or transactions) and associated sets of attributes (or items). It improves the state of the art in many application domains where transactional/boolean data are to be mined (e.g., basket analysis, WWW usage mining, gene expression data analysis). The so-called (formal) concepts are important special cases of a priori interesting bi-sets that associate closed sets on both dimensions thanks to the Galois operators. Concept mining in boolean data is tractable provided that at least one of the dimensions (number of objects or attributes) is small enough and the data is not too dense. The task is extremely hard otherwise. Furthermore, it is important to enable user-defined constraints on the desired bi-sets and use them during the extraction to increase both the efficiency and the a priori interestingness of the extracted patterns. It leads us to the design of a new algorithm, called D-Miner, for mining concepts under constraints. We provide an experimental validation on benchmark data sets. Moreover, we introduce an original data mining technique for microarray data analysis. Not only boolean expression properties of genes are recorded but also we add biological information about transcription factors. In such a context, D-Miner can be used for concept mining under constraints and outperforms the other studied algorithms. We show also that data enrichment is useful for evaluating the biological relevancy of the extracted concepts.</abstract><pub>IOS Press</pub><orcidid>https://orcid.org/0000-0002-8583-9408</orcidid><orcidid>https://orcid.org/0000-0002-8583-9408</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1088-467X
ispartof Intelligent data analysis, 2005, Vol.9 (1), p.59-82
issn 1088-467X
language eng
recordid cdi_hal_primary_oai_HAL_hal_01535568v1
source EBSCOhost Business Source Complete
subjects Artificial Intelligence
Computer Science
Data Structures and Algorithms
Machine Learning
title Constraint-based Formal Concept Mining and its Application to Microarray Data Analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T01%3A44%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Constraint-based%20Formal%20Concept%20Mining%20and%20its%20Application%20to%20Microarray%20Data%20Analysis&rft.jtitle=Intelligent%20data%20analysis&rft.au=Besson,%20J%C3%A9r%C3%A9my&rft.date=2005&rft.volume=9&rft.issue=1&rft.spage=59&rft.epage=82&rft.pages=59-82&rft.issn=1088-467X&rft_id=info:doi/&rft_dat=%3Chal%3Eoai_HAL_hal_01535568v1%3C/hal%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true