DDOC: Overlapping Clustering of Words for Document Classification
In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is...
Gespeichert in:
Veröffentlicht in: | Lecture notes in computer science 2004, p.127-128 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 128 |
---|---|
container_issue | |
container_start_page | 127 |
container_title | Lecture notes in computer science |
container_volume | |
creator | Cleuziou, Guillaume Martin, Lionel Clavier, Viviane Vrain, Christel |
description | In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification.
The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup. |
doi_str_mv | 10.1007/978-3-540-30213-1_17 |
format | Article |
fullrecord | <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_16177871</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>16177871</sourcerecordid><originalsourceid>FETCH-LOGICAL-p228t-7685ddfd7fab70cf0f12da87617f60798616089283d4687d7c79691143a57a063</originalsourceid><addsrcrecordid>eNotkE1PwzAMhsOXxBj7Bxx64RiIkzZOuE0dX9KkXUAco6xpUKFrq6RD4t-Tbvhi6_UjS34IuQF2B4zhvUZFBS1yRgXjICgYwBOySLFI4SGDUzIDCUCFyPUZuZoWXHBg-pzMJoRqzMUlWcT4xVLxgucoZmS5Wm3Kh2zzU4fWDkPTfWZlu49jHaax99lHH1zMfB-yVV_td3U3JsDG2PimsmPTd9fkwts21ov_PifvT49v5Qtdb55fy-WaDpyrkaJUhXPeobdbZJVnHrizCiWglwy1kiCZ0lwJl0uFDivUUgPkwhZomRRzcnu8O9hY2dYH21VNNENodjb8mvQ8okJIHD9ycZh-qIPZ9v13NMDMJNMka0aYpMccxJlJpvgDwT1gSQ</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DDOC: Overlapping Clustering of Words for Document Classification</title><source>Springer Books</source><creator>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel</creator><contributor>Melucci, Massimo ; Apostolico, Alberto</contributor><creatorcontrib>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel ; Melucci, Massimo ; Apostolico, Alberto</creatorcontrib><description>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification.
The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 3540232109</identifier><identifier>ISBN: 9783540232100</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 9783540302131</identifier><identifier>EISBN: 3540302131</identifier><identifier>DOI: 10.1007/978-3-540-30213-1_17</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied sciences ; Computer science; control theory; systems ; Data processing. List processing. Character string processing ; Exact sciences and technology ; Memory organisation. Data processing ; Software</subject><ispartof>Lecture notes in computer science, 2004, p.127-128</ispartof><rights>Springer-Verlag Berlin Heidelberg 2004</rights><rights>2004 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/978-3-540-30213-1_17$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/978-3-540-30213-1_17$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4050,4051,27925,38255,41442,42511</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=16177871$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Melucci, Massimo</contributor><contributor>Apostolico, Alberto</contributor><creatorcontrib>Cleuziou, Guillaume</creatorcontrib><creatorcontrib>Martin, Lionel</creatorcontrib><creatorcontrib>Clavier, Viviane</creatorcontrib><creatorcontrib>Vrain, Christel</creatorcontrib><title>DDOC: Overlapping Clustering of Words for Document Classification</title><title>Lecture notes in computer science</title><description>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification.
The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</description><subject>Applied sciences</subject><subject>Computer science; control theory; systems</subject><subject>Data processing. List processing. Character string processing</subject><subject>Exact sciences and technology</subject><subject>Memory organisation. Data processing</subject><subject>Software</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>3540232109</isbn><isbn>9783540232100</isbn><isbn>9783540302131</isbn><isbn>3540302131</isbn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNotkE1PwzAMhsOXxBj7Bxx64RiIkzZOuE0dX9KkXUAco6xpUKFrq6RD4t-Tbvhi6_UjS34IuQF2B4zhvUZFBS1yRgXjICgYwBOySLFI4SGDUzIDCUCFyPUZuZoWXHBg-pzMJoRqzMUlWcT4xVLxgucoZmS5Wm3Kh2zzU4fWDkPTfWZlu49jHaax99lHH1zMfB-yVV_td3U3JsDG2PimsmPTd9fkwts21ov_PifvT49v5Qtdb55fy-WaDpyrkaJUhXPeobdbZJVnHrizCiWglwy1kiCZ0lwJl0uFDivUUgPkwhZomRRzcnu8O9hY2dYH21VNNENodjb8mvQ8okJIHD9ycZh-qIPZ9v13NMDMJNMka0aYpMccxJlJpvgDwT1gSQ</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Cleuziou, Guillaume</creator><creator>Martin, Lionel</creator><creator>Clavier, Viviane</creator><creator>Vrain, Christel</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>DDOC: Overlapping Clustering of Words for Document Classification</title><author>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p228t-7685ddfd7fab70cf0f12da87617f60798616089283d4687d7c79691143a57a063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Computer science; control theory; systems</topic><topic>Data processing. List processing. Character string processing</topic><topic>Exact sciences and technology</topic><topic>Memory organisation. Data processing</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cleuziou, Guillaume</creatorcontrib><creatorcontrib>Martin, Lionel</creatorcontrib><creatorcontrib>Clavier, Viviane</creatorcontrib><creatorcontrib>Vrain, Christel</creatorcontrib><collection>Pascal-Francis</collection><jtitle>Lecture notes in computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cleuziou, Guillaume</au><au>Martin, Lionel</au><au>Clavier, Viviane</au><au>Vrain, Christel</au><au>Melucci, Massimo</au><au>Apostolico, Alberto</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DDOC: Overlapping Clustering of Words for Document Classification</atitle><jtitle>Lecture notes in computer science</jtitle><date>2004</date><risdate>2004</risdate><spage>127</spage><epage>128</epage><pages>127-128</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>3540232109</isbn><isbn>9783540232100</isbn><eisbn>9783540302131</eisbn><eisbn>3540302131</eisbn><abstract>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification.
The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/978-3-540-30213-1_17</doi><tpages>2</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0302-9743 |
ispartof | Lecture notes in computer science, 2004, p.127-128 |
issn | 0302-9743 1611-3349 |
language | eng |
recordid | cdi_pascalfrancis_primary_16177871 |
source | Springer Books |
subjects | Applied sciences Computer science control theory systems Data processing. List processing. Character string processing Exact sciences and technology Memory organisation. Data processing Software |
title | DDOC: Overlapping Clustering of Words for Document Classification |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T20%3A34%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DDOC:%20Overlapping%20Clustering%20of%20Words%20for%20Document%20Classification&rft.jtitle=Lecture%20notes%20in%20computer%20science&rft.au=Cleuziou,%20Guillaume&rft.date=2004&rft.spage=127&rft.epage=128&rft.pages=127-128&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=3540232109&rft.isbn_list=9783540232100&rft_id=info:doi/10.1007/978-3-540-30213-1_17&rft_dat=%3Cpascalfrancis_sprin%3E16177871%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783540302131&rft.eisbn_list=3540302131&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |