DDOC: Overlapping Clustering of Words for Document Classification

In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Lecture notes in computer science 2004, p.127-128
Hauptverfasser: Cleuziou, Guillaume, Martin, Lionel, Clavier, Viviane, Vrain, Christel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 128
container_issue
container_start_page 127
container_title Lecture notes in computer science
container_volume
creator Cleuziou, Guillaume
Martin, Lionel
Clavier, Viviane
Vrain, Christel
description In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.
doi_str_mv 10.1007/978-3-540-30213-1_17
format Article
fullrecord <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_16177871</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>16177871</sourcerecordid><originalsourceid>FETCH-LOGICAL-p228t-7685ddfd7fab70cf0f12da87617f60798616089283d4687d7c79691143a57a063</originalsourceid><addsrcrecordid>eNotkE1PwzAMhsOXxBj7Bxx64RiIkzZOuE0dX9KkXUAco6xpUKFrq6RD4t-Tbvhi6_UjS34IuQF2B4zhvUZFBS1yRgXjICgYwBOySLFI4SGDUzIDCUCFyPUZuZoWXHBg-pzMJoRqzMUlWcT4xVLxgucoZmS5Wm3Kh2zzU4fWDkPTfWZlu49jHaax99lHH1zMfB-yVV_td3U3JsDG2PimsmPTd9fkwts21ov_PifvT49v5Qtdb55fy-WaDpyrkaJUhXPeobdbZJVnHrizCiWglwy1kiCZ0lwJl0uFDivUUgPkwhZomRRzcnu8O9hY2dYH21VNNENodjb8mvQ8okJIHD9ycZh-qIPZ9v13NMDMJNMka0aYpMccxJlJpvgDwT1gSQ</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DDOC: Overlapping Clustering of Words for Document Classification</title><source>Springer Books</source><creator>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel</creator><contributor>Melucci, Massimo ; Apostolico, Alberto</contributor><creatorcontrib>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel ; Melucci, Massimo ; Apostolico, Alberto</creatorcontrib><description>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 3540232109</identifier><identifier>ISBN: 9783540232100</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 9783540302131</identifier><identifier>EISBN: 3540302131</identifier><identifier>DOI: 10.1007/978-3-540-30213-1_17</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied sciences ; Computer science; control theory; systems ; Data processing. List processing. Character string processing ; Exact sciences and technology ; Memory organisation. Data processing ; Software</subject><ispartof>Lecture notes in computer science, 2004, p.127-128</ispartof><rights>Springer-Verlag Berlin Heidelberg 2004</rights><rights>2004 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/978-3-540-30213-1_17$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/978-3-540-30213-1_17$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4050,4051,27925,38255,41442,42511</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=16177871$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Melucci, Massimo</contributor><contributor>Apostolico, Alberto</contributor><creatorcontrib>Cleuziou, Guillaume</creatorcontrib><creatorcontrib>Martin, Lionel</creatorcontrib><creatorcontrib>Clavier, Viviane</creatorcontrib><creatorcontrib>Vrain, Christel</creatorcontrib><title>DDOC: Overlapping Clustering of Words for Document Classification</title><title>Lecture notes in computer science</title><description>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</description><subject>Applied sciences</subject><subject>Computer science; control theory; systems</subject><subject>Data processing. List processing. Character string processing</subject><subject>Exact sciences and technology</subject><subject>Memory organisation. Data processing</subject><subject>Software</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>3540232109</isbn><isbn>9783540232100</isbn><isbn>9783540302131</isbn><isbn>3540302131</isbn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNotkE1PwzAMhsOXxBj7Bxx64RiIkzZOuE0dX9KkXUAco6xpUKFrq6RD4t-Tbvhi6_UjS34IuQF2B4zhvUZFBS1yRgXjICgYwBOySLFI4SGDUzIDCUCFyPUZuZoWXHBg-pzMJoRqzMUlWcT4xVLxgucoZmS5Wm3Kh2zzU4fWDkPTfWZlu49jHaax99lHH1zMfB-yVV_td3U3JsDG2PimsmPTd9fkwts21ov_PifvT49v5Qtdb55fy-WaDpyrkaJUhXPeobdbZJVnHrizCiWglwy1kiCZ0lwJl0uFDivUUgPkwhZomRRzcnu8O9hY2dYH21VNNENodjb8mvQ8okJIHD9ycZh-qIPZ9v13NMDMJNMka0aYpMccxJlJpvgDwT1gSQ</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Cleuziou, Guillaume</creator><creator>Martin, Lionel</creator><creator>Clavier, Viviane</creator><creator>Vrain, Christel</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>DDOC: Overlapping Clustering of Words for Document Classification</title><author>Cleuziou, Guillaume ; Martin, Lionel ; Clavier, Viviane ; Vrain, Christel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p228t-7685ddfd7fab70cf0f12da87617f60798616089283d4687d7c79691143a57a063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Computer science; control theory; systems</topic><topic>Data processing. List processing. Character string processing</topic><topic>Exact sciences and technology</topic><topic>Memory organisation. Data processing</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cleuziou, Guillaume</creatorcontrib><creatorcontrib>Martin, Lionel</creatorcontrib><creatorcontrib>Clavier, Viviane</creatorcontrib><creatorcontrib>Vrain, Christel</creatorcontrib><collection>Pascal-Francis</collection><jtitle>Lecture notes in computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cleuziou, Guillaume</au><au>Martin, Lionel</au><au>Clavier, Viviane</au><au>Vrain, Christel</au><au>Melucci, Massimo</au><au>Apostolico, Alberto</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DDOC: Overlapping Clustering of Words for Document Classification</atitle><jtitle>Lecture notes in computer science</jtitle><date>2004</date><risdate>2004</risdate><spage>127</spage><epage>128</epage><pages>127-128</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>3540232109</isbn><isbn>9783540232100</isbn><eisbn>9783540302131</eisbn><eisbn>3540302131</eisbn><abstract>In this paper we study the interest of integration of an overlapping clustering approach rather than traditional hard-clustering ones, in the context of dimensionality reduction of the description space for document classification. The Distributional Divisive Overlapping Clustering (DDOC) method is briefly presented and compared to Agglomerative Distributional Clustering (ADC) [2] and Information-Theoretical Divisive Clustering (ITDC) [3] on the two corpus Reuters-21578 and 20Newsgroup.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/978-3-540-30213-1_17</doi><tpages>2</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof Lecture notes in computer science, 2004, p.127-128
issn 0302-9743
1611-3349
language eng
recordid cdi_pascalfrancis_primary_16177871
source Springer Books
subjects Applied sciences
Computer science
control theory
systems
Data processing. List processing. Character string processing
Exact sciences and technology
Memory organisation. Data processing
Software
title DDOC: Overlapping Clustering of Words for Document Classification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T20%3A34%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DDOC:%20Overlapping%20Clustering%20of%20Words%20for%20Document%20Classification&rft.jtitle=Lecture%20notes%20in%20computer%20science&rft.au=Cleuziou,%20Guillaume&rft.date=2004&rft.spage=127&rft.epage=128&rft.pages=127-128&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=3540232109&rft.isbn_list=9783540232100&rft_id=info:doi/10.1007/978-3-540-30213-1_17&rft_dat=%3Cpascalfrancis_sprin%3E16177871%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783540302131&rft.eisbn_list=3540302131&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true