Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval

A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. Thi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	ACM transactions on computer systems 2004-04, Vol.22 (2), p.215-241
Hauptverfasser:	Mana-Lopez, M J, De Buenaga, M, Gomez-Hidalgo, J M
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	241
container_issue	2
container_start_page	215
container_title	ACM transactions on computer systems
container_volume	22
creator	Mana-Lopez, M J De Buenaga, M Gomez-Hidalgo, J M
description	A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. This kind of task, called instance or aspectual retrieval, has been explored in several TREC Interactive Tracks. In this article, we propose in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Two kinds of summaries are provided. The first one covers the similarities of each cluster of documents retrieved. The second one shows the particularities of each document with respect to the common topic in the cluster. The document multitopic structure has been used in order to determine similarities and differences of topics in the cluster of documents. The system is independent of document domain and genre. An evaluation of the proposed system with users proves significant improvements in effectiveness. The results of previous experiments that have compared clustering algorithms are also reported.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_28248471</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>28248471</sourcerecordid><originalsourceid>FETCH-proquest_miscellaneous_282484713</originalsourceid><addsrcrecordid>eNqNyrsKwjAUgOEMCtbLO5zJrZA2hRa3UhQdXFRcS2iOEslFk5MOPr0OPoDT_w3_hGW8FlVe8rqYsXmMD865EKLM2OmYDGnlh2TREZyTtTLotyTt3QZaB61SqOAqTUIgD51JkTBodwft4OC-lgPpEeGEFDSO0izZ9CZNxNWvC7bebS_dPn8G_0oYqbc6DmiMdOhT7MumrJqqLsTf4wdBrUKH</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>28248471</pqid></control><display><type>article</type><title>Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval</title><source>ACM Digital Library Complete</source><creator>Mana-Lopez, M J ; De Buenaga, M ; Gomez-Hidalgo, J M</creator><creatorcontrib>Mana-Lopez, M J ; De Buenaga, M ; Gomez-Hidalgo, J M</creatorcontrib><description>A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. This kind of task, called instance or aspectual retrieval, has been explored in several TREC Interactive Tracks. In this article, we propose in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Two kinds of summaries are provided. The first one covers the similarities of each cluster of documents retrieved. The second one shows the particularities of each document with respect to the common topic in the cluster. The document multitopic structure has been used in order to determine similarities and differences of topics in the cluster of documents. The system is independent of document domain and genre. An evaluation of the proposed system with users proves significant improvements in effectiveness. The results of previous experiments that have compared clustering algorithms are also reported.</description><identifier>ISSN: 0734-2071</identifier><language>eng</language><ispartof>ACM transactions on computer systems, 2004-04, Vol.22 (2), p.215-241</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784</link.rule.ids></links><search><creatorcontrib>Mana-Lopez, M J</creatorcontrib><creatorcontrib>De Buenaga, M</creatorcontrib><creatorcontrib>Gomez-Hidalgo, J M</creatorcontrib><title>Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval</title><title>ACM transactions on computer systems</title><description>A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. This kind of task, called instance or aspectual retrieval, has been explored in several TREC Interactive Tracks. In this article, we propose in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Two kinds of summaries are provided. The first one covers the similarities of each cluster of documents retrieved. The second one shows the particularities of each document with respect to the common topic in the cluster. The document multitopic structure has been used in order to determine similarities and differences of topics in the cluster of documents. The system is independent of document domain and genre. An evaluation of the proposed system with users proves significant improvements in effectiveness. The results of previous experiments that have compared clustering algorithms are also reported.</description><issn>0734-2071</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNqNyrsKwjAUgOEMCtbLO5zJrZA2hRa3UhQdXFRcS2iOEslFk5MOPr0OPoDT_w3_hGW8FlVe8rqYsXmMD865EKLM2OmYDGnlh2TREZyTtTLotyTt3QZaB61SqOAqTUIgD51JkTBodwft4OC-lgPpEeGEFDSO0izZ9CZNxNWvC7bebS_dPn8G_0oYqbc6DmiMdOhT7MumrJqqLsTf4wdBrUKH</recordid><startdate>20040401</startdate><enddate>20040401</enddate><creator>Mana-Lopez, M J</creator><creator>De Buenaga, M</creator><creator>Gomez-Hidalgo, J M</creator><scope>7SC</scope><scope>8FD</scope><scope>H8D</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20040401</creationdate><title>Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval</title><author>Mana-Lopez, M J ; De Buenaga, M ; Gomez-Hidalgo, J M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_miscellaneous_282484713</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mana-Lopez, M J</creatorcontrib><creatorcontrib>De Buenaga, M</creatorcontrib><creatorcontrib>Gomez-Hidalgo, J M</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on computer systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mana-Lopez, M J</au><au>De Buenaga, M</au><au>Gomez-Hidalgo, J M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval</atitle><jtitle>ACM transactions on computer systems</jtitle><date>2004-04-01</date><risdate>2004</risdate><volume>22</volume><issue>2</issue><spage>215</spage><epage>241</epage><pages>215-241</pages><issn>0734-2071</issn><abstract>A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. This kind of task, called instance or aspectual retrieval, has been explored in several TREC Interactive Tracks. In this article, we propose in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Two kinds of summaries are provided. The first one covers the similarities of each cluster of documents retrieved. The second one shows the particularities of each document with respect to the common topic in the cluster. The document multitopic structure has been used in order to determine similarities and differences of topics in the cluster of documents. The system is independent of document domain and genre. An evaluation of the proposed system with users proves significant improvements in effectiveness. The results of previous experiments that have compared clustering algorithms are also reported.</abstract></addata></record>
fulltext	fulltext
identifier	ISSN: 0734-2071
ispartof	ACM transactions on computer systems, 2004-04, Vol.22 (2), p.215-241
issn	0734-2071
language	eng
recordid	cdi_proquest_miscellaneous_28248471
source	ACM Digital Library Complete
title	Multidocument Summarization: An Added Value to Clustering in Interactive Retrieval
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T00%3A38%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multidocument%20Summarization:%20An%20Added%20Value%20to%20Clustering%20in%20Interactive%20Retrieval&rft.jtitle=ACM%20transactions%20on%20computer%20systems&rft.au=Mana-Lopez,%20M%20J&rft.date=2004-04-01&rft.volume=22&rft.issue=2&rft.spage=215&rft.epage=241&rft.pages=215-241&rft.issn=0734-2071&rft_id=info:doi/&rft_dat=%3Cproquest%3E28248471%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=28248471&rft_id=info:pmid/&rfr_iscdi=true