Cross-language document summarization via extraction and ranking of multiple summaries

The task of cross-language document summarization aims to produce a summary in a target language (e.g., Chinese) for a given document set in a different source language (e.g., English). Previous studies focus on ranking and selection of translated sentences in the target language. In this paper, we...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Knowledge and information systems 2019-02, Vol.58 (2), p.481-499
Hauptverfasser:	Wan, Xiaojun, Luo, Fuli, Sun, Xue, Huang, Songfang, Yao, Jin-ge
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science Data Mining and Knowledge Discovery Database Management Information Storage and Retrieval Information Systems and Communication Service Information Systems Applications (incl.Internet) IT in Business Language Ranking Sentences Short Paper Summaries Translations Upper bounds
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The task of cross-language document summarization aims to produce a summary in a target language (e.g., Chinese) for a given document set in a different source language (e.g., English). Previous studies focus on ranking and selection of translated sentences in the target language. In this paper, we propose a new framework for addressing the task by extraction and ranking of multiple summaries in the target language. First, we extract multiple candidate summaries by proposing several schemes for improving the upper-bound quality of the summaries. Then, we propose a new ensemble ranking method for ranking the candidate summaries by making use of bilingual features. Extensive experiments have been conducted on a benchmark dataset and the results verify the effectiveness of our proposed framework, which outperforms a variety of baselines, including supervised baselines.
ISSN:	0219-1377 0219-3116
DOI:	10.1007/s10115-018-1152-7