A novel multi document summarization with document-elements augmentation for learning materials using concept based ILP and clustering methods

Bibliographic details
Published in: International journal of computers & applications, 2024-02, Vol. 46 (2), p. 78-89
Main authors: Sakkaravarthy Iyyappan, K.; Balasundaram, S. R.
Format: Article
Language: English
Online access: Full text
Description
Abstract: Multi Document Summarization (MDS) is a technique for extracting succinct summaries from groups of related documents. In e-learning, MDS is appealing for summarizing learning materials, which helps students and teachers focus on the key concepts of those materials. Learning materials commonly contain non-textual document-elements such as figures/diagrams, plots, graphs, tables and algorithms, but such elements cannot be kept in a text summary because they are non-textual. The proposed work combines text summarization with document-element augmentation of the summary to provide detailed coverage of the information without exceeding the summary length constraint. Key information in the source text is identified using important phrase features and sentence features, and summaries are generated by selecting important sentences within an Integer Linear Programming (ILP) framework while reducing redundancy using pre-trained sentence vectors. The relationships between the summary and the document-elements are identified through document-element snippet extraction and a Hierarchical Agglomerative Clustering approach. Experimental results for the proposed summary extraction and augmentation on an educational dataset (EduSumm) show better performance than the state-of-the-art approaches.
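
The sentence-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: the abstract gives no objective, weights, or constraints, so everything below is an assumption. The sketch maximizes the total weight of covered concepts under a word budget and forbids pairs of sentences whose vectors are too similar. The toy sentences, the concept weights, the three-dimensional vectors (standing in for pre-trained sentence vectors), and the `max_sim` threshold are all hypothetical, and exhaustive enumeration replaces a real ILP solver, which is only feasible for a handful of sentences.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def select_sentences(sentences, concept_weights, vectors, max_words, max_sim=0.9):
    """Solve the 0/1 selection problem by brute force (toy inputs only).

    Objective (assumed): sum of weights of distinct concepts covered.
    Constraints (assumed): total words <= max_words, and no selected pair
    with cosine similarity above max_sim.
    """
    n = len(sentences)
    best_score, best_subset = 0.0, ()
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            # Length constraint on the summary.
            words = sum(len(sentences[i]["text"].split()) for i in subset)
            if words > max_words:
                continue
            # Redundancy constraint on every selected pair.
            if any(cosine(vectors[i], vectors[j]) > max_sim
                   for i, j in combinations(subset, 2)):
                continue
            # Each concept counts once, however many sentences mention it.
            covered = set().union(*(sentences[i]["concepts"] for i in subset))
            score = sum(concept_weights[c] for c in covered)
            if score > best_score:
                best_score, best_subset = score, subset
    return list(best_subset), best_score


# Hypothetical toy data, not taken from EduSumm.
sentences = [
    {"text": "ILP selects sentences under a length budget",
     "concepts": {"ilp", "length budget"}},
    {"text": "Sentence selection uses ILP with a length budget",
     "concepts": {"ilp", "length budget"}},
    {"text": "Clustering links figures to summary sentences",
     "concepts": {"clustering", "figures"}},
    {"text": "Tables and figures are common in learning materials",
     "concepts": {"figures", "tables"}},
]
concept_weights = {"ilp": 3, "length budget": 1, "clustering": 2,
                   "figures": 2, "tables": 1}
vectors = [[1.0, 0.0, 0.0], [0.95, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 0.6, 0.8]]

chosen, score = select_sentences(sentences, concept_weights, vectors, max_words=15)
print(chosen, score)
```

With these toy inputs the near-duplicate pair (sentences 0 and 1) is rejected by the similarity constraint, and the budget of 15 words admits at most two sentences. For realistic input sizes the same objective and constraints would be handed to an ILP solver instead of being enumerated.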
ISSN: 1206-212X, 1925-7074
DOI: 10.1080/1206212X.2023.2284446