HIERARCHICAL DOCUMENT SECTIONING FOR CONTEXTUAL RETRIEVAL

According to examples, an apparatus may include a processor that may divide content of a document to be indexed into sections. The apparatus may divide and arrange each section into a hierarchy based on linguistic, spatial, or other analysis. The apparatus may identify a context of each section that...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: COWE, Brian Gibson, BLANCHFLOWER, Sean Mark
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:According to examples, an apparatus may include a processor that may divide content of a document to be indexed into sections. The apparatus may divide and arrange each section into a hierarchy based on linguistic, spatial, or other analysis. The apparatus may identify a context of each section that may provide an indication of the subject matter of the section. The apparatus may add the context to downstream sections in the hierarchy. The apparatus may generate an index entry for each section based on the content of the section and any added context from upstream sections. Thus, the index entry for a given section may be based on the context of the given section and context of upstream sections in the hierarchy. In this way, the index entries may account for not only the content of the given section, but also context from upstream sections.