System, method, and computer program product for automated factual citation detection

Bibliographic details
Main authors: Newman, Tovi Jordan; Schafer, Jacqueline Grace; Saura, Jose Demetrio
Format: Patent
Language: English
Description
Summary: A method, system, and computer program product are provided for tokenizing document citations. The method may include: tokenizing each string of a plurality of strings into at least one citation token representing at least one citation parameter, resulting in a plurality of citation tokens; grouping the plurality of citation tokens into a plurality of token groups, each token group including at least one citation token representing a separate citation in the at least one textual document; assigning metadata to each token group; and generating a normalized citation for each token group based on the metadata.
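
The four steps named in the summary (tokenize, group, assign metadata, normalize) can be illustrated with a minimal Python sketch. Nothing in it is taken from the patent itself: the token grammar (`Ex.`, `Tr.`, `Dkt.`, `at`, digits), the metadata keys (`source`, `id`, `page`), the output style, and all function names are hypothetical choices made only to show how the steps might chain together.

```python
import re
from dataclasses import dataclass, field

# Assumed token grammar, for illustration only; the abstract does not define one.
TOKEN_RE = re.compile(r"(?P<SOURCE>Ex\.|Tr\.|Dkt\.)|(?P<PIN>\bat\b)|(?P<NUMBER>\d+)")

# Assumed mapping from abbreviation to normalized source name.
SOURCE_NAMES = {"Ex.": "Exhibit", "Tr.": "Transcript", "Dkt.": "Docket"}


@dataclass
class Token:
    kind: str  # citation parameter type: SOURCE, PIN, or NUMBER
    text: str


@dataclass
class TokenGroup:
    tokens: list
    metadata: dict = field(default_factory=dict)


def tokenize(strings):
    """Step 1: turn each input string into citation tokens."""
    return [Token(m.lastgroup, m.group()) for s in strings for m in TOKEN_RE.finditer(s)]


def group_tokens(tokens):
    """Step 2: open a new group at every SOURCE token, one group per citation."""
    groups = []
    for tok in tokens:
        if tok.kind == "SOURCE":
            groups.append(TokenGroup([tok]))
        elif groups:  # tokens seen before any SOURCE are ignored
            groups[-1].tokens.append(tok)
    return groups


def assign_metadata(group):
    """Step 3: derive source, identifier, and page from the token order."""
    meta = {"source": SOURCE_NAMES[group.tokens[0].text]}
    after_pin = False
    for tok in group.tokens[1:]:
        if tok.kind == "PIN":
            after_pin = True
        elif tok.kind == "NUMBER":
            # First number before "at" is the id; first number after it is the page.
            meta.setdefault("page" if after_pin else "id", tok.text)
    group.metadata = meta
    return group


def normalize(group):
    """Step 4: render one normalized citation string from the metadata."""
    meta = group.metadata
    text = meta["source"]
    if "id" in meta:
        text += f" {meta['id']}"
    if "page" in meta:
        text += f", p. {meta['page']}"
    return text


def normalize_citations(strings):
    groups = [assign_metadata(g) for g in group_tokens(tokenize(strings))]
    return [normalize(g) for g in groups]


if __name__ == "__main__":
    for citation in normalize_citations(["See Ex. 12 at 4; Tr. at 101.", "Dkt. 7"]):
        print(citation)
```

In this sketch the grouping rule is deliberately simple: every source abbreviation starts a new citation. A real system would likely need a richer grammar and grouping logic than the one assumed here.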