SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR TOKENIZING DOCUMENT CITATIONS

A method, system, and computer program product is provided for tokenizing document citations. The method may include tokenizing each string of a plurality of strings into at least one citation token representing at least one citation parameter, resulting in a plurality of citation tokens, grouping t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: NEWMAN, Tovi Jordan, HOLLINGSHEAD, Kristina Marian, SAURA, Jose Demetrio, SCHAFER, Jacqueline Grace
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method, system, and computer program product is provided for tokenizing document citations. The method may include tokenizing each string of a plurality of strings into at least one citation token representing at least one citation parameter, resulting in a plurality of citation tokens, grouping the plurality of citation tokens into a plurality of token groups, each token group of the plurality of token groups including at least one citation token representing a separate citation in the at least one textual document, assigning metadata to each token group of the plurality of token groups, and generating a normalized citation for each token group of the plurality of token groups based on the metadata.