Systems and Methods for Expert Driven Document Identification

Systems and methods for identifying data strings in electronic documents using pattern recognition. The method includes receiving a first data string corresponding to an electronic reference document from a first database and a second data string corresponding to an electronic legal document from a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Girish, Greeshma, Sharma, Shreyash Kumar, Narulkar, Gunjan
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems and methods for identifying data strings in electronic documents using pattern recognition. The method includes receiving a first data string corresponding to an electronic reference document from a first database and a second data string corresponding to an electronic legal document from a second database. The method also includes processing the first data string into a first processed data string and processing the second data string into a second processed data string. The method also includes calculating a cosine similarity between the first processed data string and the second processed data string. The method also includes receiving a feedback score from a user which corresponds to an accuracy of the calculated cosine similarity. The method also includes calculating an adjusted cosine similarity between the first processed data string and the second processed data string based on the calculated cosine similarity and the feedback score.