ENHANCED NAMED ENTITY RECOGNITION (NER) USING CUSTOM-BUILT REGULAR EXPRESSION (REGEX) MATCHER AND HEURISTIC ENTITY RULER

Pre-trained models for Named Entity Recognition (NER) come with static NE classes, limited in number, and remain same irrespective of domain of the input text. Thus, domain specific training is required. Embodiments of the present disclosure provide a method and system for enhanced NER using a custo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ROY, ASHIM, PATWARDHAN, NIKHIL GIRISH, HOSUDURG, ANANTHA DESIK PURANAM, NAMAN, SUMIRAN
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Pre-trained models for Named Entity Recognition (NER) come with static NE classes, limited in number, and remain same irrespective of domain of the input text. Thus, domain specific training is required. Embodiments of the present disclosure provide a method and system for enhanced NER using a custom-built REGEX matcher and a heuristic entity ruler. The invention helps in discovering the NE's of the given text with pipeline-based approach with combination of models of NLP transformer, custom-built REGEX, and heuristic entity rules. The method automatically handles class resolution based on the heuristic entity ruler. The method enables a user to customize or add any new heuristic rules for entity ruler or custom regex as a knowledgebase to train the model with automatic relearning and unlearning. The extracted NEs are provided for further processing or masking in a structured format.