Approaches to Cross-Language Retrieval of Similar Legal Documents Based on Machine Learning

Bibliographic Details
Published in:	Scientific and technical information processing 2023-12, Vol.50 (5), p.494-499
Main authors:	Zhebel, V. V., Devyatkin, D. A., Zubarev, D. V., Sochenkov, I. V.
Format:	Article
Language:	English
Subjects:	Computer Science; Computer Systems Organization and Communication Networks; Digital systems; Documents; Information retrieval; Languages; Legislation; Machine learning; Semantics; Translations
Online access:	Full text

Description
Summary:	Studying global experience in amending legislation and in rule-making requires tools for retrieving regulatory documents written in different languages. One aspect of this retrieval is finding documents that are thematically similar to a given input document. In this context, an important cross-lingual search task arises: the user of an information system specifies a reference document in one language, and the search results contain relevant documents in other languages. The article describes different approaches to solving this problem, from classic mediator-based methods to more modern solutions based on distributional semantics. The test collection used in the study was taken from the United Nations Digital Library, which provides legal documents in the original English together with their Russian translations.
ISSN:	0147-6882 1934-8118
DOI:	10.3103/S0147688223050167
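
The cross-lingual task described in the summary can be illustrated with a minimal sketch of a mediator-style approach: candidate documents are mapped word by word into a pivot language and then compared with the reference document by cosine similarity over bag-of-words vectors. This is not the authors' implementation. The toy dictionary `RU_EN`, the sample documents, and the function names `to_pivot`, `cosine`, and `rank_similar` are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

# Hypothetical toy Russian-to-English dictionary acting as the "mediator".
# A real system would use a full bilingual lexicon or machine translation.
RU_EN = {
    "договор": "treaty",
    "права": "rights",
    "человека": "human",
    "торговля": "trade",
    "оружием": "arms",
}


def to_pivot(text, dictionary):
    """Map each token to the pivot language; drop tokens with no entry."""
    tokens = text.lower().split()
    return [dictionary[t] for t in tokens if t in dictionary]


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_similar(query_en, candidates_ru, dictionary=RU_EN):
    """Rank Russian candidates by similarity to an English reference text."""
    query_vec = Counter(query_en.lower().split())
    scored = []
    for doc_id, text in candidates_ru.items():
        doc_vec = Counter(to_pivot(text, dictionary))
        scored.append((doc_id, cosine(query_vec, doc_vec)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical sample data: an English reference and two Russian candidates.
candidates = {
    "ru-1": "договор торговля оружием",
    "ru-2": "права человека",
}
ranking = rank_similar("human rights treaty", candidates)
```

Here `ranking` lists the candidate identifiers with their scores, most similar first. An approach based on distributional semantics would presumably replace the dictionary lookup with vectors from a shared multilingual embedding space, while the ranking step could stay the same.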