NEREL: A Russian Dataset with Nested Named Entities, Relations and Events

In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is anno...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2021-09
Hauptverfasser: Loukachevitch, Natalia, Artemova, Ekaterina, Batura, Tatiana, Braslavski, Pavel, Denisov, Ilia, Ivanov, Vladimir, Manandhar, Suresh, Pugachev, Alexander, Tutubalina, Elena
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Loukachevitch, Natalia
Artemova, Ekaterina
Batura, Tatiana
Braslavski, Pavel
Denisov, Ilia
Ivanov, Vladimir
Manandhar, Suresh
Pugachev, Alexander
Tutubalina, Elena
description In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2567809211</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2567809211</sourcerecordid><originalsourceid>FETCH-proquest_journals_25678092113</originalsourceid><addsrcrecordid>eNqNjM0KwjAQhIMgWLTvsOBVId3YH72JRhSkh-K9BIyYUlPtbvX1zcEH8DID33zMSESoVLIsVogTERM1UkrMckxTFYlTqSt93sAWqoHIGQ97w4Ysw8fxHUpLbK9QmkdI7dmxs7SAyraGXecJjA_8bT3TTIxvpiUb_3oq5gd92R2Xz757DeGnbrqh92GqMc3yQq4xSdR_1heYgzrK</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2567809211</pqid></control><display><type>article</type><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><source>Free E- Journals</source><creator>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</creator><creatorcontrib>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</creatorcontrib><description>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Datasets</subject><ispartof>arXiv.org, 2021-09</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Loukachevitch, Natalia</creatorcontrib><creatorcontrib>Artemova, Ekaterina</creatorcontrib><creatorcontrib>Batura, Tatiana</creatorcontrib><creatorcontrib>Braslavski, Pavel</creatorcontrib><creatorcontrib>Denisov, Ilia</creatorcontrib><creatorcontrib>Ivanov, Vladimir</creatorcontrib><creatorcontrib>Manandhar, Suresh</creatorcontrib><creatorcontrib>Pugachev, Alexander</creatorcontrib><creatorcontrib>Tutubalina, Elena</creatorcontrib><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><title>arXiv.org</title><description>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</description><subject>Annotations</subject><subject>Datasets</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNjM0KwjAQhIMgWLTvsOBVId3YH72JRhSkh-K9BIyYUlPtbvX1zcEH8DID33zMSESoVLIsVogTERM1UkrMckxTFYlTqSt93sAWqoHIGQ97w4Ysw8fxHUpLbK9QmkdI7dmxs7SAyraGXecJjA_8bT3TTIxvpiUb_3oq5gd92R2Xz757DeGnbrqh92GqMc3yQq4xSdR_1heYgzrK</recordid><startdate>20210903</startdate><enddate>20210903</enddate><creator>Loukachevitch, Natalia</creator><creator>Artemova, Ekaterina</creator><creator>Batura, Tatiana</creator><creator>Braslavski, Pavel</creator><creator>Denisov, Ilia</creator><creator>Ivanov, Vladimir</creator><creator>Manandhar, Suresh</creator><creator>Pugachev, Alexander</creator><creator>Tutubalina, Elena</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210903</creationdate><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><author>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25678092113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Annotations</topic><topic>Datasets</topic><toplevel>online_resources</toplevel><creatorcontrib>Loukachevitch, Natalia</creatorcontrib><creatorcontrib>Artemova, Ekaterina</creatorcontrib><creatorcontrib>Batura, Tatiana</creatorcontrib><creatorcontrib>Braslavski, Pavel</creatorcontrib><creatorcontrib>Denisov, Ilia</creatorcontrib><creatorcontrib>Ivanov, Vladimir</creatorcontrib><creatorcontrib>Manandhar, Suresh</creatorcontrib><creatorcontrib>Pugachev, Alexander</creatorcontrib><creatorcontrib>Tutubalina, Elena</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Loukachevitch, Natalia</au><au>Artemova, Ekaterina</au><au>Batura, Tatiana</au><au>Braslavski, Pavel</au><au>Denisov, Ilia</au><au>Ivanov, Vladimir</au><au>Manandhar, Suresh</au><au>Pugachev, Alexander</au><au>Tutubalina, Elena</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</atitle><jtitle>arXiv.org</jtitle><date>2021-09-03</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_2567809211
source Free E- Journals
subjects Annotations
Datasets
title NEREL: A Russian Dataset with Nested Named Entities, Relations and Events
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T04%3A22%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=NEREL:%20A%20Russian%20Dataset%20with%20Nested%20Named%20Entities,%20Relations%20and%20Events&rft.jtitle=arXiv.org&rft.au=Loukachevitch,%20Natalia&rft.date=2021-09-03&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2567809211%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2567809211&rft_id=info:pmid/&rfr_iscdi=true