NEREL: A Russian Dataset with Nested Named Entities, Relations and Events
In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is anno...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2021-09 |
---|---|
Hauptverfasser: | , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Loukachevitch, Natalia Artemova, Ekaterina Batura, Tatiana Braslavski, Pavel Denisov, Ilia Ivanov, Vladimir Manandhar, Suresh Pugachev, Alexander Tutubalina, Elena |
description | In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2567809211</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2567809211</sourcerecordid><originalsourceid>FETCH-proquest_journals_25678092113</originalsourceid><addsrcrecordid>eNqNjM0KwjAQhIMgWLTvsOBVId3YH72JRhSkh-K9BIyYUlPtbvX1zcEH8DID33zMSESoVLIsVogTERM1UkrMckxTFYlTqSt93sAWqoHIGQ97w4Ysw8fxHUpLbK9QmkdI7dmxs7SAyraGXecJjA_8bT3TTIxvpiUb_3oq5gd92R2Xz757DeGnbrqh92GqMc3yQq4xSdR_1heYgzrK</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2567809211</pqid></control><display><type>article</type><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><source>Free E- Journals</source><creator>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</creator><creatorcontrib>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</creatorcontrib><description>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Datasets</subject><ispartof>arXiv.org, 2021-09</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Loukachevitch, Natalia</creatorcontrib><creatorcontrib>Artemova, Ekaterina</creatorcontrib><creatorcontrib>Batura, Tatiana</creatorcontrib><creatorcontrib>Braslavski, Pavel</creatorcontrib><creatorcontrib>Denisov, Ilia</creatorcontrib><creatorcontrib>Ivanov, Vladimir</creatorcontrib><creatorcontrib>Manandhar, Suresh</creatorcontrib><creatorcontrib>Pugachev, Alexander</creatorcontrib><creatorcontrib>Tutubalina, Elena</creatorcontrib><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><title>arXiv.org</title><description>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</description><subject>Annotations</subject><subject>Datasets</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNjM0KwjAQhIMgWLTvsOBVId3YH72JRhSkh-K9BIyYUlPtbvX1zcEH8DID33zMSESoVLIsVogTERM1UkrMckxTFYlTqSt93sAWqoHIGQ97w4Ysw8fxHUpLbK9QmkdI7dmxs7SAyraGXecJjA_8bT3TTIxvpiUb_3oq5gd92R2Xz757DeGnbrqh92GqMc3yQq4xSdR_1heYgzrK</recordid><startdate>20210903</startdate><enddate>20210903</enddate><creator>Loukachevitch, Natalia</creator><creator>Artemova, Ekaterina</creator><creator>Batura, Tatiana</creator><creator>Braslavski, Pavel</creator><creator>Denisov, Ilia</creator><creator>Ivanov, Vladimir</creator><creator>Manandhar, Suresh</creator><creator>Pugachev, Alexander</creator><creator>Tutubalina, Elena</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210903</creationdate><title>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</title><author>Loukachevitch, Natalia ; Artemova, Ekaterina ; Batura, Tatiana ; Braslavski, Pavel ; Denisov, Ilia ; Ivanov, Vladimir ; Manandhar, Suresh ; Pugachev, Alexander ; Tutubalina, Elena</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25678092113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Annotations</topic><topic>Datasets</topic><toplevel>online_resources</toplevel><creatorcontrib>Loukachevitch, Natalia</creatorcontrib><creatorcontrib>Artemova, Ekaterina</creatorcontrib><creatorcontrib>Batura, Tatiana</creatorcontrib><creatorcontrib>Braslavski, Pavel</creatorcontrib><creatorcontrib>Denisov, Ilia</creatorcontrib><creatorcontrib>Ivanov, Vladimir</creatorcontrib><creatorcontrib>Manandhar, Suresh</creatorcontrib><creatorcontrib>Pugachev, Alexander</creatorcontrib><creatorcontrib>Tutubalina, Elena</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Loukachevitch, Natalia</au><au>Artemova, Ekaterina</au><au>Batura, Tatiana</au><au>Braslavski, Pavel</au><au>Denisov, Ilia</au><au>Ivanov, Vladimir</au><au>Manandhar, Suresh</au><au>Pugachev, Alexander</au><au>Tutubalina, Elena</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>NEREL: A Russian Dataset with Nested Named Entities, Relations and Events</atitle><jtitle>arXiv.org</jtitle><date>2021-09-03</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction. NEREL is significantly larger than existing Russian datasets: to date it contains 56K annotated named entities and 39K annotated relations. Its important difference from previous datasets is annotation of nested named entities, as well as relations within nested entities and at the discourse level. NEREL can facilitate development of novel models that can extract relations between nested named entities, as well as relations on both sentence and document levels. NEREL also contains the annotation of events involving named entities and their roles in the events. The NEREL collection is available via https://github.com/nerel-ds/NEREL.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2567809211 |
source | Free E- Journals |
subjects | Annotations Datasets |
title | NEREL: A Russian Dataset with Nested Named Entities, Relations and Events |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T04%3A22%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=NEREL:%20A%20Russian%20Dataset%20with%20Nested%20Named%20Entities,%20Relations%20and%20Events&rft.jtitle=arXiv.org&rft.au=Loukachevitch,%20Natalia&rft.date=2021-09-03&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2567809211%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2567809211&rft_id=info:pmid/&rfr_iscdi=true |