UniMorph 4.0: Universal Morphology

The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annota...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2022-06
Hauptverfasser: Batsuren, Khuyagbaatar, Goldman, Omer, Khalifa, Salam, Habash, Nizar, Kieraś, Witold, Bella, Gábor, Leonard, Brian, Garrett, Nicolai, Gorman, Kyle, Ate, Yustinus Ghanggo, Ryskina, Maria, Mielke, Sabrina J, Budianskaya, Elena, El-Khaissi, Charbel, Pimentel, Tiago, Gasser, Michael, Lane, William, Raj, Mohit, Coler, Matt, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Sagot, Benoît, Rojas, Esaú Zumaeta, Didier López Francis, Oncevay, Arturo, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Ek, Adam, Guriel, David, Dirix, Peter, Bernardy, Jean-Philippe, Scherbakov, Andrey, Bayyr-ool, Aziyana, Anastasopoulos, Antonios, Zariquiey, Roberto, Sheifer, Karina, Ganieva, Sofya, Cruz, Hilaria, Karahóǧa, Ritván, Markantonatou, Stella, Pavlidis, George, Plugaryov, Matvey, Klyachko, Elena, Salehi, Ali, Angulo, Candy, Baxi, Jatayu, Krizhanovsky, Andrew, Krizhanovskaya, Natalia, Salesky, Elizabeth, Vania, Clara, Ivanova, Sardana, White, Jennifer, Rowan Hall Maudslay, Valvoda, Josef, Zmigrod, Ran, Czarnowska, Paula, Nikkarinen, Irene, Salchak, Aelita, Bhatt, Brijesh, Straughn, Christopher, Liu, Zoey, Jonathan North Washington, Pinter, Yuval, Ataman, Duygu, Wolinski, Marcin, Suhardijanto, Totok, Yablonskaya, Anna, Stoehr, Niklas, Dolatian, Hossep, Nuriah, Zahroh, Ratan, Shyam, Tyers, Francis M, Ponti, Edoardo M, Aiton, Grant, Arora, Aryaman, Hatcher, Richard J, Kumar, Ritesh, Young, Jeremiah, Rodionova, Daria, Yemelina, Anastasia, Andrushko, Taras, Marchenko, Igor, Mashkovtseva, Polina, Serova, Alexandra, Prud'hommeaux, Emily, Nepomniashchaya, Maria, Giunchiglia, Fausto, Chodroff, Eleanor, Mans Hulden, Silfverberg, Miikka, McCarthy, Arya D, Yarowsky, David, Cotterell, Ryan, Tsarfaty, Reut, Vylomova, Ekaterina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Batsuren, Khuyagbaatar
Goldman, Omer
Khalifa, Salam
Habash, Nizar
Kieraś, Witold
Bella, Gábor
Leonard, Brian
Garrett, Nicolai
Gorman, Kyle
Ate, Yustinus Ghanggo
Ryskina, Maria
Mielke, Sabrina J
Budianskaya, Elena
El-Khaissi, Charbel
Pimentel, Tiago
Gasser, Michael
Lane, William
Raj, Mohit
Coler, Matt
Jaime Rafael Montoya Samame
Delio Siticonatzi Camaiteri
Sagot, Benoît
Rojas, Esaú Zumaeta
Didier López Francis
Oncevay, Arturo
Juan López Bautista
Gema Celeste Silva Villegas
Lucas Torroba Hennigen
Ek, Adam
Guriel, David
Dirix, Peter
Bernardy, Jean-Philippe
Scherbakov, Andrey
Bayyr-ool, Aziyana
Anastasopoulos, Antonios
Zariquiey, Roberto
Sheifer, Karina
Ganieva, Sofya
Cruz, Hilaria
Karahóǧa, Ritván
Markantonatou, Stella
Pavlidis, George
Plugaryov, Matvey
Klyachko, Elena
Salehi, Ali
Angulo, Candy
Baxi, Jatayu
Krizhanovsky, Andrew
Krizhanovskaya, Natalia
Salesky, Elizabeth
Vania, Clara
Ivanova, Sardana
White, Jennifer
Rowan Hall Maudslay
Valvoda, Josef
Zmigrod, Ran
Czarnowska, Paula
Nikkarinen, Irene
Salchak, Aelita
Bhatt, Brijesh
Straughn, Christopher
Liu, Zoey
Jonathan North Washington
Pinter, Yuval
Ataman, Duygu
Wolinski, Marcin
Suhardijanto, Totok
Yablonskaya, Anna
Stoehr, Niklas
Dolatian, Hossep
Nuriah, Zahroh
Ratan, Shyam
Tyers, Francis M
Ponti, Edoardo M
Aiton, Grant
Arora, Aryaman
Hatcher, Richard J
Kumar, Ritesh
Young, Jeremiah
Rodionova, Daria
Yemelina, Anastasia
Andrushko, Taras
Marchenko, Igor
Mashkovtseva, Polina
Serova, Alexandra
Prud'hommeaux, Emily
Nepomniashchaya, Maria
Giunchiglia, Fausto
Chodroff, Eleanor
Mans Hulden
Silfverberg, Miikka
McCarthy, Arya D
Yarowsky, David
Cotterell, Ryan
Tsarfaty, Reut
Vylomova, Ekaterina
description The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2662167755</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2662167755</sourcerecordid><originalsourceid>FETCH-proquest_journals_26621677553</originalsourceid><addsrcrecordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mRQCs3L9M0vKshQMNEzsFIA8spSi4oTcxTAgvk5-emVPAysaYk5xam8UJqbQdnNNcTZQ7egKL-wNLW4JD4rv7QoDygVb2RmZmRoZm4ONJo4VQAXCy1M</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2662167755</pqid></control><display><type>article</type><title>UniMorph 4.0: Universal Morphology</title><source>Free E- Journals</source><creator>Batsuren, Khuyagbaatar ; Goldman, Omer ; Khalifa, Salam ; Habash, Nizar ; Kieraś, Witold ; Bella, Gábor ; Leonard, Brian ; Garrett, Nicolai ; Gorman, Kyle ; Ate, Yustinus Ghanggo ; Ryskina, Maria ; Mielke, Sabrina J ; Budianskaya, Elena ; El-Khaissi, Charbel ; Pimentel, Tiago ; Gasser, Michael ; Lane, William ; Raj, Mohit ; Coler, Matt ; Jaime Rafael Montoya Samame ; Delio Siticonatzi Camaiteri ; Sagot, Benoît ; Rojas, Esaú Zumaeta ; Didier López Francis ; Oncevay, Arturo ; Juan López Bautista ; Gema Celeste Silva Villegas ; Lucas Torroba Hennigen ; Ek, Adam ; Guriel, David ; Dirix, Peter ; Bernardy, Jean-Philippe ; Scherbakov, Andrey ; Bayyr-ool, Aziyana ; Anastasopoulos, Antonios ; Zariquiey, Roberto ; Sheifer, Karina ; Ganieva, Sofya ; Cruz, Hilaria ; Karahóǧa, Ritván ; Markantonatou, Stella ; Pavlidis, George ; Plugaryov, Matvey ; Klyachko, Elena ; Salehi, Ali ; Angulo, Candy ; Baxi, Jatayu ; Krizhanovsky, Andrew ; Krizhanovskaya, Natalia ; Salesky, Elizabeth ; Vania, Clara ; Ivanova, Sardana ; White, Jennifer ; Rowan Hall Maudslay ; Valvoda, Josef ; Zmigrod, Ran ; Czarnowska, Paula ; Nikkarinen, Irene ; Salchak, Aelita ; Bhatt, Brijesh ; Straughn, Christopher ; Liu, Zoey ; Jonathan North Washington ; Pinter, Yuval ; Ataman, Duygu ; Wolinski, Marcin ; Suhardijanto, Totok ; Yablonskaya, Anna ; Stoehr, Niklas ; Dolatian, Hossep ; Nuriah, Zahroh ; Ratan, Shyam ; Tyers, Francis M ; Ponti, Edoardo M ; Aiton, Grant ; Arora, Aryaman ; Hatcher, Richard J ; Kumar, Ritesh ; Young, Jeremiah ; Rodionova, Daria ; Yemelina, Anastasia ; Andrushko, Taras ; Marchenko, Igor ; Mashkovtseva, Polina ; Serova, Alexandra ; Prud'hommeaux, Emily ; Nepomniashchaya, Maria ; Giunchiglia, Fausto ; Chodroff, Eleanor ; Mans Hulden ; Silfverberg, Miikka ; McCarthy, Arya D ; Yarowsky, David ; Cotterell, Ryan ; Tsarfaty, Reut ; Vylomova, Ekaterina</creator><creatorcontrib>Batsuren, Khuyagbaatar ; Goldman, Omer ; Khalifa, Salam ; Habash, Nizar ; Kieraś, Witold ; Bella, Gábor ; Leonard, Brian ; Garrett, Nicolai ; Gorman, Kyle ; Ate, Yustinus Ghanggo ; Ryskina, Maria ; Mielke, Sabrina J ; Budianskaya, Elena ; El-Khaissi, Charbel ; Pimentel, Tiago ; Gasser, Michael ; Lane, William ; Raj, Mohit ; Coler, Matt ; Jaime Rafael Montoya Samame ; Delio Siticonatzi Camaiteri ; Sagot, Benoît ; Rojas, Esaú Zumaeta ; Didier López Francis ; Oncevay, Arturo ; Juan López Bautista ; Gema Celeste Silva Villegas ; Lucas Torroba Hennigen ; Ek, Adam ; Guriel, David ; Dirix, Peter ; Bernardy, Jean-Philippe ; Scherbakov, Andrey ; Bayyr-ool, Aziyana ; Anastasopoulos, Antonios ; Zariquiey, Roberto ; Sheifer, Karina ; Ganieva, Sofya ; Cruz, Hilaria ; Karahóǧa, Ritván ; Markantonatou, Stella ; Pavlidis, George ; Plugaryov, Matvey ; Klyachko, Elena ; Salehi, Ali ; Angulo, Candy ; Baxi, Jatayu ; Krizhanovsky, Andrew ; Krizhanovskaya, Natalia ; Salesky, Elizabeth ; Vania, Clara ; Ivanova, Sardana ; White, Jennifer ; Rowan Hall Maudslay ; Valvoda, Josef ; Zmigrod, Ran ; Czarnowska, Paula ; Nikkarinen, Irene ; Salchak, Aelita ; Bhatt, Brijesh ; Straughn, Christopher ; Liu, Zoey ; Jonathan North Washington ; Pinter, Yuval ; Ataman, Duygu ; Wolinski, Marcin ; Suhardijanto, Totok ; Yablonskaya, Anna ; Stoehr, Niklas ; Dolatian, Hossep ; Nuriah, Zahroh ; Ratan, Shyam ; Tyers, Francis M ; Ponti, Edoardo M ; Aiton, Grant ; Arora, Aryaman ; Hatcher, Richard J ; Kumar, Ritesh ; Young, Jeremiah ; Rodionova, Daria ; Yemelina, Anastasia ; Andrushko, Taras ; Marchenko, Igor ; Mashkovtseva, Polina ; Serova, Alexandra ; Prud'hommeaux, Emily ; Nepomniashchaya, Maria ; Giunchiglia, Fausto ; Chodroff, Eleanor ; Mans Hulden ; Silfverberg, Miikka ; McCarthy, Arya D ; Yarowsky, David ; Cotterell, Ryan ; Tsarfaty, Reut ; Vylomova, Ekaterina</creatorcontrib><description>The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Collaboration ; Languages ; Morphology ; Segmentation ; Structural hierarchy</subject><ispartof>arXiv.org, 2022-06</ispartof><rights>2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Batsuren, Khuyagbaatar</creatorcontrib><creatorcontrib>Goldman, Omer</creatorcontrib><creatorcontrib>Khalifa, Salam</creatorcontrib><creatorcontrib>Habash, Nizar</creatorcontrib><creatorcontrib>Kieraś, Witold</creatorcontrib><creatorcontrib>Bella, Gábor</creatorcontrib><creatorcontrib>Leonard, Brian</creatorcontrib><creatorcontrib>Garrett, Nicolai</creatorcontrib><creatorcontrib>Gorman, Kyle</creatorcontrib><creatorcontrib>Ate, Yustinus Ghanggo</creatorcontrib><creatorcontrib>Ryskina, Maria</creatorcontrib><creatorcontrib>Mielke, Sabrina J</creatorcontrib><creatorcontrib>Budianskaya, Elena</creatorcontrib><creatorcontrib>El-Khaissi, Charbel</creatorcontrib><creatorcontrib>Pimentel, Tiago</creatorcontrib><creatorcontrib>Gasser, Michael</creatorcontrib><creatorcontrib>Lane, William</creatorcontrib><creatorcontrib>Raj, Mohit</creatorcontrib><creatorcontrib>Coler, Matt</creatorcontrib><creatorcontrib>Jaime Rafael Montoya Samame</creatorcontrib><creatorcontrib>Delio Siticonatzi Camaiteri</creatorcontrib><creatorcontrib>Sagot, Benoît</creatorcontrib><creatorcontrib>Rojas, Esaú Zumaeta</creatorcontrib><creatorcontrib>Didier López Francis</creatorcontrib><creatorcontrib>Oncevay, Arturo</creatorcontrib><creatorcontrib>Juan López Bautista</creatorcontrib><creatorcontrib>Gema Celeste Silva Villegas</creatorcontrib><creatorcontrib>Lucas Torroba Hennigen</creatorcontrib><creatorcontrib>Ek, Adam</creatorcontrib><creatorcontrib>Guriel, David</creatorcontrib><creatorcontrib>Dirix, Peter</creatorcontrib><creatorcontrib>Bernardy, Jean-Philippe</creatorcontrib><creatorcontrib>Scherbakov, Andrey</creatorcontrib><creatorcontrib>Bayyr-ool, Aziyana</creatorcontrib><creatorcontrib>Anastasopoulos, Antonios</creatorcontrib><creatorcontrib>Zariquiey, Roberto</creatorcontrib><creatorcontrib>Sheifer, Karina</creatorcontrib><creatorcontrib>Ganieva, Sofya</creatorcontrib><creatorcontrib>Cruz, Hilaria</creatorcontrib><creatorcontrib>Karahóǧa, Ritván</creatorcontrib><creatorcontrib>Markantonatou, Stella</creatorcontrib><creatorcontrib>Pavlidis, George</creatorcontrib><creatorcontrib>Plugaryov, Matvey</creatorcontrib><creatorcontrib>Klyachko, Elena</creatorcontrib><creatorcontrib>Salehi, Ali</creatorcontrib><creatorcontrib>Angulo, Candy</creatorcontrib><creatorcontrib>Baxi, Jatayu</creatorcontrib><creatorcontrib>Krizhanovsky, Andrew</creatorcontrib><creatorcontrib>Krizhanovskaya, Natalia</creatorcontrib><creatorcontrib>Salesky, Elizabeth</creatorcontrib><creatorcontrib>Vania, Clara</creatorcontrib><creatorcontrib>Ivanova, Sardana</creatorcontrib><creatorcontrib>White, Jennifer</creatorcontrib><creatorcontrib>Rowan Hall Maudslay</creatorcontrib><creatorcontrib>Valvoda, Josef</creatorcontrib><creatorcontrib>Zmigrod, Ran</creatorcontrib><creatorcontrib>Czarnowska, Paula</creatorcontrib><creatorcontrib>Nikkarinen, Irene</creatorcontrib><creatorcontrib>Salchak, Aelita</creatorcontrib><creatorcontrib>Bhatt, Brijesh</creatorcontrib><creatorcontrib>Straughn, Christopher</creatorcontrib><creatorcontrib>Liu, Zoey</creatorcontrib><creatorcontrib>Jonathan North Washington</creatorcontrib><creatorcontrib>Pinter, Yuval</creatorcontrib><creatorcontrib>Ataman, Duygu</creatorcontrib><creatorcontrib>Wolinski, Marcin</creatorcontrib><creatorcontrib>Suhardijanto, Totok</creatorcontrib><creatorcontrib>Yablonskaya, Anna</creatorcontrib><creatorcontrib>Stoehr, Niklas</creatorcontrib><creatorcontrib>Dolatian, Hossep</creatorcontrib><creatorcontrib>Nuriah, Zahroh</creatorcontrib><creatorcontrib>Ratan, Shyam</creatorcontrib><creatorcontrib>Tyers, Francis M</creatorcontrib><creatorcontrib>Ponti, Edoardo M</creatorcontrib><creatorcontrib>Aiton, Grant</creatorcontrib><creatorcontrib>Arora, Aryaman</creatorcontrib><creatorcontrib>Hatcher, Richard J</creatorcontrib><creatorcontrib>Kumar, Ritesh</creatorcontrib><creatorcontrib>Young, Jeremiah</creatorcontrib><creatorcontrib>Rodionova, Daria</creatorcontrib><creatorcontrib>Yemelina, Anastasia</creatorcontrib><creatorcontrib>Andrushko, Taras</creatorcontrib><creatorcontrib>Marchenko, Igor</creatorcontrib><creatorcontrib>Mashkovtseva, Polina</creatorcontrib><creatorcontrib>Serova, Alexandra</creatorcontrib><creatorcontrib>Prud'hommeaux, Emily</creatorcontrib><creatorcontrib>Nepomniashchaya, Maria</creatorcontrib><creatorcontrib>Giunchiglia, Fausto</creatorcontrib><creatorcontrib>Chodroff, Eleanor</creatorcontrib><creatorcontrib>Mans Hulden</creatorcontrib><creatorcontrib>Silfverberg, Miikka</creatorcontrib><creatorcontrib>McCarthy, Arya D</creatorcontrib><creatorcontrib>Yarowsky, David</creatorcontrib><creatorcontrib>Cotterell, Ryan</creatorcontrib><creatorcontrib>Tsarfaty, Reut</creatorcontrib><creatorcontrib>Vylomova, Ekaterina</creatorcontrib><title>UniMorph 4.0: Universal Morphology</title><title>arXiv.org</title><description>The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.</description><subject>Annotations</subject><subject>Collaboration</subject><subject>Languages</subject><subject>Morphology</subject><subject>Segmentation</subject><subject>Structural hierarchy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mRQCs3L9M0vKshQMNEzsFIA8spSi4oTcxTAgvk5-emVPAysaYk5xam8UJqbQdnNNcTZQ7egKL-wNLW4JD4rv7QoDygVb2RmZmRoZm4ONJo4VQAXCy1M</recordid><startdate>20220619</startdate><enddate>20220619</enddate><creator>Batsuren, Khuyagbaatar</creator><creator>Goldman, Omer</creator><creator>Khalifa, Salam</creator><creator>Habash, Nizar</creator><creator>Kieraś, Witold</creator><creator>Bella, Gábor</creator><creator>Leonard, Brian</creator><creator>Garrett, Nicolai</creator><creator>Gorman, Kyle</creator><creator>Ate, Yustinus Ghanggo</creator><creator>Ryskina, Maria</creator><creator>Mielke, Sabrina J</creator><creator>Budianskaya, Elena</creator><creator>El-Khaissi, Charbel</creator><creator>Pimentel, Tiago</creator><creator>Gasser, Michael</creator><creator>Lane, William</creator><creator>Raj, Mohit</creator><creator>Coler, Matt</creator><creator>Jaime Rafael Montoya Samame</creator><creator>Delio Siticonatzi Camaiteri</creator><creator>Sagot, Benoît</creator><creator>Rojas, Esaú Zumaeta</creator><creator>Didier López Francis</creator><creator>Oncevay, Arturo</creator><creator>Juan López Bautista</creator><creator>Gema Celeste Silva Villegas</creator><creator>Lucas Torroba Hennigen</creator><creator>Ek, Adam</creator><creator>Guriel, David</creator><creator>Dirix, Peter</creator><creator>Bernardy, Jean-Philippe</creator><creator>Scherbakov, Andrey</creator><creator>Bayyr-ool, Aziyana</creator><creator>Anastasopoulos, Antonios</creator><creator>Zariquiey, Roberto</creator><creator>Sheifer, Karina</creator><creator>Ganieva, Sofya</creator><creator>Cruz, Hilaria</creator><creator>Karahóǧa, Ritván</creator><creator>Markantonatou, Stella</creator><creator>Pavlidis, George</creator><creator>Plugaryov, Matvey</creator><creator>Klyachko, Elena</creator><creator>Salehi, Ali</creator><creator>Angulo, Candy</creator><creator>Baxi, Jatayu</creator><creator>Krizhanovsky, Andrew</creator><creator>Krizhanovskaya, Natalia</creator><creator>Salesky, Elizabeth</creator><creator>Vania, Clara</creator><creator>Ivanova, Sardana</creator><creator>White, Jennifer</creator><creator>Rowan Hall Maudslay</creator><creator>Valvoda, Josef</creator><creator>Zmigrod, Ran</creator><creator>Czarnowska, Paula</creator><creator>Nikkarinen, Irene</creator><creator>Salchak, Aelita</creator><creator>Bhatt, Brijesh</creator><creator>Straughn, Christopher</creator><creator>Liu, Zoey</creator><creator>Jonathan North Washington</creator><creator>Pinter, Yuval</creator><creator>Ataman, Duygu</creator><creator>Wolinski, Marcin</creator><creator>Suhardijanto, Totok</creator><creator>Yablonskaya, Anna</creator><creator>Stoehr, Niklas</creator><creator>Dolatian, Hossep</creator><creator>Nuriah, Zahroh</creator><creator>Ratan, Shyam</creator><creator>Tyers, Francis M</creator><creator>Ponti, Edoardo M</creator><creator>Aiton, Grant</creator><creator>Arora, Aryaman</creator><creator>Hatcher, Richard J</creator><creator>Kumar, Ritesh</creator><creator>Young, Jeremiah</creator><creator>Rodionova, Daria</creator><creator>Yemelina, Anastasia</creator><creator>Andrushko, Taras</creator><creator>Marchenko, Igor</creator><creator>Mashkovtseva, Polina</creator><creator>Serova, Alexandra</creator><creator>Prud'hommeaux, Emily</creator><creator>Nepomniashchaya, Maria</creator><creator>Giunchiglia, Fausto</creator><creator>Chodroff, Eleanor</creator><creator>Mans Hulden</creator><creator>Silfverberg, Miikka</creator><creator>McCarthy, Arya D</creator><creator>Yarowsky, David</creator><creator>Cotterell, Ryan</creator><creator>Tsarfaty, Reut</creator><creator>Vylomova, Ekaterina</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220619</creationdate><title>UniMorph 4.0: Universal Morphology</title><author>Batsuren, Khuyagbaatar ; Goldman, Omer ; Khalifa, Salam ; Habash, Nizar ; Kieraś, Witold ; Bella, Gábor ; Leonard, Brian ; Garrett, Nicolai ; Gorman, Kyle ; Ate, Yustinus Ghanggo ; Ryskina, Maria ; Mielke, Sabrina J ; Budianskaya, Elena ; El-Khaissi, Charbel ; Pimentel, Tiago ; Gasser, Michael ; Lane, William ; Raj, Mohit ; Coler, Matt ; Jaime Rafael Montoya Samame ; Delio Siticonatzi Camaiteri ; Sagot, Benoît ; Rojas, Esaú Zumaeta ; Didier López Francis ; Oncevay, Arturo ; Juan López Bautista ; Gema Celeste Silva Villegas ; Lucas Torroba Hennigen ; Ek, Adam ; Guriel, David ; Dirix, Peter ; Bernardy, Jean-Philippe ; Scherbakov, Andrey ; Bayyr-ool, Aziyana ; Anastasopoulos, Antonios ; Zariquiey, Roberto ; Sheifer, Karina ; Ganieva, Sofya ; Cruz, Hilaria ; Karahóǧa, Ritván ; Markantonatou, Stella ; Pavlidis, George ; Plugaryov, Matvey ; Klyachko, Elena ; Salehi, Ali ; Angulo, Candy ; Baxi, Jatayu ; Krizhanovsky, Andrew ; Krizhanovskaya, Natalia ; Salesky, Elizabeth ; Vania, Clara ; Ivanova, Sardana ; White, Jennifer ; Rowan Hall Maudslay ; Valvoda, Josef ; Zmigrod, Ran ; Czarnowska, Paula ; Nikkarinen, Irene ; Salchak, Aelita ; Bhatt, Brijesh ; Straughn, Christopher ; Liu, Zoey ; Jonathan North Washington ; Pinter, Yuval ; Ataman, Duygu ; Wolinski, Marcin ; Suhardijanto, Totok ; Yablonskaya, Anna ; Stoehr, Niklas ; Dolatian, Hossep ; Nuriah, Zahroh ; Ratan, Shyam ; Tyers, Francis M ; Ponti, Edoardo M ; Aiton, Grant ; Arora, Aryaman ; Hatcher, Richard J ; Kumar, Ritesh ; Young, Jeremiah ; Rodionova, Daria ; Yemelina, Anastasia ; Andrushko, Taras ; Marchenko, Igor ; Mashkovtseva, Polina ; Serova, Alexandra ; Prud'hommeaux, Emily ; Nepomniashchaya, Maria ; Giunchiglia, Fausto ; Chodroff, Eleanor ; Mans Hulden ; Silfverberg, Miikka ; McCarthy, Arya D ; Yarowsky, David ; Cotterell, Ryan ; Tsarfaty, Reut ; Vylomova, Ekaterina</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26621677553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Annotations</topic><topic>Collaboration</topic><topic>Languages</topic><topic>Morphology</topic><topic>Segmentation</topic><topic>Structural hierarchy</topic><toplevel>online_resources</toplevel><creatorcontrib>Batsuren, Khuyagbaatar</creatorcontrib><creatorcontrib>Goldman, Omer</creatorcontrib><creatorcontrib>Khalifa, Salam</creatorcontrib><creatorcontrib>Habash, Nizar</creatorcontrib><creatorcontrib>Kieraś, Witold</creatorcontrib><creatorcontrib>Bella, Gábor</creatorcontrib><creatorcontrib>Leonard, Brian</creatorcontrib><creatorcontrib>Garrett, Nicolai</creatorcontrib><creatorcontrib>Gorman, Kyle</creatorcontrib><creatorcontrib>Ate, Yustinus Ghanggo</creatorcontrib><creatorcontrib>Ryskina, Maria</creatorcontrib><creatorcontrib>Mielke, Sabrina J</creatorcontrib><creatorcontrib>Budianskaya, Elena</creatorcontrib><creatorcontrib>El-Khaissi, Charbel</creatorcontrib><creatorcontrib>Pimentel, Tiago</creatorcontrib><creatorcontrib>Gasser, Michael</creatorcontrib><creatorcontrib>Lane, William</creatorcontrib><creatorcontrib>Raj, Mohit</creatorcontrib><creatorcontrib>Coler, Matt</creatorcontrib><creatorcontrib>Jaime Rafael Montoya Samame</creatorcontrib><creatorcontrib>Delio Siticonatzi Camaiteri</creatorcontrib><creatorcontrib>Sagot, Benoît</creatorcontrib><creatorcontrib>Rojas, Esaú Zumaeta</creatorcontrib><creatorcontrib>Didier López Francis</creatorcontrib><creatorcontrib>Oncevay, Arturo</creatorcontrib><creatorcontrib>Juan López Bautista</creatorcontrib><creatorcontrib>Gema Celeste Silva Villegas</creatorcontrib><creatorcontrib>Lucas Torroba Hennigen</creatorcontrib><creatorcontrib>Ek, Adam</creatorcontrib><creatorcontrib>Guriel, David</creatorcontrib><creatorcontrib>Dirix, Peter</creatorcontrib><creatorcontrib>Bernardy, Jean-Philippe</creatorcontrib><creatorcontrib>Scherbakov, Andrey</creatorcontrib><creatorcontrib>Bayyr-ool, Aziyana</creatorcontrib><creatorcontrib>Anastasopoulos, Antonios</creatorcontrib><creatorcontrib>Zariquiey, Roberto</creatorcontrib><creatorcontrib>Sheifer, Karina</creatorcontrib><creatorcontrib>Ganieva, Sofya</creatorcontrib><creatorcontrib>Cruz, Hilaria</creatorcontrib><creatorcontrib>Karahóǧa, Ritván</creatorcontrib><creatorcontrib>Markantonatou, Stella</creatorcontrib><creatorcontrib>Pavlidis, George</creatorcontrib><creatorcontrib>Plugaryov, Matvey</creatorcontrib><creatorcontrib>Klyachko, Elena</creatorcontrib><creatorcontrib>Salehi, Ali</creatorcontrib><creatorcontrib>Angulo, Candy</creatorcontrib><creatorcontrib>Baxi, Jatayu</creatorcontrib><creatorcontrib>Krizhanovsky, Andrew</creatorcontrib><creatorcontrib>Krizhanovskaya, Natalia</creatorcontrib><creatorcontrib>Salesky, Elizabeth</creatorcontrib><creatorcontrib>Vania, Clara</creatorcontrib><creatorcontrib>Ivanova, Sardana</creatorcontrib><creatorcontrib>White, Jennifer</creatorcontrib><creatorcontrib>Rowan Hall Maudslay</creatorcontrib><creatorcontrib>Valvoda, Josef</creatorcontrib><creatorcontrib>Zmigrod, Ran</creatorcontrib><creatorcontrib>Czarnowska, Paula</creatorcontrib><creatorcontrib>Nikkarinen, Irene</creatorcontrib><creatorcontrib>Salchak, Aelita</creatorcontrib><creatorcontrib>Bhatt, Brijesh</creatorcontrib><creatorcontrib>Straughn, Christopher</creatorcontrib><creatorcontrib>Liu, Zoey</creatorcontrib><creatorcontrib>Jonathan North Washington</creatorcontrib><creatorcontrib>Pinter, Yuval</creatorcontrib><creatorcontrib>Ataman, Duygu</creatorcontrib><creatorcontrib>Wolinski, Marcin</creatorcontrib><creatorcontrib>Suhardijanto, Totok</creatorcontrib><creatorcontrib>Yablonskaya, Anna</creatorcontrib><creatorcontrib>Stoehr, Niklas</creatorcontrib><creatorcontrib>Dolatian, Hossep</creatorcontrib><creatorcontrib>Nuriah, Zahroh</creatorcontrib><creatorcontrib>Ratan, Shyam</creatorcontrib><creatorcontrib>Tyers, Francis M</creatorcontrib><creatorcontrib>Ponti, Edoardo M</creatorcontrib><creatorcontrib>Aiton, Grant</creatorcontrib><creatorcontrib>Arora, Aryaman</creatorcontrib><creatorcontrib>Hatcher, Richard J</creatorcontrib><creatorcontrib>Kumar, Ritesh</creatorcontrib><creatorcontrib>Young, Jeremiah</creatorcontrib><creatorcontrib>Rodionova, Daria</creatorcontrib><creatorcontrib>Yemelina, Anastasia</creatorcontrib><creatorcontrib>Andrushko, Taras</creatorcontrib><creatorcontrib>Marchenko, Igor</creatorcontrib><creatorcontrib>Mashkovtseva, Polina</creatorcontrib><creatorcontrib>Serova, Alexandra</creatorcontrib><creatorcontrib>Prud'hommeaux, Emily</creatorcontrib><creatorcontrib>Nepomniashchaya, Maria</creatorcontrib><creatorcontrib>Giunchiglia, Fausto</creatorcontrib><creatorcontrib>Chodroff, Eleanor</creatorcontrib><creatorcontrib>Mans Hulden</creatorcontrib><creatorcontrib>Silfverberg, Miikka</creatorcontrib><creatorcontrib>McCarthy, Arya D</creatorcontrib><creatorcontrib>Yarowsky, David</creatorcontrib><creatorcontrib>Cotterell, Ryan</creatorcontrib><creatorcontrib>Tsarfaty, Reut</creatorcontrib><creatorcontrib>Vylomova, Ekaterina</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Batsuren, Khuyagbaatar</au><au>Goldman, Omer</au><au>Khalifa, Salam</au><au>Habash, Nizar</au><au>Kieraś, Witold</au><au>Bella, Gábor</au><au>Leonard, Brian</au><au>Garrett, Nicolai</au><au>Gorman, Kyle</au><au>Ate, Yustinus Ghanggo</au><au>Ryskina, Maria</au><au>Mielke, Sabrina J</au><au>Budianskaya, Elena</au><au>El-Khaissi, Charbel</au><au>Pimentel, Tiago</au><au>Gasser, Michael</au><au>Lane, William</au><au>Raj, Mohit</au><au>Coler, Matt</au><au>Jaime Rafael Montoya Samame</au><au>Delio Siticonatzi Camaiteri</au><au>Sagot, Benoît</au><au>Rojas, Esaú Zumaeta</au><au>Didier López Francis</au><au>Oncevay, Arturo</au><au>Juan López Bautista</au><au>Gema Celeste Silva Villegas</au><au>Lucas Torroba Hennigen</au><au>Ek, Adam</au><au>Guriel, David</au><au>Dirix, Peter</au><au>Bernardy, Jean-Philippe</au><au>Scherbakov, Andrey</au><au>Bayyr-ool, Aziyana</au><au>Anastasopoulos, Antonios</au><au>Zariquiey, Roberto</au><au>Sheifer, Karina</au><au>Ganieva, Sofya</au><au>Cruz, Hilaria</au><au>Karahóǧa, Ritván</au><au>Markantonatou, Stella</au><au>Pavlidis, George</au><au>Plugaryov, Matvey</au><au>Klyachko, Elena</au><au>Salehi, Ali</au><au>Angulo, Candy</au><au>Baxi, Jatayu</au><au>Krizhanovsky, Andrew</au><au>Krizhanovskaya, Natalia</au><au>Salesky, Elizabeth</au><au>Vania, Clara</au><au>Ivanova, Sardana</au><au>White, Jennifer</au><au>Rowan Hall Maudslay</au><au>Valvoda, Josef</au><au>Zmigrod, Ran</au><au>Czarnowska, Paula</au><au>Nikkarinen, Irene</au><au>Salchak, Aelita</au><au>Bhatt, Brijesh</au><au>Straughn, Christopher</au><au>Liu, Zoey</au><au>Jonathan North Washington</au><au>Pinter, Yuval</au><au>Ataman, Duygu</au><au>Wolinski, Marcin</au><au>Suhardijanto, Totok</au><au>Yablonskaya, Anna</au><au>Stoehr, Niklas</au><au>Dolatian, Hossep</au><au>Nuriah, Zahroh</au><au>Ratan, Shyam</au><au>Tyers, Francis M</au><au>Ponti, Edoardo M</au><au>Aiton, Grant</au><au>Arora, Aryaman</au><au>Hatcher, Richard J</au><au>Kumar, Ritesh</au><au>Young, Jeremiah</au><au>Rodionova, Daria</au><au>Yemelina, Anastasia</au><au>Andrushko, Taras</au><au>Marchenko, Igor</au><au>Mashkovtseva, Polina</au><au>Serova, Alexandra</au><au>Prud'hommeaux, Emily</au><au>Nepomniashchaya, Maria</au><au>Giunchiglia, Fausto</au><au>Chodroff, Eleanor</au><au>Mans Hulden</au><au>Silfverberg, Miikka</au><au>McCarthy, Arya D</au><au>Yarowsky, David</au><au>Cotterell, Ryan</au><au>Tsarfaty, Reut</au><au>Vylomova, Ekaterina</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>UniMorph 4.0: Universal Morphology</atitle><jtitle>arXiv.org</jtitle><date>2022-06-19</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_2662167755
source Free E- Journals
subjects Annotations
Collaboration
Languages
Morphology
Segmentation
Structural hierarchy
title UniMorph 4.0: Universal Morphology
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T14%3A39%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=UniMorph%204.0:%20Universal%20Morphology&rft.jtitle=arXiv.org&rft.au=Batsuren,%20Khuyagbaatar&rft.date=2022-06-19&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2662167755%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2662167755&rft_id=info:pmid/&rfr_iscdi=true