UniMorph 4.0: Universal Morphology
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annota...
Published in: | arXiv.org 2022-06 |
---|---|
Main authors: | Batsuren, Khuyagbaatar, et al. |
Format: | Article |
Language: | eng |
Subjects: | Annotations; Collaboration; Languages; Morphology; Segmentation; Structural hierarchy |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Batsuren, Khuyagbaatar; Goldman, Omer; Khalifa, Salam; Habash, Nizar; Kieraś, Witold; Bella, Gábor; Leonard, Brian; Garrett, Nicolai; Gorman, Kyle; Ate, Yustinus Ghanggo; Ryskina, Maria; Mielke, Sabrina J; Budianskaya, Elena; El-Khaissi, Charbel; Pimentel, Tiago; Gasser, Michael; Lane, William; Raj, Mohit; Coler, Matt; Jaime Rafael Montoya Samame; Delio Siticonatzi Camaiteri; Sagot, Benoît; Rojas, Esaú Zumaeta; Didier López Francis; Oncevay, Arturo; Juan López Bautista; Gema Celeste Silva Villegas; Lucas Torroba Hennigen; Ek, Adam; Guriel, David; Dirix, Peter; Bernardy, Jean-Philippe; Scherbakov, Andrey; Bayyr-ool, Aziyana; Anastasopoulos, Antonios; Zariquiey, Roberto; Sheifer, Karina; Ganieva, Sofya; Cruz, Hilaria; Karahóǧa, Ritván; Markantonatou, Stella; Pavlidis, George; Plugaryov, Matvey; Klyachko, Elena; Salehi, Ali; Angulo, Candy; Baxi, Jatayu; Krizhanovsky, Andrew; Krizhanovskaya, Natalia; Salesky, Elizabeth; Vania, Clara; Ivanova, Sardana; White, Jennifer; Rowan Hall Maudslay; Valvoda, Josef; Zmigrod, Ran; Czarnowska, Paula; Nikkarinen, Irene; Salchak, Aelita; Bhatt, Brijesh; Straughn, Christopher; Liu, Zoey; Jonathan North Washington; Pinter, Yuval; Ataman, Duygu; Wolinski, Marcin; Suhardijanto, Totok; Yablonskaya, Anna; Stoehr, Niklas; Dolatian, Hossep; Nuriah, Zahroh; Ratan, Shyam; Tyers, Francis M; Ponti, Edoardo M; Aiton, Grant; Arora, Aryaman; Hatcher, Richard J; Kumar, Ritesh; Young, Jeremiah; Rodionova, Daria; Yemelina, Anastasia; Andrushko, Taras; Marchenko, Igor; Mashkovtseva, Polina; Serova, Alexandra; Prud'hommeaux, Emily; Nepomniashchaya, Maria; Giunchiglia, Fausto; Chodroff, Eleanor; Mans Hulden; Silfverberg, Miikka; McCarthy, Arya D; Yarowsky, David; Cotterell, Ryan; Tsarfaty, Reut; Vylomova, Ekaterina |
description | The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet. |
format | Article |
publisher | Ithaca: Cornell University Library, arXiv.org |
date | 2022-06-19 |
rights | 2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
oa | free_for_read |
pqid | 2662167755 |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2022-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2662167755 |
source | Free E- Journals |
subjects | Annotations; Collaboration; Languages; Morphology; Segmentation; Structural hierarchy |
title | UniMorph 4.0: Universal Morphology |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T14%3A39%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=UniMorph%204.0:%20Universal%20Morphology&rft.jtitle=arXiv.org&rft.au=Batsuren,%20Khuyagbaatar&rft.date=2022-06-19&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2662167755%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2662167755&rft_id=info:pmid/&rfr_iscdi=true |