MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms

We introduce a large-scale dataset of math word problems and an interpretable neural math problem solver that learns to map problems to operation programs. Due to annotation challenges, current datasets in this domain have been either relatively small in scale or did not offer precise operational an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Amini, Aida, Gabriel, Saadia, Lin, Peter, Koncel-Kedziorski, Rik, Choi, Yejin, Hajishirzi, Hannaneh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Amini, Aida
Gabriel, Saadia
Lin, Peter
Koncel-Kedziorski, Rik
Choi, Yejin
Hajishirzi, Hannaneh
description We introduce a large-scale dataset of math word problems and an interpretable neural math problem solver that learns to map problems to operation programs. Due to annotation challenges, current datasets in this domain have been either relatively small in scale or did not offer precise operational annotations over diverse problem types. We introduce a new representation language to model precise operation programs corresponding to each math problem that aim to improve both the performance and the interpretability of the learned models. Using this representation language, our new dataset, MathQA, significantly enhances the AQuA dataset with fully-specified operational programs. We additionally introduce a neural sequence-to-program model enhanced with automatic problem categorization. Our experiments show improvements over competitive baselines in our MathQA as well as the AQuA dataset. The results are still significantly lower than human performance indicating that the dataset poses new challenges for future research. Our dataset is available at: https://math-qa.github.io/math-QA/
doi_str_mv 10.48550/arxiv.1905.13319
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1905_13319</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1905_13319</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-c603b3c8738cf6b4b7c171eb66ed0e57cdd26362fe3de5b5004cb9a7c4f3fecf3</originalsourceid><addsrcrecordid>eNotj9FKwzAYRnPjhUwfwCvzAq3J0iStd3M4HUymWNhl-ZP80UDblLRs-vZuc1cffAcOHELuOMuLUkr2AOkn7HNeMZlzIXh1TXZvMH1_LB5pHQ-Q3EjX_YRpSDiBaZGeKN3F5Oh7isejo5-x3Yf-ix7CkWwHTDCF2GdPMKKjq5g6aMPYjTfkykM74u1lZ6RePdfL12yzfVkvF5sMlK4yq5gwwpZalNYrUxhtueZolELHUGrr3FwJNfcoHEojGSusqUDbwguP1osZuf_XnsuaIYUO0m9zKmzOheIPu0lNHw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms</title><source>arXiv.org</source><creator>Amini, Aida ; Gabriel, Saadia ; Lin, Peter ; Koncel-Kedziorski, Rik ; Choi, Yejin ; Hajishirzi, Hannaneh</creator><creatorcontrib>Amini, Aida ; Gabriel, Saadia ; Lin, Peter ; Koncel-Kedziorski, Rik ; Choi, Yejin ; Hajishirzi, Hannaneh</creatorcontrib><description>We introduce a large-scale dataset of math word problems and an interpretable neural math problem solver that learns to map problems to operation programs. Due to annotation challenges, current datasets in this domain have been either relatively small in scale or did not offer precise operational annotations over diverse problem types. We introduce a new representation language to model precise operation programs corresponding to each math problem that aim to improve both the performance and the interpretability of the learned models. Using this representation language, our new dataset, MathQA, significantly enhances the AQuA dataset with fully-specified operational programs. We additionally introduce a neural sequence-to-program model enhanced with automatic problem categorization. Our experiments show improvements over competitive baselines in our MathQA as well as the AQuA dataset. The results are still significantly lower than human performance indicating that the dataset poses new challenges for future research. Our dataset is available at: https://math-qa.github.io/math-QA/</description><identifier>DOI: 10.48550/arxiv.1905.13319</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2019-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1905.13319$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1905.13319$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Amini, Aida</creatorcontrib><creatorcontrib>Gabriel, Saadia</creatorcontrib><creatorcontrib>Lin, Peter</creatorcontrib><creatorcontrib>Koncel-Kedziorski, Rik</creatorcontrib><creatorcontrib>Choi, Yejin</creatorcontrib><creatorcontrib>Hajishirzi, Hannaneh</creatorcontrib><title>MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms</title><description>We introduce a large-scale dataset of math word problems and an interpretable neural math problem solver that learns to map problems to operation programs. Due to annotation challenges, current datasets in this domain have been either relatively small in scale or did not offer precise operational annotations over diverse problem types. We introduce a new representation language to model precise operation programs corresponding to each math problem that aim to improve both the performance and the interpretability of the learned models. Using this representation language, our new dataset, MathQA, significantly enhances the AQuA dataset with fully-specified operational programs. We additionally introduce a neural sequence-to-program model enhanced with automatic problem categorization. Our experiments show improvements over competitive baselines in our MathQA as well as the AQuA dataset. The results are still significantly lower than human performance indicating that the dataset poses new challenges for future research. Our dataset is available at: https://math-qa.github.io/math-QA/</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj9FKwzAYRnPjhUwfwCvzAq3J0iStd3M4HUymWNhl-ZP80UDblLRs-vZuc1cffAcOHELuOMuLUkr2AOkn7HNeMZlzIXh1TXZvMH1_LB5pHQ-Q3EjX_YRpSDiBaZGeKN3F5Oh7isejo5-x3Yf-ix7CkWwHTDCF2GdPMKKjq5g6aMPYjTfkykM74u1lZ6RePdfL12yzfVkvF5sMlK4yq5gwwpZalNYrUxhtueZolELHUGrr3FwJNfcoHEojGSusqUDbwguP1osZuf_XnsuaIYUO0m9zKmzOheIPu0lNHw</recordid><startdate>20190530</startdate><enddate>20190530</enddate><creator>Amini, Aida</creator><creator>Gabriel, Saadia</creator><creator>Lin, Peter</creator><creator>Koncel-Kedziorski, Rik</creator><creator>Choi, Yejin</creator><creator>Hajishirzi, Hannaneh</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190530</creationdate><title>MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms</title><author>Amini, Aida ; Gabriel, Saadia ; Lin, Peter ; Koncel-Kedziorski, Rik ; Choi, Yejin ; Hajishirzi, Hannaneh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-c603b3c8738cf6b4b7c171eb66ed0e57cdd26362fe3de5b5004cb9a7c4f3fecf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Amini, Aida</creatorcontrib><creatorcontrib>Gabriel, Saadia</creatorcontrib><creatorcontrib>Lin, Peter</creatorcontrib><creatorcontrib>Koncel-Kedziorski, Rik</creatorcontrib><creatorcontrib>Choi, Yejin</creatorcontrib><creatorcontrib>Hajishirzi, Hannaneh</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Amini, Aida</au><au>Gabriel, Saadia</au><au>Lin, Peter</au><au>Koncel-Kedziorski, Rik</au><au>Choi, Yejin</au><au>Hajishirzi, Hannaneh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms</atitle><date>2019-05-30</date><risdate>2019</risdate><abstract>We introduce a large-scale dataset of math word problems and an interpretable neural math problem solver that learns to map problems to operation programs. Due to annotation challenges, current datasets in this domain have been either relatively small in scale or did not offer precise operational annotations over diverse problem types. We introduce a new representation language to model precise operation programs corresponding to each math problem that aim to improve both the performance and the interpretability of the learned models. Using this representation language, our new dataset, MathQA, significantly enhances the AQuA dataset with fully-specified operational programs. We additionally introduce a neural sequence-to-program model enhanced with automatic problem categorization. Our experiments show improvements over competitive baselines in our MathQA as well as the AQuA dataset. The results are still significantly lower than human performance indicating that the dataset poses new challenges for future research. Our dataset is available at: https://math-qa.github.io/math-QA/</abstract><doi>10.48550/arxiv.1905.13319</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1905.13319
ispartof
issn
language eng
recordid cdi_arxiv_primary_1905_13319
source arXiv.org
subjects Computer Science - Computation and Language
title MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T10%3A16%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MathQA:%20Towards%20Interpretable%20Math%20Word%20Problem%20Solving%20with%20Operation-Based%20Formalisms&rft.au=Amini,%20Aida&rft.date=2019-05-30&rft_id=info:doi/10.48550/arxiv.1905.13319&rft_dat=%3Carxiv_GOX%3E1905_13319%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true