Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text
Scientific models hold the key to better understanding and predicting the behavior of complex systems. The most comprehensive manifestation of a scientific model, including crucial assumptions and parameters that underpin its usability, is usually embedded in associated source code and documentation...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Mulwad, Varish Crapo, Andrew Kumar, Vijay S Jobin, James Gabaldon, Alfredo Virani, Nurali Dixit, Sharad Joshi, Narendra |
description | Scientific models hold the key to better understanding and predicting the
behavior of complex systems. The most comprehensive manifestation of a
scientific model, including crucial assumptions and parameters that underpin
its usability, is usually embedded in associated source code and documentation,
which may employ a variety of (potentially outdated) programming practices and
languages. Domain experts cannot gain a complete understanding of the
implementation of a scientific model if they are not familiar with the code.
Furthermore, rapid research and development iterations make it challenging to
keep up with constantly evolving scientific model codebases. To address these
challenges, we develop a system for the automated creation and human-assisted
curation of a knowledge graph of computable scientific models that analyzes a
model's code in the context of any associated inline comments and external
documentation. Our system uses knowledge-driven as well as data-driven
approaches to identify and extract relevant concepts from code and equations
from textual documents to semantically annotate models using domain
terminology. These models are converted into executable Python functions and
then can further be composed into complex workflows to answer different forms
of domain-driven questions. We present experimental results obtained using a
dataset of code and associated text derived from NASA's Hypersonic Aerodynamics
website. |
doi_str_mv | 10.48550/arxiv.2202.13739 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2202_13739</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2202_13739</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-8c1e6a84d382eb6cada699e04eda9532593ef1b93ad79a14ce8285e2c208e6233</originalsourceid><addsrcrecordid>eNotj71OwzAURr10QIUHYMIvkBDb-bHHKioUqYiB7NGNfS1ZiuPKdlB5eyBl-obz6UiHkEdWlbVsmuoZ4tV9lZxXvGSiE-qOwGHNwUNGQ_uIkF1YKCyGnlYPSwEpubSxNd5YsLQP_rJmmGakn9rhkp11mr4Hg3OiNgb_-zC4WQa85nuyszAnfPjfPRlejkN_Ks4fr2_94VxA26lCaoYtyNoIyXFqNRholcKqRgOqEbxRAi2blADTKWC1Rsllg1zzSmLLhdiTp5t2axwv0XmI3-Nf67i1ih9Bu0_L</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text</title><source>arXiv.org</source><creator>Mulwad, Varish ; Crapo, Andrew ; Kumar, Vijay S ; Jobin, James ; Gabaldon, Alfredo ; Virani, Nurali ; Dixit, Sharad ; Joshi, Narendra</creator><creatorcontrib>Mulwad, Varish ; Crapo, Andrew ; Kumar, Vijay S ; Jobin, James ; Gabaldon, Alfredo ; Virani, Nurali ; Dixit, Sharad ; Joshi, Narendra</creatorcontrib><description>Scientific models hold the key to better understanding and predicting the
behavior of complex systems. The most comprehensive manifestation of a
scientific model, including crucial assumptions and parameters that underpin
its usability, is usually embedded in associated source code and documentation,
which may employ a variety of (potentially outdated) programming practices and
languages. Domain experts cannot gain a complete understanding of the
implementation of a scientific model if they are not familiar with the code.
Furthermore, rapid research and development iterations make it challenging to
keep up with constantly evolving scientific model codebases. To address these
challenges, we develop a system for the automated creation and human-assisted
curation of a knowledge graph of computable scientific models that analyzes a
model's code in the context of any associated inline comments and external
documentation. Our system uses knowledge-driven as well as data-driven
approaches to identify and extract relevant concepts from code and equations
from textual documents to semantically annotate models using domain
terminology. These models are converted into executable Python functions and
then can further be composed into complex workflows to answer different forms
of domain-driven questions. We present experimental results obtained using a
dataset of code and associated text derived from NASA's Hypersonic Aerodynamics
website.</description><identifier>DOI: 10.48550/arxiv.2202.13739</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Software Engineering</subject><creationdate>2022-01</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2202.13739$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2202.13739$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Mulwad, Varish</creatorcontrib><creatorcontrib>Crapo, Andrew</creatorcontrib><creatorcontrib>Kumar, Vijay S</creatorcontrib><creatorcontrib>Jobin, James</creatorcontrib><creatorcontrib>Gabaldon, Alfredo</creatorcontrib><creatorcontrib>Virani, Nurali</creatorcontrib><creatorcontrib>Dixit, Sharad</creatorcontrib><creatorcontrib>Joshi, Narendra</creatorcontrib><title>Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text</title><description>Scientific models hold the key to better understanding and predicting the
behavior of complex systems. The most comprehensive manifestation of a
scientific model, including crucial assumptions and parameters that underpin
its usability, is usually embedded in associated source code and documentation,
which may employ a variety of (potentially outdated) programming practices and
languages. Domain experts cannot gain a complete understanding of the
implementation of a scientific model if they are not familiar with the code.
Furthermore, rapid research and development iterations make it challenging to
keep up with constantly evolving scientific model codebases. To address these
challenges, we develop a system for the automated creation and human-assisted
curation of a knowledge graph of computable scientific models that analyzes a
model's code in the context of any associated inline comments and external
documentation. Our system uses knowledge-driven as well as data-driven
approaches to identify and extract relevant concepts from code and equations
from textual documents to semantically annotate models using domain
terminology. These models are converted into executable Python functions and
then can further be composed into complex workflows to answer different forms
of domain-driven questions. We present experimental results obtained using a
dataset of code and associated text derived from NASA's Hypersonic Aerodynamics
website.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Software Engineering</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj71OwzAURr10QIUHYMIvkBDb-bHHKioUqYiB7NGNfS1ZiuPKdlB5eyBl-obz6UiHkEdWlbVsmuoZ4tV9lZxXvGSiE-qOwGHNwUNGQ_uIkF1YKCyGnlYPSwEpubSxNd5YsLQP_rJmmGakn9rhkp11mr4Hg3OiNgb_-zC4WQa85nuyszAnfPjfPRlejkN_Ks4fr2_94VxA26lCaoYtyNoIyXFqNRholcKqRgOqEbxRAi2blADTKWC1Rsllg1zzSmLLhdiTp5t2axwv0XmI3-Nf67i1ih9Bu0_L</recordid><startdate>20220128</startdate><enddate>20220128</enddate><creator>Mulwad, Varish</creator><creator>Crapo, Andrew</creator><creator>Kumar, Vijay S</creator><creator>Jobin, James</creator><creator>Gabaldon, Alfredo</creator><creator>Virani, Nurali</creator><creator>Dixit, Sharad</creator><creator>Joshi, Narendra</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220128</creationdate><title>Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text</title><author>Mulwad, Varish ; Crapo, Andrew ; Kumar, Vijay S ; Jobin, James ; Gabaldon, Alfredo ; Virani, Nurali ; Dixit, Sharad ; Joshi, Narendra</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-8c1e6a84d382eb6cada699e04eda9532593ef1b93ad79a14ce8285e2c208e6233</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Software Engineering</topic><toplevel>online_resources</toplevel><creatorcontrib>Mulwad, Varish</creatorcontrib><creatorcontrib>Crapo, Andrew</creatorcontrib><creatorcontrib>Kumar, Vijay S</creatorcontrib><creatorcontrib>Jobin, James</creatorcontrib><creatorcontrib>Gabaldon, Alfredo</creatorcontrib><creatorcontrib>Virani, Nurali</creatorcontrib><creatorcontrib>Dixit, Sharad</creatorcontrib><creatorcontrib>Joshi, Narendra</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mulwad, Varish</au><au>Crapo, Andrew</au><au>Kumar, Vijay S</au><au>Jobin, James</au><au>Gabaldon, Alfredo</au><au>Virani, Nurali</au><au>Dixit, Sharad</au><au>Joshi, Narendra</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text</atitle><date>2022-01-28</date><risdate>2022</risdate><abstract>Scientific models hold the key to better understanding and predicting the
behavior of complex systems. The most comprehensive manifestation of a
scientific model, including crucial assumptions and parameters that underpin
its usability, is usually embedded in associated source code and documentation,
which may employ a variety of (potentially outdated) programming practices and
languages. Domain experts cannot gain a complete understanding of the
implementation of a scientific model if they are not familiar with the code.
Furthermore, rapid research and development iterations make it challenging to
keep up with constantly evolving scientific model codebases. To address these
challenges, we develop a system for the automated creation and human-assisted
curation of a knowledge graph of computable scientific models that analyzes a
model's code in the context of any associated inline comments and external
documentation. Our system uses knowledge-driven as well as data-driven
approaches to identify and extract relevant concepts from code and equations
from textual documents to semantically annotate models using domain
terminology. These models are converted into executable Python functions and
then can further be composed into complex workflows to answer different forms
of domain-driven questions. We present experimental results obtained using a
dataset of code and associated text derived from NASA's Hypersonic Aerodynamics
website.</abstract><doi>10.48550/arxiv.2202.13739</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2202.13739 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2202_13739 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence Computer Science - Computation and Language Computer Science - Software Engineering |
title | Automated Creation and Human-assisted Curation of Computable Scientific Models from Code and Text |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-16T22%3A15%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automated%20Creation%20and%20Human-assisted%20Curation%20of%20Computable%20Scientific%20Models%20from%20Code%20and%20Text&rft.au=Mulwad,%20Varish&rft.date=2022-01-28&rft_id=info:doi/10.48550/arxiv.2202.13739&rft_dat=%3Carxiv_GOX%3E2202_13739%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |