NLI-GSC: A Natural Language Interface for Generating SourceCode

There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs t...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of advanced computer science & applications 2022, Vol.13 (1)
Hauptverfasser:	Ansari, Aaqib Ahmed R.H., Vora, Deepali R.
Format:	Artikel
Sprache:	eng
Schlagworte:	Datasets Extensible Markup Language Natural language Natural language processing Programming languages Uniqueness
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page
container_title	International journal of advanced computer science & applications
container_volume	13
creator	Ansari, Aaqib Ahmed R.H. Vora, Deepali R.
description	There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.
doi_str_mv	10.14569/IJACSA.2022.0130198
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2652931008</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2652931008</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-c079a8f588354f5b5f5f706f86f81bd02e3d06653aea0ef66bd8efdfbaddc0793</originalsourceid><addsrcrecordid>eNotkE9LxDAQxYMouKz7DTwEPHedJE2aepFSdK2U9VAFbyFtJssua7um7cFvb_fPMDBzeG_e8CPknsGSxVKlj8V7llfZkgPnS2ACWKqvyIwzqSIpE7g-7TpikHzfkkXf72AqkXKlxYw8r8siWlX5E83o2g5jsHta2nYz2g3Soh0weNsg9V2gK2wx2GHbbmjVjaHBvHN4R2683fe4uMw5-Xp9-czfovJjVeRZGTU8iYeogSS12kuthYy9rKWXPgHl9dSsdsBROFBKCosW0CtVO43e-do6d_SKOXk43z2E7nfEfjC76Yd2ijRcSZ4KBqAnVXxWNaHr-4DeHML2x4Y_w8CcaJkzLXOkZS60xD-gYlvy</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2652931008</pqid></control><display><type>article</type><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</creator><creatorcontrib>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</creatorcontrib><description>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2022.0130198</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Datasets ; Extensible Markup Language ; Natural language ; Natural language processing ; Programming languages ; Uniqueness</subject><ispartof>International journal of advanced computer science & applications, 2022, Vol.13 (1)</ispartof><rights>2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids></links><search><creatorcontrib>Ansari, Aaqib Ahmed R.H.</creatorcontrib><creatorcontrib>Vora, Deepali R.</creatorcontrib><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><title>International journal of advanced computer science & applications</title><description>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</description><subject>Datasets</subject><subject>Extensible Markup Language</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Programming languages</subject><subject>Uniqueness</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotkE9LxDAQxYMouKz7DTwEPHedJE2aepFSdK2U9VAFbyFtJssua7um7cFvb_fPMDBzeG_e8CPknsGSxVKlj8V7llfZkgPnS2ACWKqvyIwzqSIpE7g-7TpikHzfkkXf72AqkXKlxYw8r8siWlX5E83o2g5jsHta2nYz2g3Soh0weNsg9V2gK2wx2GHbbmjVjaHBvHN4R2683fe4uMw5-Xp9-czfovJjVeRZGTU8iYeogSS12kuthYy9rKWXPgHl9dSsdsBROFBKCosW0CtVO43e-do6d_SKOXk43z2E7nfEfjC76Yd2ijRcSZ4KBqAnVXxWNaHr-4DeHML2x4Y_w8CcaJkzLXOkZS60xD-gYlvy</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Ansari, Aaqib Ahmed R.H.</creator><creator>Vora, Deepali R.</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2022</creationdate><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><author>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-c079a8f588354f5b5f5f706f86f81bd02e3d06653aea0ef66bd8efdfbaddc0793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Datasets</topic><topic>Extensible Markup Language</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Programming languages</topic><topic>Uniqueness</topic><toplevel>online_resources</toplevel><creatorcontrib>Ansari, Aaqib Ahmed R.H.</creatorcontrib><creatorcontrib>Vora, Deepali R.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science & applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ansari, Aaqib Ahmed R.H.</au><au>Vora, Deepali R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>NLI-GSC: A Natural Language Interface for Generating SourceCode</atitle><jtitle>International journal of advanced computer science & applications</jtitle><date>2022</date><risdate>2022</risdate><volume>13</volume><issue>1</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2022.0130198</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2158-107X
ispartof	International journal of advanced computer science & applications, 2022, Vol.13 (1)
issn	2158-107X 2156-5570
language	eng
recordid	cdi_proquest_journals_2652931008
source	EZB-FREE-00999 freely available EZB journals
subjects	Datasets Extensible Markup Language Natural language Natural language processing Programming languages Uniqueness
title	NLI-GSC: A Natural Language Interface for Generating SourceCode
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T07%3A56%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=NLI-GSC:%20A%20Natural%20Language%20Interface%20for%20Generating%20SourceCode&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Ansari,%20Aaqib%20Ahmed%20R.H.&rft.date=2022&rft.volume=13&rft.issue=1&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2022.0130198&rft_dat=%3Cproquest_cross%3E2652931008%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2652931008&rft_id=info:pmid/&rfr_iscdi=true