NLI-GSC: A Natural Language Interface for Generating SourceCode
There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs t...
Gespeichert in:
Veröffentlicht in: | International journal of advanced computer science & applications 2022, Vol.13 (1) |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | International journal of advanced computer science & applications |
container_volume | 13 |
creator | Ansari, Aaqib Ahmed R.H. Vora, Deepali R. |
description | There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset. |
doi_str_mv | 10.14569/IJACSA.2022.0130198 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2652931008</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2652931008</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-c079a8f588354f5b5f5f706f86f81bd02e3d06653aea0ef66bd8efdfbaddc0793</originalsourceid><addsrcrecordid>eNotkE9LxDAQxYMouKz7DTwEPHedJE2aepFSdK2U9VAFbyFtJssua7um7cFvb_fPMDBzeG_e8CPknsGSxVKlj8V7llfZkgPnS2ACWKqvyIwzqSIpE7g-7TpikHzfkkXf72AqkXKlxYw8r8siWlX5E83o2g5jsHta2nYz2g3Soh0weNsg9V2gK2wx2GHbbmjVjaHBvHN4R2683fe4uMw5-Xp9-czfovJjVeRZGTU8iYeogSS12kuthYy9rKWXPgHl9dSsdsBROFBKCosW0CtVO43e-do6d_SKOXk43z2E7nfEfjC76Yd2ijRcSZ4KBqAnVXxWNaHr-4DeHML2x4Y_w8CcaJkzLXOkZS60xD-gYlvy</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2652931008</pqid></control><display><type>article</type><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</creator><creatorcontrib>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</creatorcontrib><description>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2022.0130198</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Datasets ; Extensible Markup Language ; Natural language ; Natural language processing ; Programming languages ; Uniqueness</subject><ispartof>International journal of advanced computer science & applications, 2022, Vol.13 (1)</ispartof><rights>2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids></links><search><creatorcontrib>Ansari, Aaqib Ahmed R.H.</creatorcontrib><creatorcontrib>Vora, Deepali R.</creatorcontrib><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><title>International journal of advanced computer science & applications</title><description>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</description><subject>Datasets</subject><subject>Extensible Markup Language</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Programming languages</subject><subject>Uniqueness</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotkE9LxDAQxYMouKz7DTwEPHedJE2aepFSdK2U9VAFbyFtJssua7um7cFvb_fPMDBzeG_e8CPknsGSxVKlj8V7llfZkgPnS2ACWKqvyIwzqSIpE7g-7TpikHzfkkXf72AqkXKlxYw8r8siWlX5E83o2g5jsHta2nYz2g3Soh0weNsg9V2gK2wx2GHbbmjVjaHBvHN4R2683fe4uMw5-Xp9-czfovJjVeRZGTU8iYeogSS12kuthYy9rKWXPgHl9dSsdsBROFBKCosW0CtVO43e-do6d_SKOXk43z2E7nfEfjC76Yd2ijRcSZ4KBqAnVXxWNaHr-4DeHML2x4Y_w8CcaJkzLXOkZS60xD-gYlvy</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Ansari, Aaqib Ahmed R.H.</creator><creator>Vora, Deepali R.</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2022</creationdate><title>NLI-GSC: A Natural Language Interface for Generating SourceCode</title><author>Ansari, Aaqib Ahmed R.H. ; Vora, Deepali R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-c079a8f588354f5b5f5f706f86f81bd02e3d06653aea0ef66bd8efdfbaddc0793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Datasets</topic><topic>Extensible Markup Language</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Programming languages</topic><topic>Uniqueness</topic><toplevel>online_resources</toplevel><creatorcontrib>Ansari, Aaqib Ahmed R.H.</creatorcontrib><creatorcontrib>Vora, Deepali R.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science & applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ansari, Aaqib Ahmed R.H.</au><au>Vora, Deepali R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>NLI-GSC: A Natural Language Interface for Generating SourceCode</atitle><jtitle>International journal of advanced computer science & applications</jtitle><date>2022</date><risdate>2022</risdate><volume>13</volume><issue>1</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>There are many different programming languages and each programming language has its own structure or way of writing the code, it becomes difficult to learn and frequently switch between different programming languages. Due to this reason, a person working with multiple programming languages needs to look at documentations frequently which costs time and effort. In the past few years, there have been significant increase in the amount of papers published on this topic, each providing a unique solution to this problem. Many of these papers are based on applying NLP concepts in unique configuration to get the desired results. Some have used AI along with NLP to train the system to generate source-code in specific language, and some have trained the AI directly without pre-processing the dataset with NLP. All of these papers face two problems: a lack of proper dataset for this particular application and each paper can convent natural language into only one specified programming language source-code. This proposed system shows that a language independent solution is a feasible alternate for writing source-code without having full knowledge about a programming language. The proposed system uses Natural Lan-guage Processing to convert Natural Language into programming language-independent pseudo code using custom Named Entity Recognition and save it in XML (eXtensible Markup Language) format which is an intermediate step. Then, using traditional programming, this system converts the generated pseudo code into programming language-dependent source-code. In this paper, another novel method has been proposed to create dataset from scratch using predefined structure that is filled with predefined keywords creating unique combination of training dataset.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2022.0130198</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2158-107X |
ispartof | International journal of advanced computer science & applications, 2022, Vol.13 (1) |
issn | 2158-107X 2156-5570 |
language | eng |
recordid | cdi_proquest_journals_2652931008 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Datasets Extensible Markup Language Natural language Natural language processing Programming languages Uniqueness |
title | NLI-GSC: A Natural Language Interface for Generating SourceCode |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T07%3A56%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=NLI-GSC:%20A%20Natural%20Language%20Interface%20for%20Generating%20SourceCode&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Ansari,%20Aaqib%20Ahmed%20R.H.&rft.date=2022&rft.volume=13&rft.issue=1&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2022.0130198&rft_dat=%3Cproquest_cross%3E2652931008%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2652931008&rft_id=info:pmid/&rfr_iscdi=true |