A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL

In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilize...

Bibliographic details
Published in:	Wireless personal communications 2019-10, Vol.108 (3), p.1909-1931
Main authors:	Banerjee, Partha Sarathy; Chakraborty, Baisakhi; Tripathi, Deepak; Gupta, Hardik; Kumar, Sourabh S.
Format:	Article
Language:	eng
Subjects:	Algorithms; Communications Engineering; Computer Communication Networks; Data processing; Engineering; Information retrieval; Language; Natural language; Natural language processing; Networks; Pattern matching; Pattern recognition; Query languages; Query processing; Questions; Signal, Image and Speech Processing; Storage; Structured Query Language-SQL; Unstructured data
Online access:	Full text

container_end_page	1931
container_issue	3
container_start_page	1909
container_title	Wireless personal communications
container_volume	108
creator	Banerjee, Partha Sarathy; Chakraborty, Baisakhi; Tripathi, Deepak; Gupta, Hardik; Kumar, Sourabh S.
description	Unstructured data is now available in abundance, and most of it arrives as natural language text. For any defense establishment, spy data or other sensitive information is most useful when it can be extracted efficiently and easily. The proposed model applies wherever the influx of text-heavy, unstructured data is high, such as information from the World Wide Web, documents related to a particular domain, or any other source in which the information takes the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts information as natural language text, processes it, and lets the user retrieve information by posing questions in natural language. NLIIRS answers these questions with factoid or phrase-based answers. Compared with conventional question answering systems, NLIIRS combines the advantages of named entity recognition and a sequential pattern matching based answer search technique. The proposed technique avoids the use of Structured Query Language (SQL) at the back end for information processing, storage and extraction: with NLIIRS, user queries need not be converted to SQL statements, and the unstructured text need not be stored in relational tables. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction of answers to the queries becomes more concise and faster. The whole system has been designed on the natural language tool kit of Stanford University, which helped us to generate part-of-speech tags, tokenize the data, and form tree structures. The novel text processing algorithm uses the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. An advantage of this system is that it does not need training. The system enables the user to retrieve any information of their choice from the available unstructured information.
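
The idea in the description, answering factoid questions straight from tagged text without SQL or relational tables, can be illustrated with a minimal sketch. This is not the authors' NLIIRS algorithm: it uses only the Python standard library, a toy gazetteer lookup stands in for a real named entity recognizer such as the ne_chunker named above, and the answer search is a simple keyword overlap combined with an entity-type filter rather than the paper's sequential pattern matching. All names here (`QUESTION_TO_LABEL`, `tag_entities`, `answer`, the sample text and gazetteer) are hypothetical.

```python
import re

# Hypothetical mapping from the question word to the entity label an answer must carry.
QUESTION_TO_LABEL = {"who": "PERSON", "where": "LOCATION", "when": "DATE"}

# Words ignored when comparing a question with a sentence.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "at", "to",
             "did", "does", "is", "was", "who", "where", "when"}


def tokenize(text):
    """Split text into alphabetic and numeric tokens (stand-in for a real tokenizer)."""
    return re.findall(r"[A-Za-z]+|\d+", text)


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tag_entities(tokens, gazetteer):
    """Label tokens: four-digit numbers as DATE, gazetteer hits by their label, else 'O'."""
    tagged = []
    for tok in tokens:
        if re.fullmatch(r"\d{4}", tok):
            label = "DATE"
        else:
            label = gazetteer.get(tok, "O")
        tagged.append((tok, label))
    return tagged


def answer(question, text, gazetteer):
    """Return the first entity of the expected type from the sentence that shares
    the most keywords with the question, or None if nothing matches."""
    q_tokens = tokenize(question)
    if not q_tokens:
        return None
    wanted = QUESTION_TO_LABEL.get(q_tokens[0].lower())
    if wanted is None:
        return None
    keywords = {t.lower() for t in q_tokens} - STOPWORDS
    best, best_score = None, 0
    for sentence in split_sentences(text):
        tagged = tag_entities(tokenize(sentence), gazetteer)
        words = {tok.lower() for tok, _ in tagged}
        score = len(keywords & words)
        # Entities already named in the question cannot be the answer.
        candidates = [tok for tok, label in tagged
                      if label == wanted and tok.lower() not in keywords]
        if candidates and score > best_score:
            best, best_score = candidates[0], score
    return best


# Illustrative data only; not taken from the article.
SAMPLE_TEXT = "Alice visited Paris in 2019. Bob moved to Berlin in 2021."
SAMPLE_GAZETTEER = {"Alice": "PERSON", "Bob": "PERSON",
                    "Paris": "LOCATION", "Berlin": "LOCATION"}

print(answer("Who visited Paris?", SAMPLE_TEXT, SAMPLE_GAZETTEER))           # Alice
print(answer("Where did Bob move?", SAMPLE_TEXT, SAMPLE_GAZETTEER))          # Berlin
print(answer("When did Alice visit Paris?", SAMPLE_TEXT, SAMPLE_GAZETTEER))  # 2019
```

In the second query, "move" does not match "moved" in the text, so only "Bob" contributes to the score. A stemmer or lemmatizer, as mentioned in the description, would let both forms match, which is presumably why the described system normalizes the text before the answer search.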
doi_str_mv	10.1007/s11277-019-06501-z
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2292138856</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2292138856</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</originalsourceid><addsrcrecordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2292138856</pqid></control><display><type>article</type><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><source>SpringerLink Journals - AutoHoldings</source><creator>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creator><creatorcontrib>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creatorcontrib><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. 
The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. 
This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><identifier>ISSN: 0929-6212</identifier><identifier>EISSN: 1572-834X</identifier><identifier>DOI: 10.1007/s11277-019-06501-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Communications Engineering ; Computer Communication Networks ; Data processing ; Engineering ; Information retrieval ; Language ; Natural language ; Natural language processing ; Networks ; Pattern matching ; Pattern recognition ; Query languages ; Query processing ; Questions ; Signal,Image and Speech Processing ; Storage ; Structured Query Language-SQL ; Unstructured data</subject><ispartof>Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</citedby><cites>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</cites><orcidid>0000-0001-6459-5988</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11277-019-06501-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11277-019-06501-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,777,781,27905,27906,41469,42538,51300</link.rule.ids></links><search><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, 
Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><title>Wireless personal communications</title><addtitle>Wireless Pers Commun</addtitle><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. 
By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><subject>Algorithms</subject><subject>Communications Engineering</subject><subject>Computer Communication Networks</subject><subject>Data processing</subject><subject>Engineering</subject><subject>Information retrieval</subject><subject>Language</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Networks</subject><subject>Pattern matching</subject><subject>Pattern recognition</subject><subject>Query languages</subject><subject>Query processing</subject><subject>Questions</subject><subject>Signal,Image and Speech Processing</subject><subject>Storage</subject><subject>Structured Query Language-SQL</subject><subject>Unstructured 
data</subject><issn>0929-6212</issn><issn>1572-834X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Banerjee, Partha Sarathy</creator><creator>Chakraborty, Baisakhi</creator><creator>Tripathi, Deepak</creator><creator>Gupta, Hardik</creator><creator>Kumar, Sourabh S.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></search><sort><creationdate>20191001</creationdate><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><author>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Communications Engineering</topic><topic>Computer Communication Networks</topic><topic>Data processing</topic><topic>Engineering</topic><topic>Information retrieval</topic><topic>Language</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Networks</topic><topic>Pattern matching</topic><topic>Pattern recognition</topic><topic>Query 
languages</topic><topic>Query processing</topic><topic>Questions</topic><topic>Signal,Image and Speech Processing</topic><topic>Storage</topic><topic>Structured Query Language-SQL</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><collection>CrossRef</collection><jtitle>Wireless personal communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Banerjee, Partha Sarathy</au><au>Chakraborty, Baisakhi</au><au>Tripathi, Deepak</au><au>Gupta, Hardik</au><au>Kumar, Sourabh S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</atitle><jtitle>Wireless personal communications</jtitle><stitle>Wireless Pers Commun</stitle><date>2019-10-01</date><risdate>2019</risdate><volume>108</volume><issue>3</issue><spage>1909</spage><epage>1931</epage><pages>1909-1931</pages><issn>0929-6212</issn><eissn>1572-834X</eissn><abstract>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. 
The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11277-019-06501-z</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0929-6212
ispartof	Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931
issn	0929-6212 1572-834X
language	eng
recordid	cdi_proquest_journals_2292138856
source	SpringerLink Journals - AutoHoldings
subjects	Algorithms; Communications Engineering; Computer Communication Networks; Data processing; Engineering; Information retrieval; Language; Natural language; Natural language processing; Networks; Pattern matching; Pattern recognition; Query languages; Query processing; Questions; Signal, Image and Speech Processing; Storage; Structured Query Language-SQL; Unstructured data
title	A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A47%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Information%20Retrieval%20Based%20on%20Question%20and%20Answering%20and%20NER%20for%20Unstructured%20Information%20Without%20Using%20SQL&rft.jtitle=Wireless%20personal%20communications&rft.au=Banerjee,%20Partha%20Sarathy&rft.date=2019-10-01&rft.volume=108&rft.issue=3&rft.spage=1909&rft.epage=1931&rft.pages=1909-1931&rft.issn=0929-6212&rft.eissn=1572-834X&rft_id=info:doi/10.1007/s11277-019-06501-z&rft_dat=%3Cproquest_cross%3E2292138856%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2292138856&rft_id=info:pmid/&rfr_iscdi=true