A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL

In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilize...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Wireless personal communications 2019-10, Vol.108 (3), p.1909-1931
Hauptverfasser: Banerjee, Partha Sarathy, Chakraborty, Baisakhi, Tripathi, Deepak, Gupta, Hardik, Kumar, Sourabh S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1931
container_issue 3
container_start_page 1909
container_title Wireless personal communications
container_volume 108
creator Banerjee, Partha Sarathy
Chakraborty, Baisakhi
Tripathi, Deepak
Gupta, Hardik
Kumar, Sourabh S.
description In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.
doi_str_mv 10.1007/s11277-019-06501-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2292138856</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2292138856</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</originalsourceid><addsrcrecordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2292138856</pqid></control><display><type>article</type><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><source>SpringerLink Journals - AutoHoldings</source><creator>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creator><creatorcontrib>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creatorcontrib><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&amp;A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><identifier>ISSN: 0929-6212</identifier><identifier>EISSN: 1572-834X</identifier><identifier>DOI: 10.1007/s11277-019-06501-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Communications Engineering ; Computer Communication Networks ; Data processing ; Engineering ; Information retrieval ; Language ; Natural language ; Natural language processing ; Networks ; Pattern matching ; Pattern recognition ; Query languages ; Query processing ; Questions ; Signal,Image and Speech Processing ; Storage ; Structured Query Language-SQL ; Unstructured data</subject><ispartof>Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</citedby><cites>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</cites><orcidid>0000-0001-6459-5988</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11277-019-06501-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11277-019-06501-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,777,781,27905,27906,41469,42538,51300</link.rule.ids></links><search><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><title>Wireless personal communications</title><addtitle>Wireless Pers Commun</addtitle><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&amp;A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><subject>Algorithms</subject><subject>Communications Engineering</subject><subject>Computer Communication Networks</subject><subject>Data processing</subject><subject>Engineering</subject><subject>Information retrieval</subject><subject>Language</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Networks</subject><subject>Pattern matching</subject><subject>Pattern recognition</subject><subject>Query languages</subject><subject>Query processing</subject><subject>Questions</subject><subject>Signal,Image and Speech Processing</subject><subject>Storage</subject><subject>Structured Query Language-SQL</subject><subject>Unstructured data</subject><issn>0929-6212</issn><issn>1572-834X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Banerjee, Partha Sarathy</creator><creator>Chakraborty, Baisakhi</creator><creator>Tripathi, Deepak</creator><creator>Gupta, Hardik</creator><creator>Kumar, Sourabh S.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></search><sort><creationdate>20191001</creationdate><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><author>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Communications Engineering</topic><topic>Computer Communication Networks</topic><topic>Data processing</topic><topic>Engineering</topic><topic>Information retrieval</topic><topic>Language</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Networks</topic><topic>Pattern matching</topic><topic>Pattern recognition</topic><topic>Query languages</topic><topic>Query processing</topic><topic>Questions</topic><topic>Signal,Image and Speech Processing</topic><topic>Storage</topic><topic>Structured Query Language-SQL</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><collection>CrossRef</collection><jtitle>Wireless personal communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Banerjee, Partha Sarathy</au><au>Chakraborty, Baisakhi</au><au>Tripathi, Deepak</au><au>Gupta, Hardik</au><au>Kumar, Sourabh S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</atitle><jtitle>Wireless personal communications</jtitle><stitle>Wireless Pers Commun</stitle><date>2019-10-01</date><risdate>2019</risdate><volume>108</volume><issue>3</issue><spage>1909</spage><epage>1931</epage><pages>1909-1931</pages><issn>0929-6212</issn><eissn>1572-834X</eissn><abstract>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&amp;A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11277-019-06501-z</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0929-6212
ispartof Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931
issn 0929-6212
1572-834X
language eng
recordid cdi_proquest_journals_2292138856
source SpringerLink Journals - AutoHoldings
subjects Algorithms
Communications Engineering
Computer Communication Networks
Data processing
Engineering
Information retrieval
Language
Natural language
Natural language processing
Networks
Pattern matching
Pattern recognition
Query languages
Query processing
Questions
Signal,Image and Speech Processing
Storage
Structured Query Language-SQL
Unstructured data
title A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A47%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Information%20Retrieval%20Based%20on%20Question%20and%20Answering%20and%20NER%20for%20Unstructured%20Information%20Without%20Using%20SQL&rft.jtitle=Wireless%20personal%20communications&rft.au=Banerjee,%20Partha%20Sarathy&rft.date=2019-10-01&rft.volume=108&rft.issue=3&rft.spage=1909&rft.epage=1931&rft.pages=1909-1931&rft.issn=0929-6212&rft.eissn=1572-834X&rft_id=info:doi/10.1007/s11277-019-06501-z&rft_dat=%3Cproquest_cross%3E2292138856%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2292138856&rft_id=info:pmid/&rfr_iscdi=true