A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL
In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilize...
Gespeichert in:
Veröffentlicht in: | Wireless personal communications 2019-10, Vol.108 (3), p.1909-1931 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1931 |
---|---|
container_issue | 3 |
container_start_page | 1909 |
container_title | Wireless personal communications |
container_volume | 108 |
creator | Banerjee, Partha Sarathy Chakraborty, Baisakhi Tripathi, Deepak Gupta, Hardik Kumar, Sourabh S. |
description | In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information. |
doi_str_mv | 10.1007/s11277-019-06501-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2292138856</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2292138856</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</originalsourceid><addsrcrecordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2292138856</pqid></control><display><type>article</type><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><source>SpringerLink Journals - AutoHoldings</source><creator>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creator><creatorcontrib>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</creatorcontrib><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><identifier>ISSN: 0929-6212</identifier><identifier>EISSN: 1572-834X</identifier><identifier>DOI: 10.1007/s11277-019-06501-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Communications Engineering ; Computer Communication Networks ; Data processing ; Engineering ; Information retrieval ; Language ; Natural language ; Natural language processing ; Networks ; Pattern matching ; Pattern recognition ; Query languages ; Query processing ; Questions ; Signal,Image and Speech Processing ; Storage ; Structured Query Language-SQL ; Unstructured data</subject><ispartof>Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</citedby><cites>FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</cites><orcidid>0000-0001-6459-5988</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11277-019-06501-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11277-019-06501-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,777,781,27905,27906,41469,42538,51300</link.rule.ids></links><search><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><title>Wireless personal communications</title><addtitle>Wireless Pers Commun</addtitle><description>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</description><subject>Algorithms</subject><subject>Communications Engineering</subject><subject>Computer Communication Networks</subject><subject>Data processing</subject><subject>Engineering</subject><subject>Information retrieval</subject><subject>Language</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Networks</subject><subject>Pattern matching</subject><subject>Pattern recognition</subject><subject>Query languages</subject><subject>Query processing</subject><subject>Questions</subject><subject>Signal,Image and Speech Processing</subject><subject>Storage</subject><subject>Structured Query Language-SQL</subject><subject>Unstructured data</subject><issn>0929-6212</issn><issn>1572-834X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOAyEUhonRxFp9AVckrlEuc4Flbao2aTStNrojDMPoNC1TgdHYp5fpmOjK1QmH8_0HPgDOCb4kGOdXnhCa5wgTgXCWYoJ2B2BA0pwizpKXQzDAggqUUUKPwYn3K4wjJugA2BGc2qpxGxXqxsKFCa42H2oNr5U3JYyteWv8_k7ZEo6s_zSutq_70_1kASMLl9YH1-rQuoj8jXuuw1vTBrj0HfI4n52Co0qtvTn7qUOwvJk8je_Q7OF2Oh7NkGZEBJTwkmjFGaakoAkxRXwsLxJVMZVixosyzRTPS5qYnKWFKTTDWBuqE82N4IKxIbjoc7euee8-IFdN62xcKSkVlDDO0yxO0X5Ku8Z7Zyq5dfVGuS9JsOy8yt6rjF7l3qvcRYj1kN92Ioz7jf6H-gYgQHwc</recordid><startdate>20191001</startdate><enddate>20191001</enddate><creator>Banerjee, Partha Sarathy</creator><creator>Chakraborty, Baisakhi</creator><creator>Tripathi, Deepak</creator><creator>Gupta, Hardik</creator><creator>Kumar, Sourabh S.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></search><sort><creationdate>20191001</creationdate><title>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</title><author>Banerjee, Partha Sarathy ; Chakraborty, Baisakhi ; Tripathi, Deepak ; Gupta, Hardik ; Kumar, Sourabh S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-48d1ca83021b241eb0098b4af3a5038bd56a87d24e735bebc300ce2c4c8e98933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Communications Engineering</topic><topic>Computer Communication Networks</topic><topic>Data processing</topic><topic>Engineering</topic><topic>Information retrieval</topic><topic>Language</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Networks</topic><topic>Pattern matching</topic><topic>Pattern recognition</topic><topic>Query languages</topic><topic>Query processing</topic><topic>Questions</topic><topic>Signal,Image and Speech Processing</topic><topic>Storage</topic><topic>Structured Query Language-SQL</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Banerjee, Partha Sarathy</creatorcontrib><creatorcontrib>Chakraborty, Baisakhi</creatorcontrib><creatorcontrib>Tripathi, Deepak</creatorcontrib><creatorcontrib>Gupta, Hardik</creatorcontrib><creatorcontrib>Kumar, Sourabh S.</creatorcontrib><collection>CrossRef</collection><jtitle>Wireless personal communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Banerjee, Partha Sarathy</au><au>Chakraborty, Baisakhi</au><au>Tripathi, Deepak</au><au>Gupta, Hardik</au><au>Kumar, Sourabh S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL</atitle><jtitle>Wireless personal communications</jtitle><stitle>Wireless Pers Commun</stitle><date>2019-10-01</date><risdate>2019</risdate><volume>108</volume><issue>3</issue><spage>1909</spage><epage>1931</epage><pages>1909-1931</pages><issn>0929-6212</issn><eissn>1572-834X</eissn><abstract>In today’s world, the availability of information in the form of unstructured data is in abundance. The unstructured information received is more often than not in the form of natural language text. For any defense establishment, the spy data or any sensitive information received may be best utilized when the information can be extracted efficiently and easily. The proposed model is applicable wherever the influx of text-heavy (unstructured data) is high like the information from the world wide web, documents related to a particular domain, or any other source where the information is in the form of natural language. The proposed Natural Language Information Interpretation and Representation System (NLIIRS) accepts the information in the form of natural language text, processes the information and allows the user to retrieve information by rendering questions in natural language. The questions thus asked by the user are responded by NLIIRS in the form of factoid or phrase based answers. In comparison to the conventional question and answering systems the proposed NLIIRS uses the advantages of both named entity recognition as well as sequential pattern matching based answer search technique. The proposed technique helps us to avoid the use of structured query language (SQL) at the back-end for information processing, storage and extraction. The conversion of user query to SQL statements and also storing the unstructured text in the form of relation tables can be avoided by using NLIIRS. By using this approach in our novel text processing algorithm, after every execution step, the pattern matching and extraction process of the answers to the queries becomes concise and faster. The whole system has been designed on natural language tool kit of Stanford University which helped us to generate parts of speech tag, tokenize the data, and forming tree structure. The novel text processing algorithm utilizes the lemmatizer, stemmer and ne_chunker to prepare the text for information retrieval via Q&A. The advantage of this system is that it does not need training. This system will enable the user to retrieve any information of his/her choice from the available unstructured information.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11277-019-06501-z</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0001-6459-5988</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0929-6212 |
ispartof | Wireless personal communications, 2019-10, Vol.108 (3), p.1909-1931 |
issn | 0929-6212 1572-834X |
language | eng |
recordid | cdi_proquest_journals_2292138856 |
source | SpringerLink Journals - AutoHoldings |
subjects | Algorithms Communications Engineering Computer Communication Networks Data processing Engineering Information retrieval Language Natural language Natural language processing Networks Pattern matching Pattern recognition Query languages Query processing Questions Signal,Image and Speech Processing Storage Structured Query Language-SQL Unstructured data |
title | A Information Retrieval Based on Question and Answering and NER for Unstructured Information Without Using SQL |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A47%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Information%20Retrieval%20Based%20on%20Question%20and%20Answering%20and%20NER%20for%20Unstructured%20Information%20Without%20Using%20SQL&rft.jtitle=Wireless%20personal%20communications&rft.au=Banerjee,%20Partha%20Sarathy&rft.date=2019-10-01&rft.volume=108&rft.issue=3&rft.spage=1909&rft.epage=1931&rft.pages=1909-1931&rft.issn=0929-6212&rft.eissn=1572-834X&rft_id=info:doi/10.1007/s11277-019-06501-z&rft_dat=%3Cproquest_cross%3E2292138856%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2292138856&rft_id=info:pmid/&rfr_iscdi=true |