Generative adversarial networks for open information extraction
Open information extraction (Open IE) is a core task of natural language processing (NLP). Even many efforts have been made in this area, and there are still many problems that need to be tackled. Conventional Open IE approaches use a set of handcrafted patterns to extract relational tuples from the...
Gespeichert in:
Veröffentlicht in: | Advances in computational intelligence 2021-10, Vol.1 (4), p.6, Article 6 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 4 |
container_start_page | 6 |
container_title | Advances in computational intelligence |
container_volume | 1 |
creator | Han, Jiabao Wang, Hongzhi |
description | Open information extraction (Open IE) is a core task of natural language processing (NLP). Even many efforts have been made in this area, and there are still many problems that need to be tackled. Conventional Open IE approaches use a set of handcrafted patterns to extract relational tuples from the corpus. Secondly, many NLP tools are employed in their procedure; therefore, they face error propagation. To address these problems and inspired by the recent success of Generative Adversarial Networks (GANs), we employ an adversarial training architecture and name it Adversarial-OIE. In Adversarial-OIE, the training of the Open IE model is assisted by a discriminator, which is a (Convolutional Neural Network) CNN model. The goal of the discriminator is to differentiate the extraction result generated by the Open IE model from the training data. The goal of the Open IE model is to produce high-quality triples to cheat the discriminator. A policy gradient method is leveraged to co-train the Open IE model and the discriminator. In particular, due to insufficient training, the discriminator usually leads to the instability of GAN training. We use the distant supervision method to generate training data for the Adversarial-OIE model to solve this problem. To demonstrate our approach, an empirical study on two large benchmark dataset shows that our approach significantly outperforms many existing baselines. |
doi_str_mv | 10.1007/s43674-021-00006-8 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2932839362</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2932839362</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1598-4e022f99f4bc7c2586be124804f3221dde445f031efd0f18696293da64b542e13</originalsourceid><addsrcrecordid>eNp9kE9LAzEUxIMoWGq_gKcFz6vJSzZ_TiJFq1DwoueQ3X2RrW22Jtuq397UVbz5Lm8Ov5mBIeSc0UtGqbpKgkslSgqspPlkqY_IBBSnpdJUH_9qZcQpmaW0ygwoRgFgQq4XGDC6odtj4do9xuRi59ZFwOG9j6-p8H0s-i2GogtZbjLZhwI_huiagzwjJ96tE85-_pQ8390-ze_L5ePiYX6zLBtWGV0KzHXeGC_qRjVQaVkjA6Gp8ByAtS0KUXnKGfqWeqalkWB466SoKwHI-JRcjLnb2L_tMA121e9iyJU2g6C54RIyBSPVxD6liN5uY7dx8dMyag9b2XErm7ey31tZnU18NKUMhxeMf9H_uL4AqWRrWg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2932839362</pqid></control><display><type>article</type><title>Generative adversarial networks for open information extraction</title><source>Springer Nature - Complete Springer Journals</source><source>ProQuest Central</source><creator>Han, Jiabao ; Wang, Hongzhi</creator><creatorcontrib>Han, Jiabao ; Wang, Hongzhi</creatorcontrib><description>Open information extraction (Open IE) is a core task of natural language processing (NLP). Even many efforts have been made in this area, and there are still many problems that need to be tackled. Conventional Open IE approaches use a set of handcrafted patterns to extract relational tuples from the corpus. Secondly, many NLP tools are employed in their procedure; therefore, they face error propagation. To address these problems and inspired by the recent success of Generative Adversarial Networks (GANs), we employ an adversarial training architecture and name it Adversarial-OIE. In Adversarial-OIE, the training of the Open IE model is assisted by a discriminator, which is a (Convolutional Neural Network) CNN model. The goal of the discriminator is to differentiate the extraction result generated by the Open IE model from the training data. The goal of the Open IE model is to produce high-quality triples to cheat the discriminator. A policy gradient method is leveraged to co-train the Open IE model and the discriminator. In particular, due to insufficient training, the discriminator usually leads to the instability of GAN training. We use the distant supervision method to generate training data for the Adversarial-OIE model to solve this problem. To demonstrate our approach, an empirical study on two large benchmark dataset shows that our approach significantly outperforms many existing baselines.</description><identifier>ISSN: 2730-7794</identifier><identifier>EISSN: 2730-7808</identifier><identifier>DOI: 10.1007/s43674-021-00006-8</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Artificial Intelligence ; Artificial neural networks ; Bias ; Computational Intelligence ; Discriminators ; Engineering ; Generative adversarial networks ; Honnold, Alex ; Information retrieval ; Machine Learning ; Machine translation ; Natural language processing ; Neural networks ; Original Article</subject><ispartof>Advances in computational intelligence, 2021-10, Vol.1 (4), p.6, Article 6</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021</rights><rights>The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1598-4e022f99f4bc7c2586be124804f3221dde445f031efd0f18696293da64b542e13</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s43674-021-00006-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2932839362?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,776,780,21368,27903,27904,33723,41467,42536,43784,51297</link.rule.ids></links><search><creatorcontrib>Han, Jiabao</creatorcontrib><creatorcontrib>Wang, Hongzhi</creatorcontrib><title>Generative adversarial networks for open information extraction</title><title>Advances in computational intelligence</title><addtitle>Adv. in Comp. Int</addtitle><description>Open information extraction (Open IE) is a core task of natural language processing (NLP). Even many efforts have been made in this area, and there are still many problems that need to be tackled. Conventional Open IE approaches use a set of handcrafted patterns to extract relational tuples from the corpus. Secondly, many NLP tools are employed in their procedure; therefore, they face error propagation. To address these problems and inspired by the recent success of Generative Adversarial Networks (GANs), we employ an adversarial training architecture and name it Adversarial-OIE. In Adversarial-OIE, the training of the Open IE model is assisted by a discriminator, which is a (Convolutional Neural Network) CNN model. The goal of the discriminator is to differentiate the extraction result generated by the Open IE model from the training data. The goal of the Open IE model is to produce high-quality triples to cheat the discriminator. A policy gradient method is leveraged to co-train the Open IE model and the discriminator. In particular, due to insufficient training, the discriminator usually leads to the instability of GAN training. We use the distant supervision method to generate training data for the Adversarial-OIE model to solve this problem. To demonstrate our approach, an empirical study on two large benchmark dataset shows that our approach significantly outperforms many existing baselines.</description><subject>Artificial Intelligence</subject><subject>Artificial neural networks</subject><subject>Bias</subject><subject>Computational Intelligence</subject><subject>Discriminators</subject><subject>Engineering</subject><subject>Generative adversarial networks</subject><subject>Honnold, Alex</subject><subject>Information retrieval</subject><subject>Machine Learning</subject><subject>Machine translation</subject><subject>Natural language processing</subject><subject>Neural networks</subject><subject>Original Article</subject><issn>2730-7794</issn><issn>2730-7808</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kE9LAzEUxIMoWGq_gKcFz6vJSzZ_TiJFq1DwoueQ3X2RrW22Jtuq397UVbz5Lm8Ov5mBIeSc0UtGqbpKgkslSgqspPlkqY_IBBSnpdJUH_9qZcQpmaW0ygwoRgFgQq4XGDC6odtj4do9xuRi59ZFwOG9j6-p8H0s-i2GogtZbjLZhwI_huiagzwjJ96tE85-_pQ8390-ze_L5ePiYX6zLBtWGV0KzHXeGC_qRjVQaVkjA6Gp8ByAtS0KUXnKGfqWeqalkWB466SoKwHI-JRcjLnb2L_tMA121e9iyJU2g6C54RIyBSPVxD6liN5uY7dx8dMyag9b2XErm7ey31tZnU18NKUMhxeMf9H_uL4AqWRrWg</recordid><startdate>20211001</startdate><enddate>20211001</enddate><creator>Han, Jiabao</creator><creator>Wang, Hongzhi</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>88I</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2P</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20211001</creationdate><title>Generative adversarial networks for open information extraction</title><author>Han, Jiabao ; Wang, Hongzhi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1598-4e022f99f4bc7c2586be124804f3221dde445f031efd0f18696293da64b542e13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Artificial Intelligence</topic><topic>Artificial neural networks</topic><topic>Bias</topic><topic>Computational Intelligence</topic><topic>Discriminators</topic><topic>Engineering</topic><topic>Generative adversarial networks</topic><topic>Honnold, Alex</topic><topic>Information retrieval</topic><topic>Machine Learning</topic><topic>Machine translation</topic><topic>Natural language processing</topic><topic>Neural networks</topic><topic>Original Article</topic><toplevel>online_resources</toplevel><creatorcontrib>Han, Jiabao</creatorcontrib><creatorcontrib>Wang, Hongzhi</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Science Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Advances in computational intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Han, Jiabao</au><au>Wang, Hongzhi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Generative adversarial networks for open information extraction</atitle><jtitle>Advances in computational intelligence</jtitle><stitle>Adv. in Comp. Int</stitle><date>2021-10-01</date><risdate>2021</risdate><volume>1</volume><issue>4</issue><spage>6</spage><pages>6-</pages><artnum>6</artnum><issn>2730-7794</issn><eissn>2730-7808</eissn><abstract>Open information extraction (Open IE) is a core task of natural language processing (NLP). Even many efforts have been made in this area, and there are still many problems that need to be tackled. Conventional Open IE approaches use a set of handcrafted patterns to extract relational tuples from the corpus. Secondly, many NLP tools are employed in their procedure; therefore, they face error propagation. To address these problems and inspired by the recent success of Generative Adversarial Networks (GANs), we employ an adversarial training architecture and name it Adversarial-OIE. In Adversarial-OIE, the training of the Open IE model is assisted by a discriminator, which is a (Convolutional Neural Network) CNN model. The goal of the discriminator is to differentiate the extraction result generated by the Open IE model from the training data. The goal of the Open IE model is to produce high-quality triples to cheat the discriminator. A policy gradient method is leveraged to co-train the Open IE model and the discriminator. In particular, due to insufficient training, the discriminator usually leads to the instability of GAN training. We use the distant supervision method to generate training data for the Adversarial-OIE model to solve this problem. To demonstrate our approach, an empirical study on two large benchmark dataset shows that our approach significantly outperforms many existing baselines.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s43674-021-00006-8</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2730-7794 |
ispartof | Advances in computational intelligence, 2021-10, Vol.1 (4), p.6, Article 6 |
issn | 2730-7794 2730-7808 |
language | eng |
recordid | cdi_proquest_journals_2932839362 |
source | Springer Nature - Complete Springer Journals; ProQuest Central |
subjects | Artificial Intelligence Artificial neural networks Bias Computational Intelligence Discriminators Engineering Generative adversarial networks Honnold, Alex Information retrieval Machine Learning Machine translation Natural language processing Neural networks Original Article |
title | Generative adversarial networks for open information extraction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T00%3A50%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generative%20adversarial%20networks%20for%20open%20information%20extraction&rft.jtitle=Advances%20in%20computational%20intelligence&rft.au=Han,%20Jiabao&rft.date=2021-10-01&rft.volume=1&rft.issue=4&rft.spage=6&rft.pages=6-&rft.artnum=6&rft.issn=2730-7794&rft.eissn=2730-7808&rft_id=info:doi/10.1007/s43674-021-00006-8&rft_dat=%3Cproquest_cross%3E2932839362%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2932839362&rft_id=info:pmid/&rfr_iscdi=true |