Event trigger identification for biomedical events extraction using domain knowledge
In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting...
Gespeichert in:
Veröffentlicht in: | Bioinformatics 2014-06, Vol.30 (11), p.1587-1594 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1594 |
---|---|
container_issue | 11 |
container_start_page | 1587 |
container_title | Bioinformatics |
container_volume | 30 |
creator | Zhou, Deyu Zhong, Dayou He, Yulan |
description | In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification.
In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that >2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework. |
doi_str_mv | 10.1093/bioinformatics/btu061 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1786198690</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1786198690</sourcerecordid><originalsourceid>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</originalsourceid><addsrcrecordid>eNqNkUlPwzAQhS0EoqXwE0A5cgm14yXOEVVsUiUu5Rx5mUSGLMVOWP49Li2VOMFpFn3znjQPoXOCrwgu6Fy73nVV71s1OBPmehixIAdoSqjIUyYJOdz3mE7QSQjPGGOOuThGk4wxWVAhp2h18wbdkAze1TX4xNk4ucqZqNp3SdRPolELNm6aBDZsSOBj8Mp8A2NwXZ3YvlWuS166_r0BW8MpOqpUE-BsV2fo6fZmtbhPl493D4vrZWqYlENaWM5B55ngymqiWQ7cFEZoJbkuuLaagQGWcS2AYo21NgSwZFZbUJXBls7Q5VZ37fvXEcJQti4YaBrVQT-GkuRSkEKKAv-NcspkFl8i_4FmkjIiMh5RvkWN70PwUJVr71rlP0uCy01M5e-Yym1M8e5iZzHq-Nz91U8u9AsROpZg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1528341625</pqid></control><display><type>article</type><title>Event trigger identification for biomedical events extraction using domain knowledge</title><source>MEDLINE</source><source>Oxford University Press Open Access</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>EZB Electronic Journals Library</source><creator>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</creator><creatorcontrib>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</creatorcontrib><description>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification.
In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that >2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>EISSN: 1460-2059</identifier><identifier>DOI: 10.1093/bioinformatics/btu061</identifier><identifier>PMID: 24489368</identifier><language>eng</language><publisher>England</publisher><subject>Artificial Intelligence ; Binding ; Biomolecules ; Classifiers ; Data Mining - methods ; Diseases ; Drugs ; Extraction ; MEDLINE ; Neural Networks (Computer) ; Ribonucleic acids ; Semantics</subject><ispartof>Bioinformatics, 2014-06, Vol.30 (11), p.1587-1594</ispartof><rights>The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</citedby><cites>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24489368$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhou, Deyu</creatorcontrib><creatorcontrib>Zhong, Dayou</creatorcontrib><creatorcontrib>He, Yulan</creatorcontrib><title>Event trigger identification for biomedical events extraction using domain knowledge</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification.
In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that >2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</description><subject>Artificial Intelligence</subject><subject>Binding</subject><subject>Biomolecules</subject><subject>Classifiers</subject><subject>Data Mining - methods</subject><subject>Diseases</subject><subject>Drugs</subject><subject>Extraction</subject><subject>MEDLINE</subject><subject>Neural Networks (Computer)</subject><subject>Ribonucleic acids</subject><subject>Semantics</subject><issn>1367-4803</issn><issn>1367-4811</issn><issn>1460-2059</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkUlPwzAQhS0EoqXwE0A5cgm14yXOEVVsUiUu5Rx5mUSGLMVOWP49Li2VOMFpFn3znjQPoXOCrwgu6Fy73nVV71s1OBPmehixIAdoSqjIUyYJOdz3mE7QSQjPGGOOuThGk4wxWVAhp2h18wbdkAze1TX4xNk4ucqZqNp3SdRPolELNm6aBDZsSOBj8Mp8A2NwXZ3YvlWuS166_r0BW8MpOqpUE-BsV2fo6fZmtbhPl493D4vrZWqYlENaWM5B55ngymqiWQ7cFEZoJbkuuLaagQGWcS2AYo21NgSwZFZbUJXBls7Q5VZ37fvXEcJQti4YaBrVQT-GkuRSkEKKAv-NcspkFl8i_4FmkjIiMh5RvkWN70PwUJVr71rlP0uCy01M5e-Yym1M8e5iZzHq-Nz91U8u9AsROpZg</recordid><startdate>20140601</startdate><enddate>20140601</enddate><creator>Zhou, Deyu</creator><creator>Zhong, Dayou</creator><creator>He, Yulan</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7QO</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7SC</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140601</creationdate><title>Event trigger identification for biomedical events extraction using domain knowledge</title><author>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Artificial Intelligence</topic><topic>Binding</topic><topic>Biomolecules</topic><topic>Classifiers</topic><topic>Data Mining - methods</topic><topic>Diseases</topic><topic>Drugs</topic><topic>Extraction</topic><topic>MEDLINE</topic><topic>Neural Networks (Computer)</topic><topic>Ribonucleic acids</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Deyu</creatorcontrib><creatorcontrib>Zhong, Dayou</creatorcontrib><creatorcontrib>He, Yulan</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhou, Deyu</au><au>Zhong, Dayou</au><au>He, Yulan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Event trigger identification for biomedical events extraction using domain knowledge</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2014-06-01</date><risdate>2014</risdate><volume>30</volume><issue>11</issue><spage>1587</spage><epage>1594</epage><pages>1587-1594</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><eissn>1460-2059</eissn><abstract>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification.
In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that >2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</abstract><cop>England</cop><pmid>24489368</pmid><doi>10.1093/bioinformatics/btu061</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2014-06, Vol.30 (11), p.1587-1594 |
issn | 1367-4803 1367-4811 1460-2059 |
language | eng |
recordid | cdi_proquest_miscellaneous_1786198690 |
source | MEDLINE; Oxford University Press Open Access; PubMed Central; Alma/SFX Local Collection; EZB Electronic Journals Library |
subjects | Artificial Intelligence Binding Biomolecules Classifiers Data Mining - methods Diseases Drugs Extraction MEDLINE Neural Networks (Computer) Ribonucleic acids Semantics |
title | Event trigger identification for biomedical events extraction using domain knowledge |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T16%3A11%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Event%20trigger%20identification%20for%20biomedical%20events%20extraction%20using%20domain%20knowledge&rft.jtitle=Bioinformatics&rft.au=Zhou,%20Deyu&rft.date=2014-06-01&rft.volume=30&rft.issue=11&rft.spage=1587&rft.epage=1594&rft.pages=1587-1594&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/btu061&rft_dat=%3Cproquest_cross%3E1786198690%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1528341625&rft_id=info:pmid/24489368&rfr_iscdi=true |