Event trigger identification for biomedical events extraction using domain knowledge

In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2014-06, Vol.30 (11), p.1587-1594
Hauptverfasser: Zhou, Deyu, Zhong, Dayou, He, Yulan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1594
container_issue 11
container_start_page 1587
container_title Bioinformatics
container_volume 30
creator Zhou, Deyu
Zhong, Dayou
He, Yulan
description In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification. In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that >2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.
doi_str_mv 10.1093/bioinformatics/btu061
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1786198690</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1786198690</sourcerecordid><originalsourceid>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</originalsourceid><addsrcrecordid>eNqNkUlPwzAQhS0EoqXwE0A5cgm14yXOEVVsUiUu5Rx5mUSGLMVOWP49Li2VOMFpFn3znjQPoXOCrwgu6Fy73nVV71s1OBPmehixIAdoSqjIUyYJOdz3mE7QSQjPGGOOuThGk4wxWVAhp2h18wbdkAze1TX4xNk4ucqZqNp3SdRPolELNm6aBDZsSOBj8Mp8A2NwXZ3YvlWuS166_r0BW8MpOqpUE-BsV2fo6fZmtbhPl493D4vrZWqYlENaWM5B55ngymqiWQ7cFEZoJbkuuLaagQGWcS2AYo21NgSwZFZbUJXBls7Q5VZ37fvXEcJQti4YaBrVQT-GkuRSkEKKAv-NcspkFl8i_4FmkjIiMh5RvkWN70PwUJVr71rlP0uCy01M5e-Yym1M8e5iZzHq-Nz91U8u9AsROpZg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1528341625</pqid></control><display><type>article</type><title>Event trigger identification for biomedical events extraction using domain knowledge</title><source>MEDLINE</source><source>Oxford University Press Open Access</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>EZB Electronic Journals Library</source><creator>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</creator><creatorcontrib>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</creatorcontrib><description>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification. In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that &gt;2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>EISSN: 1460-2059</identifier><identifier>DOI: 10.1093/bioinformatics/btu061</identifier><identifier>PMID: 24489368</identifier><language>eng</language><publisher>England</publisher><subject>Artificial Intelligence ; Binding ; Biomolecules ; Classifiers ; Data Mining - methods ; Diseases ; Drugs ; Extraction ; MEDLINE ; Neural Networks (Computer) ; Ribonucleic acids ; Semantics</subject><ispartof>Bioinformatics, 2014-06, Vol.30 (11), p.1587-1594</ispartof><rights>The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</citedby><cites>FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24489368$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhou, Deyu</creatorcontrib><creatorcontrib>Zhong, Dayou</creatorcontrib><creatorcontrib>He, Yulan</creatorcontrib><title>Event trigger identification for biomedical events extraction using domain knowledge</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification. In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that &gt;2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</description><subject>Artificial Intelligence</subject><subject>Binding</subject><subject>Biomolecules</subject><subject>Classifiers</subject><subject>Data Mining - methods</subject><subject>Diseases</subject><subject>Drugs</subject><subject>Extraction</subject><subject>MEDLINE</subject><subject>Neural Networks (Computer)</subject><subject>Ribonucleic acids</subject><subject>Semantics</subject><issn>1367-4803</issn><issn>1367-4811</issn><issn>1460-2059</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkUlPwzAQhS0EoqXwE0A5cgm14yXOEVVsUiUu5Rx5mUSGLMVOWP49Li2VOMFpFn3znjQPoXOCrwgu6Fy73nVV71s1OBPmehixIAdoSqjIUyYJOdz3mE7QSQjPGGOOuThGk4wxWVAhp2h18wbdkAze1TX4xNk4ucqZqNp3SdRPolELNm6aBDZsSOBj8Mp8A2NwXZ3YvlWuS166_r0BW8MpOqpUE-BsV2fo6fZmtbhPl493D4vrZWqYlENaWM5B55ngymqiWQ7cFEZoJbkuuLaagQGWcS2AYo21NgSwZFZbUJXBls7Q5VZ37fvXEcJQti4YaBrVQT-GkuRSkEKKAv-NcspkFl8i_4FmkjIiMh5RvkWN70PwUJVr71rlP0uCy01M5e-Yym1M8e5iZzHq-Nz91U8u9AsROpZg</recordid><startdate>20140601</startdate><enddate>20140601</enddate><creator>Zhou, Deyu</creator><creator>Zhong, Dayou</creator><creator>He, Yulan</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7QO</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7SC</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140601</creationdate><title>Event trigger identification for biomedical events extraction using domain knowledge</title><author>Zhou, Deyu ; Zhong, Dayou ; He, Yulan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c488t-9d55eb7265adb1b47e5c9c6ba85b95bdb4ece425b6e30b0bbc1e084dbdeafc0d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Artificial Intelligence</topic><topic>Binding</topic><topic>Biomolecules</topic><topic>Classifiers</topic><topic>Data Mining - methods</topic><topic>Diseases</topic><topic>Drugs</topic><topic>Extraction</topic><topic>MEDLINE</topic><topic>Neural Networks (Computer)</topic><topic>Ribonucleic acids</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Deyu</creatorcontrib><creatorcontrib>Zhong, Dayou</creatorcontrib><creatorcontrib>He, Yulan</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhou, Deyu</au><au>Zhong, Dayou</au><au>He, Yulan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Event trigger identification for biomedical events extraction using domain knowledge</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2014-06-01</date><risdate>2014</risdate><volume>30</volume><issue>11</issue><spage>1587</spage><epage>1594</epage><pages>1587-1594</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><eissn>1460-2059</eissn><abstract>In molecular biology, molecular events describe observable alterations of biomolecules, such as binding of proteins or RNA production. These events might be responsible for drug reactions or development of certain diseases. As such, biomedical event extraction, the process of automatically detecting description of molecular interactions in research articles, attracted substantial research interest recently. Event trigger identification, detecting the words describing the event types, is a crucial and prerequisite step in the pipeline process of biomedical event extraction. Taking the event types as classes, event trigger identification can be viewed as a classification task. For each word in a sentence, a trained classifier predicts whether the word corresponds to an event type and which event type based on the context features. Therefore, a well-designed feature set with a good level of discrimination and generalization is crucial for the performance of event trigger identification. In this article, we propose a novel framework for event trigger identification. In particular, we learn biomedical domain knowledge from a large text corpus built from Medline and embed it into word features using neural language modeling. The embedded features are then combined with the syntactic and semantic context features using the multiple kernel learning method. The combined feature set is used for training the event trigger classifier. Experimental results on the golden standard corpus show that &gt;2.5% improvement on F-score is achieved by the proposed framework when compared with the state-of-the-art approach, demonstrating the effectiveness of the proposed framework.</abstract><cop>England</cop><pmid>24489368</pmid><doi>10.1093/bioinformatics/btu061</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2014-06, Vol.30 (11), p.1587-1594
issn 1367-4803
1367-4811
1460-2059
language eng
recordid cdi_proquest_miscellaneous_1786198690
source MEDLINE; Oxford University Press Open Access; PubMed Central; Alma/SFX Local Collection; EZB Electronic Journals Library
subjects Artificial Intelligence
Binding
Biomolecules
Classifiers
Data Mining - methods
Diseases
Drugs
Extraction
MEDLINE
Neural Networks (Computer)
Ribonucleic acids
Semantics
title Event trigger identification for biomedical events extraction using domain knowledge
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T16%3A11%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Event%20trigger%20identification%20for%20biomedical%20events%20extraction%20using%20domain%20knowledge&rft.jtitle=Bioinformatics&rft.au=Zhou,%20Deyu&rft.date=2014-06-01&rft.volume=30&rft.issue=11&rft.spage=1587&rft.epage=1594&rft.pages=1587-1594&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/btu061&rft_dat=%3Cproquest_cross%3E1786198690%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1528341625&rft_id=info:pmid/24489368&rfr_iscdi=true