Automatic classification of RDoC positive valence severity with a neural network

[Display omitted] •We trained a machine learning-based system to determine psychiatric symptom severity.•Regularization and feature selection via mutual information reduced overfitting.•Increasing the amount of annotated data increased accuracy by several percent. Our objective was to develop a mach...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biomedical informatics 2017-11, Vol.75, p.S120-S128
Hauptverfasser: Clark, Cheryl, Wellner, Ben, Davis, Rachel, Aberdeen, John, Hirschman, Lynette
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page S128
container_issue
container_start_page S120
container_title Journal of biomedical informatics
container_volume 75
creator Clark, Cheryl
Wellner, Ben
Davis, Rachel
Aberdeen, John
Hirschman, Lynette
description [Display omitted] •We trained a machine learning-based system to determine psychiatric symptom severity.•Regularization and feature selection via mutual information reduced overfitting.•Increasing the amount of annotated data increased accuracy by several percent. Our objective was to develop a machine learning-based system to determine the severity of Positive Valance symptoms for a patient, based on information included in their initial psychiatric evaluation. Severity was rated on an ordinal scale of 0–3 as follows: 0 (absent=no symptoms), 1 (mild=modest significance), 2 (moderate=requires treatment), 3 (severe=causes substantial impairment) by experts. We treated the task of assigning Positive Valence severity as a text classification problem. During development, we experimented with regularized multinomial logistic regression classifiers, gradient boosted trees, and feedforward, fully-connected neural networks. We found both regularization and feature selection via mutual information to be very important in preventing models from overfitting the data. Our best configuration was a neural network with three fully connected hidden layers with rectified linear unit activations. Our best performing system achieved a score of 77.86%. The evaluation metric is an inverse normalization of the Mean Absolute Error presented as a percentage number between 0 and 100, where 100 means the highest performance. Error analysis showed that 90% of the system errors involved neighboring severity categories. Machine learning text classification techniques with feature selection can be trained to recognize broad differences in Positive Valence symptom severity with a modest amount of training data (in this case 600 documents, 167 of which were unannotated). An increase in the amount of annotated data can increase accuracy of symptom severity classification by several percentage points. Additional features and/or a larger training corpus may further improve accuracy.
doi_str_mv 10.1016/j.jbi.2017.07.005
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5705444</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046417301612</els_id><sourcerecordid>1917961535</sourcerecordid><originalsourceid>FETCH-LOGICAL-c451t-38f70dcdc925282b238a6cebf49703e23795162eed34c1513cdae7ed6c29e02d3</originalsourceid><addsrcrecordid>eNp9UdtKJDEQDbLi_QN8WfK4LzOm0klfEBZkdr2AoIg-h0xSvWa2pzObpFv8ezOMO-iLUFBV1KlTxTmEnAKbAoPybDFdzN2UM6imLAeTO-QAZMEnTNTs27YuxT45jHHBGICU5R7Z53XZCID6gNxfDMkvdXKGmk7H6Fpncud76lv68MvP6MpHl9yIdNQd9gZpxBGDS6_0xaVnqmmPQ9BdTunFh7_HZLfVXcST93xEni5_P86uJ7d3Vzezi9uJERLSpKjbilljTcMlr_mcF7UuDc5b0VSsQF5UjYSSI9pCGJBQGKuxQlsa3iDjtjgiPze8q2G-RGuwT_kLtQpuqcOr8tqpz5PePas_flSyYlIIkQl-vBME_2_AmNTSRYNdp3v0Q1TQQNWUWUKZobCBmuBjDNhuzwBTayfUQmUn1NoJxXKw9c73j_9tN_5LnwHnGwBmlUaHQUXj1gJbF9AkZb37gv4NxSCbHg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1917961535</pqid></control><display><type>article</type><title>Automatic classification of RDoC positive valence severity with a neural network</title><source>Elsevier ScienceDirect Journals Complete - AutoHoldings</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Clark, Cheryl ; Wellner, Ben ; Davis, Rachel ; Aberdeen, John ; Hirschman, Lynette</creator><creatorcontrib>Clark, Cheryl ; Wellner, Ben ; Davis, Rachel ; Aberdeen, John ; Hirschman, Lynette</creatorcontrib><description>[Display omitted] •We trained a machine learning-based system to determine psychiatric symptom severity.•Regularization and feature selection via mutual information reduced overfitting.•Increasing the amount of annotated data increased accuracy by several percent. Our objective was to develop a machine learning-based system to determine the severity of Positive Valance symptoms for a patient, based on information included in their initial psychiatric evaluation. Severity was rated on an ordinal scale of 0–3 as follows: 0 (absent=no symptoms), 1 (mild=modest significance), 2 (moderate=requires treatment), 3 (severe=causes substantial impairment) by experts. We treated the task of assigning Positive Valence severity as a text classification problem. During development, we experimented with regularized multinomial logistic regression classifiers, gradient boosted trees, and feedforward, fully-connected neural networks. We found both regularization and feature selection via mutual information to be very important in preventing models from overfitting the data. Our best configuration was a neural network with three fully connected hidden layers with rectified linear unit activations. Our best performing system achieved a score of 77.86%. The evaluation metric is an inverse normalization of the Mean Absolute Error presented as a percentage number between 0 and 100, where 100 means the highest performance. Error analysis showed that 90% of the system errors involved neighboring severity categories. Machine learning text classification techniques with feature selection can be trained to recognize broad differences in Positive Valence symptom severity with a modest amount of training data (in this case 600 documents, 167 of which were unannotated). An increase in the amount of annotated data can increase accuracy of symptom severity classification by several percentage points. Additional features and/or a larger training corpus may further improve accuracy.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2017.07.005</identifier><identifier>PMID: 28694118</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Machine learning ; Mental disorder severity ; Positive valance ; Research domain criteria (RDoC) ; Text classification</subject><ispartof>Journal of biomedical informatics, 2017-11, Vol.75, p.S120-S128</ispartof><rights>2017</rights><rights>Copyright © 2017. Published by Elsevier Inc.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c451t-38f70dcdc925282b238a6cebf49703e23795162eed34c1513cdae7ed6c29e02d3</citedby><cites>FETCH-LOGICAL-c451t-38f70dcdc925282b238a6cebf49703e23795162eed34c1513cdae7ed6c29e02d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jbi.2017.07.005$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,3549,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28694118$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Clark, Cheryl</creatorcontrib><creatorcontrib>Wellner, Ben</creatorcontrib><creatorcontrib>Davis, Rachel</creatorcontrib><creatorcontrib>Aberdeen, John</creatorcontrib><creatorcontrib>Hirschman, Lynette</creatorcontrib><title>Automatic classification of RDoC positive valence severity with a neural network</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted] •We trained a machine learning-based system to determine psychiatric symptom severity.•Regularization and feature selection via mutual information reduced overfitting.•Increasing the amount of annotated data increased accuracy by several percent. Our objective was to develop a machine learning-based system to determine the severity of Positive Valance symptoms for a patient, based on information included in their initial psychiatric evaluation. Severity was rated on an ordinal scale of 0–3 as follows: 0 (absent=no symptoms), 1 (mild=modest significance), 2 (moderate=requires treatment), 3 (severe=causes substantial impairment) by experts. We treated the task of assigning Positive Valence severity as a text classification problem. During development, we experimented with regularized multinomial logistic regression classifiers, gradient boosted trees, and feedforward, fully-connected neural networks. We found both regularization and feature selection via mutual information to be very important in preventing models from overfitting the data. Our best configuration was a neural network with three fully connected hidden layers with rectified linear unit activations. Our best performing system achieved a score of 77.86%. The evaluation metric is an inverse normalization of the Mean Absolute Error presented as a percentage number between 0 and 100, where 100 means the highest performance. Error analysis showed that 90% of the system errors involved neighboring severity categories. Machine learning text classification techniques with feature selection can be trained to recognize broad differences in Positive Valence symptom severity with a modest amount of training data (in this case 600 documents, 167 of which were unannotated). An increase in the amount of annotated data can increase accuracy of symptom severity classification by several percentage points. Additional features and/or a larger training corpus may further improve accuracy.</description><subject>Machine learning</subject><subject>Mental disorder severity</subject><subject>Positive valance</subject><subject>Research domain criteria (RDoC)</subject><subject>Text classification</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp9UdtKJDEQDbLi_QN8WfK4LzOm0klfEBZkdr2AoIg-h0xSvWa2pzObpFv8ezOMO-iLUFBV1KlTxTmEnAKbAoPybDFdzN2UM6imLAeTO-QAZMEnTNTs27YuxT45jHHBGICU5R7Z53XZCID6gNxfDMkvdXKGmk7H6Fpncud76lv68MvP6MpHl9yIdNQd9gZpxBGDS6_0xaVnqmmPQ9BdTunFh7_HZLfVXcST93xEni5_P86uJ7d3Vzezi9uJERLSpKjbilljTcMlr_mcF7UuDc5b0VSsQF5UjYSSI9pCGJBQGKuxQlsa3iDjtjgiPze8q2G-RGuwT_kLtQpuqcOr8tqpz5PePas_flSyYlIIkQl-vBME_2_AmNTSRYNdp3v0Q1TQQNWUWUKZobCBmuBjDNhuzwBTayfUQmUn1NoJxXKw9c73j_9tN_5LnwHnGwBmlUaHQUXj1gJbF9AkZb37gv4NxSCbHg</recordid><startdate>20171101</startdate><enddate>20171101</enddate><creator>Clark, Cheryl</creator><creator>Wellner, Ben</creator><creator>Davis, Rachel</creator><creator>Aberdeen, John</creator><creator>Hirschman, Lynette</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20171101</creationdate><title>Automatic classification of RDoC positive valence severity with a neural network</title><author>Clark, Cheryl ; Wellner, Ben ; Davis, Rachel ; Aberdeen, John ; Hirschman, Lynette</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c451t-38f70dcdc925282b238a6cebf49703e23795162eed34c1513cdae7ed6c29e02d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Machine learning</topic><topic>Mental disorder severity</topic><topic>Positive valance</topic><topic>Research domain criteria (RDoC)</topic><topic>Text classification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Clark, Cheryl</creatorcontrib><creatorcontrib>Wellner, Ben</creatorcontrib><creatorcontrib>Davis, Rachel</creatorcontrib><creatorcontrib>Aberdeen, John</creatorcontrib><creatorcontrib>Hirschman, Lynette</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Clark, Cheryl</au><au>Wellner, Ben</au><au>Davis, Rachel</au><au>Aberdeen, John</au><au>Hirschman, Lynette</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic classification of RDoC positive valence severity with a neural network</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2017-11-01</date><risdate>2017</risdate><volume>75</volume><spage>S120</spage><epage>S128</epage><pages>S120-S128</pages><issn>1532-0464</issn><eissn>1532-0480</eissn><abstract>[Display omitted] •We trained a machine learning-based system to determine psychiatric symptom severity.•Regularization and feature selection via mutual information reduced overfitting.•Increasing the amount of annotated data increased accuracy by several percent. Our objective was to develop a machine learning-based system to determine the severity of Positive Valance symptoms for a patient, based on information included in their initial psychiatric evaluation. Severity was rated on an ordinal scale of 0–3 as follows: 0 (absent=no symptoms), 1 (mild=modest significance), 2 (moderate=requires treatment), 3 (severe=causes substantial impairment) by experts. We treated the task of assigning Positive Valence severity as a text classification problem. During development, we experimented with regularized multinomial logistic regression classifiers, gradient boosted trees, and feedforward, fully-connected neural networks. We found both regularization and feature selection via mutual information to be very important in preventing models from overfitting the data. Our best configuration was a neural network with three fully connected hidden layers with rectified linear unit activations. Our best performing system achieved a score of 77.86%. The evaluation metric is an inverse normalization of the Mean Absolute Error presented as a percentage number between 0 and 100, where 100 means the highest performance. Error analysis showed that 90% of the system errors involved neighboring severity categories. Machine learning text classification techniques with feature selection can be trained to recognize broad differences in Positive Valence symptom severity with a modest amount of training data (in this case 600 documents, 167 of which were unannotated). An increase in the amount of annotated data can increase accuracy of symptom severity classification by several percentage points. Additional features and/or a larger training corpus may further improve accuracy.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>28694118</pmid><doi>10.1016/j.jbi.2017.07.005</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1532-0464
ispartof Journal of biomedical informatics, 2017-11, Vol.75, p.S120-S128
issn 1532-0464
1532-0480
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5705444
source Elsevier ScienceDirect Journals Complete - AutoHoldings; EZB-FREE-00999 freely available EZB journals
subjects Machine learning
Mental disorder severity
Positive valance
Research domain criteria (RDoC)
Text classification
title Automatic classification of RDoC positive valence severity with a neural network
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T16%3A22%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20classification%20of%20RDoC%20positive%20valence%20severity%20with%20a%20neural%20network&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Clark,%20Cheryl&rft.date=2017-11-01&rft.volume=75&rft.spage=S120&rft.epage=S128&rft.pages=S120-S128&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2017.07.005&rft_dat=%3Cproquest_pubme%3E1917961535%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1917961535&rft_id=info:pmid/28694118&rft_els_id=S1532046417301612&rfr_iscdi=true