Identifying outliers and implausible values in growth trajectory data

Abstract Purpose To illustrate how conditional growth percentiles can be adapted for use to systematically identify implausible measurements in growth trajectory data. Methods The use of conditional growth percentiles as a tool to assess serial weight data was reviewed. The approach was applied to 8...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Annals of epidemiology 2016-01, Vol.26 (1), p.77-80.e2
Hauptverfasser: Yang, Seungmi, PhD, Hutcheon, Jennifer A., PhD
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 80.e2
container_issue 1
container_start_page 77
container_title Annals of epidemiology
container_volume 26
creator Yang, Seungmi, PhD
Hutcheon, Jennifer A., PhD
description Abstract Purpose To illustrate how conditional growth percentiles can be adapted for use to systematically identify implausible measurements in growth trajectory data. Methods The use of conditional growth percentiles as a tool to assess serial weight data was reviewed. The approach was applied to 86,427 weight measurements (kg) taken between birth and age 6.5 years in 8217 girls participating in the Promotion of Breast Feeding Intervention Trial in Belarus. A conditional mean and variance was calculated for each weight measurement, which reflects the expected weight at a current visit given the girl's previous weights. Measurements were flagged as outliers if they were more than 4 standard deviation (SD) above or below the expected (conditional) weight. Results The method identified 234 weight measurements (0.3%) from 216 girls as potential outliers. Review of these trajectories confirmed the implausibility of the flagged measurements, and that the approach identified observations that would not have been identified using a conventional cross-sectional approach (±4 SD of the population mean) for identifying implausible values. Stata code to implement the approach is provided. Conclusions Conditional growth percentiles can be used to systematically identify implausible values in growth trajectory data and may be particularly useful for large data sets where the high number of trajectories makes ad hoc approaches unfeasible.
doi_str_mv 10.1016/j.annepidem.2015.10.002
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4732581</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>1_s2_0_S1047279715004184</els_id><sourcerecordid>1751194669</sourcerecordid><originalsourceid>FETCH-LOGICAL-c530t-905e8126a5872f9e25583973e89065eb677929196b2707b9e28dbc9e0c2b17003</originalsourceid><addsrcrecordid>eNqNUk1v3CAQRVWj5qP9C62PvXg7YGPMJVIUpW2kSD0kOSOMZze4GLZgb7T_vlibrNqeegLx3rwZ3htCPlFYUaDNl2Glvcet7XFcMaA8v64A2BtyRltRlYy3_G2-Qy1KJqQ4JecpDQAgWsHekVPWcJmx5ozc3PboJ7veW78pwjw5izEV2veFHbdOz8l2DouddjOmwvpiE8Pz9FRMUQ9ophD3Ra8n_Z6crLVL-OHlvCCPX28err-Xdz--3V5f3ZWGVzCVEji2lDWa5zHWEhnnbSVFha2EhmPXCCGZpLLpmADRZULbd0YiGNZRAVBdkMuD7nbuRuxNHj1qp7bRjjruVdBW_Y14-6Q2YadqUWVPaBb4_CIQw6_8pUmNNhl0TnsMc1JUcEpl3TQyU8WBamJIKeL62IaCWkJQgzqGoJYQFiCHkCs__jnlse7V9Uy4OhAwe7XLjqtkLHqDvY3ZVdUH-x9NLv_RMM56a7T7iXtMQ5ijz1EoqhJToO6XXVhWgXKAmrZ19RuNabIV</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1751194669</pqid></control><display><type>article</type><title>Identifying outliers and implausible values in growth trajectory data</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>Yang, Seungmi, PhD ; Hutcheon, Jennifer A., PhD</creator><creatorcontrib>Yang, Seungmi, PhD ; Hutcheon, Jennifer A., PhD</creatorcontrib><description>Abstract Purpose To illustrate how conditional growth percentiles can be adapted for use to systematically identify implausible measurements in growth trajectory data. Methods The use of conditional growth percentiles as a tool to assess serial weight data was reviewed. The approach was applied to 86,427 weight measurements (kg) taken between birth and age 6.5 years in 8217 girls participating in the Promotion of Breast Feeding Intervention Trial in Belarus. A conditional mean and variance was calculated for each weight measurement, which reflects the expected weight at a current visit given the girl's previous weights. Measurements were flagged as outliers if they were more than 4 standard deviation (SD) above or below the expected (conditional) weight. Results The method identified 234 weight measurements (0.3%) from 216 girls as potential outliers. Review of these trajectories confirmed the implausibility of the flagged measurements, and that the approach identified observations that would not have been identified using a conventional cross-sectional approach (±4 SD of the population mean) for identifying implausible values. Stata code to implement the approach is provided. Conclusions Conditional growth percentiles can be used to systematically identify implausible values in growth trajectory data and may be particularly useful for large data sets where the high number of trajectories makes ad hoc approaches unfeasible.</description><identifier>ISSN: 1047-2797</identifier><identifier>EISSN: 1873-2585</identifier><identifier>DOI: 10.1016/j.annepidem.2015.10.002</identifier><identifier>PMID: 26590476</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Child ; Child Development - physiology ; Child, Preschool ; Data cleaning ; Data Interpretation, Statistical ; Female ; Growth Charts ; Humans ; Infant ; Infant, Newborn ; Internal Medicine ; Longitudinal growth data ; Longitudinal Studies ; Male ; Models, Statistical ; Outliers identification ; Weight Gain - physiology</subject><ispartof>Annals of epidemiology, 2016-01, Vol.26 (1), p.77-80.e2</ispartof><rights>Elsevier Inc.</rights><rights>2016 Elsevier Inc.</rights><rights>Copyright © 2016 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c530t-905e8126a5872f9e25583973e89065eb677929196b2707b9e28dbc9e0c2b17003</citedby><cites>FETCH-LOGICAL-c530t-905e8126a5872f9e25583973e89065eb677929196b2707b9e28dbc9e0c2b17003</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.annepidem.2015.10.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26590476$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Yang, Seungmi, PhD</creatorcontrib><creatorcontrib>Hutcheon, Jennifer A., PhD</creatorcontrib><title>Identifying outliers and implausible values in growth trajectory data</title><title>Annals of epidemiology</title><addtitle>Ann Epidemiol</addtitle><description>Abstract Purpose To illustrate how conditional growth percentiles can be adapted for use to systematically identify implausible measurements in growth trajectory data. Methods The use of conditional growth percentiles as a tool to assess serial weight data was reviewed. The approach was applied to 86,427 weight measurements (kg) taken between birth and age 6.5 years in 8217 girls participating in the Promotion of Breast Feeding Intervention Trial in Belarus. A conditional mean and variance was calculated for each weight measurement, which reflects the expected weight at a current visit given the girl's previous weights. Measurements were flagged as outliers if they were more than 4 standard deviation (SD) above or below the expected (conditional) weight. Results The method identified 234 weight measurements (0.3%) from 216 girls as potential outliers. Review of these trajectories confirmed the implausibility of the flagged measurements, and that the approach identified observations that would not have been identified using a conventional cross-sectional approach (±4 SD of the population mean) for identifying implausible values. Stata code to implement the approach is provided. Conclusions Conditional growth percentiles can be used to systematically identify implausible values in growth trajectory data and may be particularly useful for large data sets where the high number of trajectories makes ad hoc approaches unfeasible.</description><subject>Child</subject><subject>Child Development - physiology</subject><subject>Child, Preschool</subject><subject>Data cleaning</subject><subject>Data Interpretation, Statistical</subject><subject>Female</subject><subject>Growth Charts</subject><subject>Humans</subject><subject>Infant</subject><subject>Infant, Newborn</subject><subject>Internal Medicine</subject><subject>Longitudinal growth data</subject><subject>Longitudinal Studies</subject><subject>Male</subject><subject>Models, Statistical</subject><subject>Outliers identification</subject><subject>Weight Gain - physiology</subject><issn>1047-2797</issn><issn>1873-2585</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNUk1v3CAQRVWj5qP9C62PvXg7YGPMJVIUpW2kSD0kOSOMZze4GLZgb7T_vlibrNqeegLx3rwZ3htCPlFYUaDNl2Glvcet7XFcMaA8v64A2BtyRltRlYy3_G2-Qy1KJqQ4JecpDQAgWsHekVPWcJmx5ozc3PboJ7veW78pwjw5izEV2veFHbdOz8l2DouddjOmwvpiE8Pz9FRMUQ9ophD3Ra8n_Z6crLVL-OHlvCCPX28err-Xdz--3V5f3ZWGVzCVEji2lDWa5zHWEhnnbSVFha2EhmPXCCGZpLLpmADRZULbd0YiGNZRAVBdkMuD7nbuRuxNHj1qp7bRjjruVdBW_Y14-6Q2YadqUWVPaBb4_CIQw6_8pUmNNhl0TnsMc1JUcEpl3TQyU8WBamJIKeL62IaCWkJQgzqGoJYQFiCHkCs__jnlse7V9Uy4OhAwe7XLjqtkLHqDvY3ZVdUH-x9NLv_RMM56a7T7iXtMQ5ijz1EoqhJToO6XXVhWgXKAmrZ19RuNabIV</recordid><startdate>20160101</startdate><enddate>20160101</enddate><creator>Yang, Seungmi, PhD</creator><creator>Hutcheon, Jennifer A., PhD</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20160101</creationdate><title>Identifying outliers and implausible values in growth trajectory data</title><author>Yang, Seungmi, PhD ; Hutcheon, Jennifer A., PhD</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c530t-905e8126a5872f9e25583973e89065eb677929196b2707b9e28dbc9e0c2b17003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Child</topic><topic>Child Development - physiology</topic><topic>Child, Preschool</topic><topic>Data cleaning</topic><topic>Data Interpretation, Statistical</topic><topic>Female</topic><topic>Growth Charts</topic><topic>Humans</topic><topic>Infant</topic><topic>Infant, Newborn</topic><topic>Internal Medicine</topic><topic>Longitudinal growth data</topic><topic>Longitudinal Studies</topic><topic>Male</topic><topic>Models, Statistical</topic><topic>Outliers identification</topic><topic>Weight Gain - physiology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Seungmi, PhD</creatorcontrib><creatorcontrib>Hutcheon, Jennifer A., PhD</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Annals of epidemiology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yang, Seungmi, PhD</au><au>Hutcheon, Jennifer A., PhD</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identifying outliers and implausible values in growth trajectory data</atitle><jtitle>Annals of epidemiology</jtitle><addtitle>Ann Epidemiol</addtitle><date>2016-01-01</date><risdate>2016</risdate><volume>26</volume><issue>1</issue><spage>77</spage><epage>80.e2</epage><pages>77-80.e2</pages><issn>1047-2797</issn><eissn>1873-2585</eissn><abstract>Abstract Purpose To illustrate how conditional growth percentiles can be adapted for use to systematically identify implausible measurements in growth trajectory data. Methods The use of conditional growth percentiles as a tool to assess serial weight data was reviewed. The approach was applied to 86,427 weight measurements (kg) taken between birth and age 6.5 years in 8217 girls participating in the Promotion of Breast Feeding Intervention Trial in Belarus. A conditional mean and variance was calculated for each weight measurement, which reflects the expected weight at a current visit given the girl's previous weights. Measurements were flagged as outliers if they were more than 4 standard deviation (SD) above or below the expected (conditional) weight. Results The method identified 234 weight measurements (0.3%) from 216 girls as potential outliers. Review of these trajectories confirmed the implausibility of the flagged measurements, and that the approach identified observations that would not have been identified using a conventional cross-sectional approach (±4 SD of the population mean) for identifying implausible values. Stata code to implement the approach is provided. Conclusions Conditional growth percentiles can be used to systematically identify implausible values in growth trajectory data and may be particularly useful for large data sets where the high number of trajectories makes ad hoc approaches unfeasible.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>26590476</pmid><doi>10.1016/j.annepidem.2015.10.002</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1047-2797
ispartof Annals of epidemiology, 2016-01, Vol.26 (1), p.77-80.e2
issn 1047-2797
1873-2585
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4732581
source MEDLINE; Access via ScienceDirect (Elsevier)
subjects Child
Child Development - physiology
Child, Preschool
Data cleaning
Data Interpretation, Statistical
Female
Growth Charts
Humans
Infant
Infant, Newborn
Internal Medicine
Longitudinal growth data
Longitudinal Studies
Male
Models, Statistical
Outliers identification
Weight Gain - physiology
title Identifying outliers and implausible values in growth trajectory data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T19%3A07%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identifying%20outliers%20and%20implausible%20values%20in%20growth%20trajectory%20data&rft.jtitle=Annals%20of%20epidemiology&rft.au=Yang,%20Seungmi,%20PhD&rft.date=2016-01-01&rft.volume=26&rft.issue=1&rft.spage=77&rft.epage=80.e2&rft.pages=77-80.e2&rft.issn=1047-2797&rft.eissn=1873-2585&rft_id=info:doi/10.1016/j.annepidem.2015.10.002&rft_dat=%3Cproquest_pubme%3E1751194669%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1751194669&rft_id=info:pmid/26590476&rft_els_id=1_s2_0_S1047279715004184&rfr_iscdi=true