Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method

This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone a...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Choi, Hong-Jun, Na, Dongbin, Cho, Kyungjin, Bae, Byunguk, Kong, Seo Taek, An, Hyunjoon
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Choi, Hong-Jun Na, Dongbin Cho, Kyungjin Bae, Byunguk Kong, Seo Taek An, Hyunjoon
description	This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone age, is to train classifiers independently to score each region of interest (RoI), but this approach limits the accessible information to local morphologies and increases computational costs. As a result, this work proposes a self-accumulative vision transformer (SAT) that mitigates anisotropic behavior, which usually occurs in multi-view, multi-task problems and limits the effectiveness of a vision transformer, by applying token replay and regional attention bias. A number of experiments show that SAT successfully exploits the relationships between landmarks and learns global morphological features, resulting in a mean absolute error of BAA that is 0.11 lower than that of the previous work. Additionally, the proposed SAT has four times reduced parameters than an ensemble of individual classifiers of the previous work. Lastly, this work also provides informative implications for clinical practice, improving the accuracy and efficiency of BAA in diagnosing abnormal growth in adolescents.
doi_str_mv	10.48550/arxiv.2303.16557
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2303_16557</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2303_16557</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-4ceee80d0fdb3700d09dce1a9525f261edc055bf37fd9da49d405ac75bb341383</originalsourceid><addsrcrecordid>eNotj81OhDAURrtxYUYfwJV9AbCllMJynPiXzMTFoHFHLu0t0wSKaYHo24ujiy_nW53kEHLDWZqXUrI7CF9uSTPBRMoLKdUl-ThibxPQeh7mHia3IH130Y2e1gF8tGMYMNAV9H70SLfduhgxxgH9RN-i8x2dTkiPMC_YBXCeHnA6jeaKXFjoI17_c0Pqx4d695zsX59edtt9AoVSSa4RsWSGWdMKxdZTGY0cKplJmxUcjWZStlYoayoDeWVyJkEr2bYi56IUG3L7pz2nNZ_BDRC-m9_E5pwofgBovE1Y</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method</title><source>arXiv.org</source><creator>Choi, Hong-Jun ; Na, Dongbin ; Cho, Kyungjin ; Bae, Byunguk ; Kong, Seo Taek ; An, Hyunjoon</creator><creatorcontrib>Choi, Hong-Jun ; Na, Dongbin ; Cho, Kyungjin ; Bae, Byunguk ; Kong, Seo Taek ; An, Hyunjoon</creatorcontrib><description>This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone age, is to train classifiers independently to score each region of interest (RoI), but this approach limits the accessible information to local morphologies and increases computational costs. As a result, this work proposes a self-accumulative vision transformer (SAT) that mitigates anisotropic behavior, which usually occurs in multi-view, multi-task problems and limits the effectiveness of a vision transformer, by applying token replay and regional attention bias. A number of experiments show that SAT successfully exploits the relationships between landmarks and learns global morphological features, resulting in a mean absolute error of BAA that is 0.11 lower than that of the previous work. Additionally, the proposed SAT has four times reduced parameters than an ensemble of individual classifiers of the previous work. Lastly, this work also provides informative implications for clinical practice, improving the accuracy and efficiency of BAA in diagnosing abnormal growth in adolescents.</description><identifier>DOI: 10.48550/arxiv.2303.16557</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2303.16557$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2303.16557$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Choi, Hong-Jun</creatorcontrib><creatorcontrib>Na, Dongbin</creatorcontrib><creatorcontrib>Cho, Kyungjin</creatorcontrib><creatorcontrib>Bae, Byunguk</creatorcontrib><creatorcontrib>Kong, Seo Taek</creatorcontrib><creatorcontrib>An, Hyunjoon</creatorcontrib><title>Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method</title><description>This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone age, is to train classifiers independently to score each region of interest (RoI), but this approach limits the accessible information to local morphologies and increases computational costs. As a result, this work proposes a self-accumulative vision transformer (SAT) that mitigates anisotropic behavior, which usually occurs in multi-view, multi-task problems and limits the effectiveness of a vision transformer, by applying token replay and regional attention bias. A number of experiments show that SAT successfully exploits the relationships between landmarks and learns global morphological features, resulting in a mean absolute error of BAA that is 0.11 lower than that of the previous work. Additionally, the proposed SAT has four times reduced parameters than an ensemble of individual classifiers of the previous work. Lastly, this work also provides informative implications for clinical practice, improving the accuracy and efficiency of BAA in diagnosing abnormal growth in adolescents.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj81OhDAURrtxYUYfwJV9AbCllMJynPiXzMTFoHFHLu0t0wSKaYHo24ujiy_nW53kEHLDWZqXUrI7CF9uSTPBRMoLKdUl-ThibxPQeh7mHia3IH130Y2e1gF8tGMYMNAV9H70SLfduhgxxgH9RN-i8x2dTkiPMC_YBXCeHnA6jeaKXFjoI17_c0Pqx4d695zsX59edtt9AoVSSa4RsWSGWdMKxdZTGY0cKplJmxUcjWZStlYoayoDeWVyJkEr2bYi56IUG3L7pz2nNZ_BDRC-m9_E5pwofgBovE1Y</recordid><startdate>20230329</startdate><enddate>20230329</enddate><creator>Choi, Hong-Jun</creator><creator>Na, Dongbin</creator><creator>Cho, Kyungjin</creator><creator>Bae, Byunguk</creator><creator>Kong, Seo Taek</creator><creator>An, Hyunjoon</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230329</creationdate><title>Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method</title><author>Choi, Hong-Jun ; Na, Dongbin ; Cho, Kyungjin ; Bae, Byunguk ; Kong, Seo Taek ; An, Hyunjoon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-4ceee80d0fdb3700d09dce1a9525f261edc055bf37fd9da49d405ac75bb341383</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Choi, Hong-Jun</creatorcontrib><creatorcontrib>Na, Dongbin</creatorcontrib><creatorcontrib>Cho, Kyungjin</creatorcontrib><creatorcontrib>Bae, Byunguk</creatorcontrib><creatorcontrib>Kong, Seo Taek</creatorcontrib><creatorcontrib>An, Hyunjoon</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Choi, Hong-Jun</au><au>Na, Dongbin</au><au>Cho, Kyungjin</au><au>Bae, Byunguk</au><au>Kong, Seo Taek</au><au>An, Hyunjoon</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method</atitle><date>2023-03-29</date><risdate>2023</risdate><abstract>This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone age, is to train classifiers independently to score each region of interest (RoI), but this approach limits the accessible information to local morphologies and increases computational costs. As a result, this work proposes a self-accumulative vision transformer (SAT) that mitigates anisotropic behavior, which usually occurs in multi-view, multi-task problems and limits the effectiveness of a vision transformer, by applying token replay and regional attention bias. A number of experiments show that SAT successfully exploits the relationships between landmarks and learns global morphological features, resulting in a mean absolute error of BAA that is 0.11 lower than that of the previous work. Additionally, the proposed SAT has four times reduced parameters than an ensemble of individual classifiers of the previous work. Lastly, this work also provides informative implications for clinical practice, improving the accuracy and efficiency of BAA in diagnosing abnormal growth in adolescents.</abstract><doi>10.48550/arxiv.2303.16557</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2303.16557
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2303_16557
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
title	Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T07%3A12%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Self-accumulative%20Vision%20Transformer%20for%20Bone%20Age%20Assessment%20Using%20the%20Sauvegrain%20Method&rft.au=Choi,%20Hong-Jun&rft.date=2023-03-29&rft_id=info:doi/10.48550/arxiv.2303.16557&rft_dat=%3Carxiv_GOX%3E2303_16557%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true