Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method

This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. A straightforward solution to automating the Sauvegrain method, which assesses a maturity score for each landmark in the elbow and predicts the bone a...

Bibliographic details
Main authors: Choi, Hong-Jun; Na, Dongbin; Cho, Kyungjin; Bae, Byunguk; Kong, Seo Taek; An, Hyunjoon
Format: Article
Language: eng
creator Choi, Hong-Jun
Na, Dongbin
Cho, Kyungjin
Bae, Byunguk
Kong, Seo Taek
An, Hyunjoon
description This study presents a novel approach to bone age assessment (BAA) using a multi-view, multi-task classification model based on the Sauvegrain method. The Sauvegrain method assesses a maturity score for each landmark in the elbow and predicts the bone age. A straightforward way to automate it is to train classifiers independently to score each region of interest (RoI), but this approach limits the accessible information to local morphologies and increases computational costs. This work therefore proposes a self-accumulative vision transformer (SAT) that applies token replay and regional attention bias to mitigate anisotropic behavior, which usually occurs in multi-view, multi-task problems and limits the effectiveness of a vision transformer. Experiments show that SAT successfully exploits the relationships between landmarks and learns global morphological features, resulting in a mean absolute error of BAA that is 0.11 lower than that of the previous work. Additionally, the proposed SAT has four times fewer parameters than the ensemble of individual classifiers used in the previous work. Lastly, this work has implications for clinical practice, improving the accuracy and efficiency of BAA in diagnosing abnormal growth in adolescents.
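The abstract reports its result as a difference in mean absolute error (MAE). As a reminder of what that metric measures, here is a minimal Python sketch, not taken from the paper: the function name `mean_absolute_error` and the sample ages are hypothetical and purely illustrative.

```python
def mean_absolute_error(predicted, reference):
    """Average absolute difference between paired predictions and references."""
    if len(predicted) != len(reference):
        raise ValueError("inputs must have the same length")
    if not predicted:
        raise ValueError("inputs must not be empty")
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(predicted)


# Hypothetical bone ages (illustrative numbers only, not from the paper).
reference_ages = [10.0, 12.5, 14.0, 11.0]
predicted_ages = [10.5, 12.0, 14.5, 11.5]

mae = mean_absolute_error(predicted_ages, reference_ages)
print(f"MAE: {mae:.2f}")
```

A lower MAE means predictions sit closer to the reference values on average, so a reduction of 0.11 is a reduction in that average deviation.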
doi_str_mv 10.48550/arxiv.2303.16557
format Article
date 2023-03-29
rights http://arxiv.org/licenses/nonexclusive-distrib/1.0
oa free_for_read
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2303.16557
language eng
recordid cdi_arxiv_primary_2303_16557
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computer Vision and Pattern Recognition
title Self-accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T07%3A12%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Self-accumulative%20Vision%20Transformer%20for%20Bone%20Age%20Assessment%20Using%20the%20Sauvegrain%20Method&rft.au=Choi,%20Hong-Jun&rft.date=2023-03-29&rft_id=info:doi/10.48550/arxiv.2303.16557&rft_dat=%3Carxiv_GOX%3E2303_16557%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true