Automated tongue segmentation using deep encoder-decoder model

Bibliographic Details
Published in:	Multimedia tools and applications 2023-10, Vol.82 (24), p.37661-37686
Main Authors:	Kusakunniran, Worapan; Borwarnginn, Punyanuch; Imaromkul, Thanandon; Aukkapinyo, Kittinun; Thongkanchorn, Kittikhun; Wattanadhirach, Disathon; Mongkolluksamee, Sophon; Thammasudjarit, Ratchainant; Ritthipravat, Panrasee; Tuakta, Pimchanok; Benjapornlert, Paitoon
Format:	Article
Language:	English
Subjects:	Accuracy; Artificial neural networks; Automation; Classification; Coders; Color temperature; Computer Communication Networks; Computer Science; Data augmentation; Data Structures and Information Theory; Datasets; Encoders-Decoders; Gaussian process; Image segmentation; Medicine; Methods; Model accuracy; Multimedia; Multimedia Information Systems; Neural networks; Random noise; Special Purpose and Application-Based Systems; Tongue; Track 2: Medical Applications of Multimedia
Online Access:	Full text

Description
Summary:	This paper proposes a solution for tongue segmentation in images. The solution relies on a convolutional neural network: a deep U-Net with deep layers of encoder-decoder modules. The model is trained with a starting resolution of 512 x 512 pixels. To improve the segmentation performance of the trained model across recording environments, three main types of data augmentation are added to the training process: additive Gaussian noise, multiply and add to brightness, and change color temperature. These augmentations could also compensate for the inadequate number of samples in the limited datasets. The proposed method is evaluated using four metrics: Dice coefficient, mean IoU, Jaccard distance, and accuracy. The model is successfully trained on publicly available datasets and then transferred for testing on a self-collected dataset from a real-world environment.
ISSN:	1380-7501, 1573-7721
DOI:	10.1007/s11042-023-15061-1
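
Illustrative note on the evaluation metrics: the summary names four metrics (Dice coefficient, mean IoU, Jaccard distance, accuracy) but this record does not give their formulas. The sketch below uses the common textbook definitions for binary masks and is an assumption, not the paper's implementation. In particular, the function name `segmentation_metrics`, the choice to average IoU over the two classes (tongue and background), and the convention of returning 1.0 for an empty denominator are all hypothetical choices made for this example.

```python
def segmentation_metrics(pred, truth):
    """Compare two binary masks given as nested lists of 0/1 values.

    Assumed (textbook) definitions, not taken from the paper:
      Dice             = 2*TP / (2*TP + FP + FN)
      IoU (foreground) = TP / (TP + FP + FN)
      IoU (background) = TN / (TN + FP + FN)
      mean IoU         = average of the two class IoUs
      Jaccard distance = 1 - IoU (foreground)
      accuracy         = (TP + TN) / number of pixels
    """
    # Flatten both masks and coerce every pixel to 0 or 1.
    p = [int(bool(v)) for row in pred for v in row]
    t = [int(bool(v)) for row in truth for v in row]
    if not p or len(p) != len(t):
        raise ValueError("masks must be non-empty and the same size")

    # Confusion-matrix counts over all pixels.
    tp = sum(1 for a, b in zip(p, t) if a == 1 and b == 1)
    fp = sum(1 for a, b in zip(p, t) if a == 1 and b == 0)
    fn = sum(1 for a, b in zip(p, t) if a == 0 and b == 1)
    tn = sum(1 for a, b in zip(p, t) if a == 0 and b == 0)

    def ratio(num, den):
        # Assumed convention: an empty denominator counts as a perfect match.
        return num / den if den else 1.0

    iou_fg = ratio(tp, tp + fp + fn)
    iou_bg = ratio(tn, tn + fp + fn)
    return {
        "dice": ratio(2 * tp, 2 * tp + fp + fn),
        "mean_iou": (iou_fg + iou_bg) / 2,
        "jaccard_distance": 1.0 - iou_fg,
        "accuracy": (tp + tn) / len(p),
    }


# Toy 2 x 3 masks: 2 true positives, 1 false positive, 1 false negative, 2 true negatives.
predicted = [[0, 1, 1],
             [0, 1, 0]]
reference = [[0, 1, 0],
             [0, 1, 1]]
scores = segmentation_metrics(predicted, reference)
print(scores)
```

Under these definitions Dice and the foreground IoU are monotonically related (Dice = 2*IoU / (1 + IoU)), so the four numbers are not independent; reporting them together mainly eases comparison with other work.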