Automated tongue segmentation using deep encoder-decoder model
This paper proposes a solution of tongue segmentation in images. The solution relies on a convolutional neural network, using deep U-Net with deep layers of encoder-decoder modules. The model is trained with a starting resolution of 512 x 512 pixels. To enhance the segmentation performances of the t...
Gespeichert in:
Veröffentlicht in: | Multimedia tools and applications 2023-10, Vol.82 (24), p.37661-37686 |
---|---|
Hauptverfasser: | , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper proposes a solution of tongue segmentation in images. The solution relies on a convolutional neural network, using deep U-Net with deep layers of encoder-decoder modules. The model is trained with a starting resolution of 512 x 512 pixels. To enhance the segmentation performances of the trained model across recording environments, three main types of data augmentations are added in the training process, including additive gaussian noise, multiply and add to brightness, and change color temperature. They could also handle an inadequate number of data samples in the limited datasets. The proposed method is evaluated based on four measurement metrics of Dice coefficient, mean IoU, Jaccard distance, and accuracy. The model is successfully trained on publicly available datasets, and then transferred to be tested with the self-collected dataset in the real-world environment. |
---|---|
ISSN: | 1380-7501 1573-7721 |
DOI: | 10.1007/s11042-023-15061-1 |