Multi-Level Objective Alignment Transformer for Fine-Grained Oral Panoramic X-Ray Report Generation

Automatically generated oral panoramic X-ray report is highly beneficial for improving the efficiency of dental diagnosis. However, recent solutions adopt holistic methods, resulting in a cursory description of the oral condition. This may lead to reports lacking details, such as specific sites or l...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on multimedia 2024, Vol.26, p.7462-7474
Hauptverfasser:	Gao, Nan, Yao, Renyuan, Liang, Ronghua, Chen, Peng, Liu, Tianshuang, Dang, Yuanjie
Format:	Artikel
Sprache:	eng
Schlagworte:	Alignment Bones Bridge-tower connection Datasets Diseases Feature extraction fine-grained image report captioning Image quality Irrigation Medical imaging oral panoramic X-ray positional alignment graph radiology report generation Teeth Transformers Visualization X-ray imaging
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Automatically generated oral panoramic X-ray report is highly beneficial for improving the efficiency of dental diagnosis. However, recent solutions adopt holistic methods, resulting in a cursory description of the oral condition. This may lead to reports lacking details, such as specific sites or lesion contours. Therefore, we propose a Multi-Level objective Alignment Transformer(MLAT) network, which integrates all tooth and disease objects into a positional alignment graph to extract fine-grained object-level features. Specifically, we introduce a novel Object-Level Collaborative Encoder (OLCE) module, which uses a positional alignment graph to construct object relationships. OLCE enhances object-level feature extraction by eliminating interference information between pathologically unrelated objects. In addition, we build a high-quality panoramic X-ray image-report dataset consisting of 562 sets of images and reports labeled by 13 experienced dental specialists. Experiments on the collected dataset show that the proposed MLAT significantly outperforms the state-of-the-art baselines by more than 5% in 4 different metrics, including BLEUs, Meteor, Rouge, and BERTScore.
ISSN:	1520-9210 1941-0077
DOI:	10.1109/TMM.2024.3368922