Testing process for artificial intelligence applications in radiology practice

•A comprehensive testing process for AI applications in radiology was developed.•Systematic testing helps to identify biases, pitfalls, and unacceptable deviations.•A form template for AI evaluation was developed by our multidisciplinary team.•The described process is based on our experience with se...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Physica medica 2024-12, Vol.128, p.104842, Article 104842
Hauptverfasser: Ketola, Juuso H.J., Inkinen, Satu I., Mäkelä, Teemu, Syväranta, Suvi, Peltonen, Juha, Kaasalainen, Touko, Kortesniemi, Mika
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A comprehensive testing process for AI applications in radiology was developed.•Systematic testing helps to identify biases, pitfalls, and unacceptable deviations.•A form template for AI evaluation was developed by our multidisciplinary team.•The described process is based on our experience with several clinical AI algorithms.•Systematic testing of radiology AI applications ultimately benefits patient care. Artificial intelligence (AI) applications are becoming increasingly common in radiology. However, ensuring reliable operation and expected clinical benefits remains a challenge. A systematic testing process aims to facilitate clinical deployment by confirming software applicability to local patient populations, practises, adherence to regulatory and safety requirements, and compatibility with existing systems. In this work, we present our testing process developed based on practical experience. First, a survey and pre-evaluation is conducted, where information requests are sent for potential products, and the specifications are evaluated against predetermined requirements. In the second phase, data collection, testing, and analysis are conducted. In the retrospective stage, the application undergoes testing with a pre selected dataset and is evaluated against specified key performance indicators (KPIs). In the prospective stage, the application is integrated into the clinical workflow and evaluated with additional process-specific KPIs. In the final phase, the results are evaluated in terms of safety, effectiveness, productivity, and integration. The final report summarises the results and includes a procurement/deployment or rejection recommendation. The process allows termination at any phase if the application fails to meet essential criteria. In addition, we present practical remarks from our experiences in AI testing and provide forms to guide and document the testing process. The established AI testing process facilitates a systematic evaluation and documentation of new technologies ensuring that each application undergoes equal and sufficient validation. Testing with local data is crucial for identifying biases and pitfalls of AI algorithms to improve the quality and safety, ultimately benefiting patient care.
ISSN:1120-1797
1724-191X
1724-191X
DOI:10.1016/j.ejmp.2024.104842