Construction of an PFT database with various clinical information using optical character recognition and regular expression technique
Published in: | Inteonet jeongbo hakoe nonmunji = Journal of Korean Society for Internet Information 2017-10, Vol.18 (5), p.55-60 |
---|---|
Main authors: | , |
Format: | Article |
Language: | kor |
Subjects: | |
Online access: | Full text |
Summary: | The pulmonary function test (PFT) is an essential data source for evaluating the effect of drugs on the lungs and the status of lung function. However, the numeric values of PFTs cannot easily be used in clinical studies without labor-intensive manual effort, because PFTs are usually recorded as image files. This study aimed to construct a de-identified, open-access PFT database with various clinical information. To construct the database, optical character recognition (OCR), regular expressions, and parsing techniques were used to extract alphanumeric data from PFT images recorded at a Korean tertiary teaching hospital. This longitudinal observational database contains 413,000 PFT measurements from 183,000 patients. |
---|---|
ISSN: | 1598-0170 2287-1136 |
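
The summary names OCR, regular expressions, and parsing as the extraction steps but gives no detail on how they were combined. The following is a minimal sketch of the regular-expression and parsing stage only, assuming the OCR step has already turned a report image into plain text. The report layout, the sample text, the character-substitution table, and the names `SAMPLE_OCR_TEXT`, `ROW_PATTERN`, and `parse_pft_text` are all hypothetical and are not taken from the article.

```python
import re

# Hypothetical OCR output for one PFT report. The layout and the meaning of
# the numeric columns are assumptions made for illustration only.
SAMPLE_OCR_TEXT = """\
FVC (L)        3.85   4.10   94
FEV1 (L)       2.9l   3.40   86
FEV1/FVC (%)   76     80
"""

# Assumed OCR confusions inside numeric fields (letter read instead of digit).
OCR_DIGIT_FIXES = str.maketrans({"l": "1", "I": "1", "O": "0", "o": "0"})

# One report row: a known parameter name, a unit in parentheses, then values.
# The longest name comes first so "FEV1/FVC" is not matched as "FEV1".
ROW_PATTERN = re.compile(
    r"^(?P<name>FEV1/FVC|FVC|FEV1)\s*\((?P<unit>[^)]+)\)\s+(?P<values>.+)$"
)

NUMBER_PATTERN = re.compile(r"\d+(?:\.\d+)?")


def parse_pft_text(text):
    """Return {parameter: {"unit": str, "values": [float, ...]}} from OCR text."""
    results = {}
    for line in text.splitlines():
        match = ROW_PATTERN.match(line.strip())
        if match is None:
            continue  # skip lines that are not recognised parameter rows
        numbers = []
        for token in match.group("values").split():
            cleaned = token.translate(OCR_DIGIT_FIXES)
            # Keep only tokens that are clean numbers after the substitution.
            if NUMBER_PATTERN.fullmatch(cleaned):
                numbers.append(float(cleaned))
        results[match.group("name")] = {
            "unit": match.group("unit"),
            "values": numbers,
        }
    return results


if __name__ == "__main__":
    for name, row in parse_pft_text(SAMPLE_OCR_TEXT).items():
        print(name, row["unit"], row["values"])
```

Restricting the pattern to a fixed list of parameter names keeps unrelated text, such as patient identifiers, out of the parsed output, which is one plausible way to support de-identification; whether the authors did this is not stated in the record.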