A two‐stage data cleansing method for bridge global positioning system monitoring data based on bi‐direction long and short term memory anomaly identification and conditional generative adversarial networks data repair

Summary Data cleansing is an essential approach for improving data quality. Therefore, it is the key to avoiding the false alarm of the monitoring system due to the anomaly of the data itself. Data cleansing consists of two parts: anomaly identification and anomaly repair. However, current research...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Structural control and health monitoring 2022-09, Vol.29 (9), p.n/a
Hauptverfasser: Yang, Kang, Ding, Youliang, Jiang, Huachen, Zhao, Hanwei, Luo, Gan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Summary Data cleansing is an essential approach for improving data quality. Therefore, it is the key to avoiding the false alarm of the monitoring system due to the anomaly of the data itself. Data cleansing consists of two parts: anomaly identification and anomaly repair. However, current research on data cleansing has mainly focused on anomaly identification and lacks efficient data repair methods. The key to data repair lies in sensor correlation models based on mapping relationships between sensors. To obtain a good inter‐sensor relationship model, it is first necessary to exclude anomalous data from the training data set used for modeling. Therefore, a two‐stage data cleansing framework for collaborative multi‐sensor repair is proposed. First, based on the analysis of anomalous features of GPS data, a bidirectional long‐ and short‐term memory (Bi‐LSTM) neural network model is adopted for data anomalies classification and localization. As a result, the data segment to be repaired is determined. Then, on the basis of all sensor data in the time range of the day before the target repair data segment, the data set for data repair is constructed by excluding the anomaly data segments in the data set with the help of the above anomaly identification results. Then, a conditional generation adversarial network (CGAN) is proposed to achieve data repair. Experimental validation shows that the two‐stage data cleansing method of identification followed by repair can accurately identify and repair GPS anomalies. Finally, several factors affecting the repair effect are discussed.
ISSN:1545-2255
1545-2263
DOI:10.1002/stc.2993