Extracting full-field subpixel structural displacements from videos via deep learning

•Presented a deep learning approach for extracting full-field subpixel displacements from videos.•Proposed two network architectures based on convolutional neural networks.•Developed a mask-regularized training scheme to constrain the network.•Demonstrated effectiveness and generalizability of the p...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of sound and vibration 2021-08, Vol.505, p.116142, Article 116142
Hauptverfasser:	Luan, Lele, Zheng, Jingwei, Wang, Ming L., Yang, Yongchao, Rizzo, Piervincenzo, Sun, Hao
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Computer architecture Convolution neural networks Deep learning Digital imaging Displacement Displacement measurement Extraction processes Neural networks Phase matching Phase-based displacement extraction Pixels Point contact Real time Sensors Structural health monitoring Subpixel motion field Template matching Texture Time measurement Tracking Video Video camera
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Presented a deep learning approach for extracting full-field subpixel displacements from videos.•Proposed two network architectures based on convolutional neural networks.•Developed a mask-regularized training scheme to constrain the network.•Demonstrated effectiveness and generalizability of the proposed networks via lab experiments. Conventional displacement sensing techniques (e.g., laser, linear variable differential transformer) have been widely used in structural health monitoring in the past two decades. Though these techniques are capable of measuring displacement time histories with high accuracy, distinct shortcoming remains such as point-to-point contact sensing which limits its applicability in real-world problems. Video cameras have been widely used in the past years due to advantages that include low price, agility, high spatial sensing resolution, and non-contact. Compared with target tracking approaches (e.g., digital image correlation, template matching, etc.), the phase-based method is powerful for detecting small subpixel motions without the use of paints or markers on the structure surface. Nevertheless, the complex computational procedure limits its real-time inference capacity. To address this fundamental issue, we develop a deep learning framework based on convolutional neural networks (CNNs) that enable real-time extraction of full-field subpixel structural displacements from videos. In particular, two new CNN architectures are designed and trained on a dataset generated by the phase-based motion extraction method from a single lab-recorded high-speed video of a dynamic structure. As displacement is only reliable in the regions with sufficient texture contrast, the sparsity of motion field induced by the texture mask is considered via the network architecture design and loss function definition. Results show that, with the supervision of full and sparse motion field, the trained network is capable of identifying the pixels with sufficient texture contrast as well as their subpixel motions. The performance of the trained networks is tested on various videos of other structures to extract the full-field motion (e.g., displacement time histories), which indicates that the trained networks have generalizability to accurately extract full-field subpixel displacements for pixels with sufficient texture contrast.
ISSN:	0022-460X 1095-8568
DOI:	10.1016/j.jsv.2021.116142