Electromyogram‐Based Lip‐Reading via Unobtrusive Dry Electrodes and Machine Learning Methods
Lip‐reading provides an effective speech communication interface for people with voice disorders and for intuitive human–machine interactions. Existing systems are generally challenged by bulkiness, obtrusiveness, and poor robustness against environmental interferences. The lack of a truly natural a...
Gespeichert in:
Veröffentlicht in: | Small (Weinheim an der Bergstrasse, Germany) Germany), 2023-04, Vol.19 (17), p.e2205058-n/a |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Lip‐reading provides an effective speech communication interface for people with voice disorders and for intuitive human–machine interactions. Existing systems are generally challenged by bulkiness, obtrusiveness, and poor robustness against environmental interferences. The lack of a truly natural and unobtrusive system for converting lip movements to speech precludes the continuous use and wide‐scale deployment of such devices. Here, the design of a hardware–software architecture to capture, analyze, and interpret lip movements associated with either normal or silent speech is presented. The system can recognize different and similar visemes. It is robust in a noisy or dark environment. Self‐adhesive, skin‐conformable, and semi‐transparent dry electrodes are developed to track high‐fidelity speech‐relevant electromyogram signals without impeding daily activities. The resulting skin‐like sensors can form seamless contact with the curvilinear and dynamic surfaces of the skin, which is crucial for a high signal‐to‐noise ratio and minimal interference. Machine learning algorithms are employed to decode electromyogram signals and convert them to spoken words. Finally, the applications of the developed lip‐reading system in augmented reality and medical service are demonstrated, which illustrate the great potential in immersive interaction and healthcare applications.
A lip‐reading system based on unobtrusive electrodes and machine learning methods is presented. The resulting self‐adhesive and skin‐conformable electrodes can track high‐fidelity speech‐relevant electromyogram signals. Even for very similar visemes, the lip‐reading system can recognize them and maintain robustness in noisy or dark environments. Additionally, based on the lip‐reading system, applications in augmented reality and healthcare are also presented. |
---|---|
ISSN: | 1613-6810 1613-6829 |
DOI: | 10.1002/smll.202205058 |