Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot
[abstFig src='/00290001/19.jpg' width='300' text='Human-voice enhancement system for a hose-shaped robot' ] This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphone-equipped hose-sh...
Gespeichert in:
Veröffentlicht in: | Journal of robotics and mechatronics 2017-02, Vol.29 (1), p.198-212 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 212 |
---|---|
container_issue | 1 |
container_start_page | 198 |
container_title | Journal of robotics and mechatronics |
container_volume | 29 |
creator | Bando, Yoshiaki Saruwatari, Hiroshi Ono, Nobutaka Makino, Shoji Itoyama, Katsutoshi Kitamura, Daichi Ishimura, Masaru Takakusaki, Moe Mae, Narumi Yamaoka, Kouei Matsui, Yutaro Ambe, Yuichi Konyo, Masashi Tadokoro, Satoshi Yoshii, Kazuyoshi Okuno, Hiroshi G. |
description | [abstFig src='/00290001/19.jpg' width='300' text='Human-voice enhancement system for a hose-shaped robot' ] This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphone-equipped hose-shaped robot is used to search for a victim under a collapsed building, human-voice enhancement is crucial because the sound captured by a microphone array is contaminated by the ego-noise of the robot. For achieving both low latency and high quality, our system combines online and offline human-voice enhancement, providing
an overview first and then details on demand
. The online enhancement is used for searching for a victim in real time, while the offline one facilitates scrutiny by listening to highly enhanced human voices. Our online enhancement is based on an online robust principal component analysis, and our offline enhancement is based on an independent low-rank matrix analysis. The two enhancement methods are integrated with Robot Operating System (ROS). Experimental results showed that both the online and offline enhancement methods outperformed conventional methods. |
doi_str_mv | 10.20965/jrm.2017.p0198 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2465811335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2465811335</sourcerecordid><originalsourceid>FETCH-LOGICAL-c460t-9be8a9d89640825c0bc40e74c82ef2518358e5214b24f69093fe59caff46208e3</originalsourceid><addsrcrecordid>eNotkE1Lw0AYhBdRsNSevS543nY_k92jlGqEgNhWr2GzedOmNNmYTSj598bqXGYOwww8CD0yuuTURGp16uopsXjZUmb0DZoxrQXRVJpbNKOGKSKM5PdoEcKJTlIyNiKeoSL1F5zaHho3YtsUOKkOR_wx2HPVj3h_8WTX2wPgZKhtQ7585YBsmqNtHNTQ9Hg3hh5qXPoOW5z4AGR3tC0UeAvBDYC3Pvf9A7or7TnA4t_n6PNls18nJH1_fVs_p8TJiPbE5KCtKbSJJNVcOZo7SSGWTnMouWJaKA2KM5lzWUaGGlGCMs6WpYw41SDm6Olvt-389wChz05-6JrpMuMyUpoxIdTUWv21XOdD6KDM2q6qbTdmjGZXmtlEM_ulmV1pih9EC2c8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2465811335</pqid></control><display><type>article</type><title>Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot</title><source>DOAJ Directory of Open Access Journals</source><source>J-STAGE (Japan Science & Technology Information Aggregator, Electronic) Freely Available Titles - Japanese</source><source>Open Access Titles of Japan</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Bando, Yoshiaki ; Saruwatari, Hiroshi ; Ono, Nobutaka ; Makino, Shoji ; Itoyama, Katsutoshi ; Kitamura, Daichi ; Ishimura, Masaru ; Takakusaki, Moe ; Mae, Narumi ; Yamaoka, Kouei ; Matsui, Yutaro ; Ambe, Yuichi ; Konyo, Masashi ; Tadokoro, Satoshi ; Yoshii, Kazuyoshi ; Okuno, Hiroshi G.</creator><creatorcontrib>Bando, Yoshiaki ; Saruwatari, Hiroshi ; Ono, Nobutaka ; Makino, Shoji ; Itoyama, Katsutoshi ; Kitamura, Daichi ; Ishimura, Masaru ; Takakusaki, Moe ; Mae, Narumi ; Yamaoka, Kouei ; Matsui, Yutaro ; Ambe, Yuichi ; Konyo, Masashi ; Tadokoro, Satoshi ; Yoshii, Kazuyoshi ; Okuno, Hiroshi G. ; Graduate School of Informatics, Kyoto University ; Graduate School of Information Science, Tohoku University ; Graduate School of Information Science and Technology, The University of Tokyo ; National Institute of Informatics ; Department of Informatics, School of Multidisciplinary Sciences, SOKENDAI ; Graduate Program for Embodiment Informatics, Waseda University ; Graduate School of Systems and Information Engineering, Tsukuba University</creatorcontrib><description>[abstFig src='/00290001/19.jpg' width='300' text='Human-voice enhancement system for a hose-shaped robot' ] This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphone-equipped hose-shaped robot is used to search for a victim under a collapsed building, human-voice enhancement is crucial because the sound captured by a microphone array is contaminated by the ego-noise of the robot. For achieving both low latency and high quality, our system combines online and offline human-voice enhancement, providing
an overview first and then details on demand
. The online enhancement is used for searching for a victim in real time, while the offline one facilitates scrutiny by listening to highly enhanced human voices. Our online enhancement is based on an online robust principal component analysis, and our offline enhancement is based on an independent low-rank matrix analysis. The two enhancement methods are integrated with Robot Operating System (ROS). Experimental results showed that both the online and offline enhancement methods outperformed conventional methods.</description><identifier>ISSN: 0915-3942</identifier><identifier>EISSN: 1883-8049</identifier><identifier>DOI: 10.20965/jrm.2017.p0198</identifier><language>eng</language><publisher>Tokyo: Fuji Technology Press Co. Ltd</publisher><subject>Matrix methods ; Principal components analysis ; Robots ; Voice</subject><ispartof>Journal of robotics and mechatronics, 2017-02, Vol.29 (1), p.198-212</ispartof><rights>Copyright © 2017 Fuji Technology Press Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c460t-9be8a9d89640825c0bc40e74c82ef2518358e5214b24f69093fe59caff46208e3</citedby><cites>FETCH-LOGICAL-c460t-9be8a9d89640825c0bc40e74c82ef2518358e5214b24f69093fe59caff46208e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,864,27924,27925</link.rule.ids></links><search><creatorcontrib>Bando, Yoshiaki</creatorcontrib><creatorcontrib>Saruwatari, Hiroshi</creatorcontrib><creatorcontrib>Ono, Nobutaka</creatorcontrib><creatorcontrib>Makino, Shoji</creatorcontrib><creatorcontrib>Itoyama, Katsutoshi</creatorcontrib><creatorcontrib>Kitamura, Daichi</creatorcontrib><creatorcontrib>Ishimura, Masaru</creatorcontrib><creatorcontrib>Takakusaki, Moe</creatorcontrib><creatorcontrib>Mae, Narumi</creatorcontrib><creatorcontrib>Yamaoka, Kouei</creatorcontrib><creatorcontrib>Matsui, Yutaro</creatorcontrib><creatorcontrib>Ambe, Yuichi</creatorcontrib><creatorcontrib>Konyo, Masashi</creatorcontrib><creatorcontrib>Tadokoro, Satoshi</creatorcontrib><creatorcontrib>Yoshii, Kazuyoshi</creatorcontrib><creatorcontrib>Okuno, Hiroshi G.</creatorcontrib><creatorcontrib>Graduate School of Informatics, Kyoto University</creatorcontrib><creatorcontrib>Graduate School of Information Science, Tohoku University</creatorcontrib><creatorcontrib>Graduate School of Information Science and Technology, The University of Tokyo</creatorcontrib><creatorcontrib>National Institute of Informatics</creatorcontrib><creatorcontrib>Department of Informatics, School of Multidisciplinary Sciences, SOKENDAI</creatorcontrib><creatorcontrib>Graduate Program for Embodiment Informatics, Waseda University</creatorcontrib><creatorcontrib>Graduate School of Systems and Information Engineering, Tsukuba University</creatorcontrib><title>Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot</title><title>Journal of robotics and mechatronics</title><description>[abstFig src='/00290001/19.jpg' width='300' text='Human-voice enhancement system for a hose-shaped robot' ] This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphone-equipped hose-shaped robot is used to search for a victim under a collapsed building, human-voice enhancement is crucial because the sound captured by a microphone array is contaminated by the ego-noise of the robot. For achieving both low latency and high quality, our system combines online and offline human-voice enhancement, providing
an overview first and then details on demand
. The online enhancement is used for searching for a victim in real time, while the offline one facilitates scrutiny by listening to highly enhanced human voices. Our online enhancement is based on an online robust principal component analysis, and our offline enhancement is based on an independent low-rank matrix analysis. The two enhancement methods are integrated with Robot Operating System (ROS). Experimental results showed that both the online and offline enhancement methods outperformed conventional methods.</description><subject>Matrix methods</subject><subject>Principal components analysis</subject><subject>Robots</subject><subject>Voice</subject><issn>0915-3942</issn><issn>1883-8049</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNotkE1Lw0AYhBdRsNSevS543nY_k92jlGqEgNhWr2GzedOmNNmYTSj598bqXGYOwww8CD0yuuTURGp16uopsXjZUmb0DZoxrQXRVJpbNKOGKSKM5PdoEcKJTlIyNiKeoSL1F5zaHho3YtsUOKkOR_wx2HPVj3h_8WTX2wPgZKhtQ7585YBsmqNtHNTQ9Hg3hh5qXPoOW5z4AGR3tC0UeAvBDYC3Pvf9A7or7TnA4t_n6PNls18nJH1_fVs_p8TJiPbE5KCtKbSJJNVcOZo7SSGWTnMouWJaKA2KM5lzWUaGGlGCMs6WpYw41SDm6Olvt-389wChz05-6JrpMuMyUpoxIdTUWv21XOdD6KDM2q6qbTdmjGZXmtlEM_ulmV1pih9EC2c8</recordid><startdate>20170201</startdate><enddate>20170201</enddate><creator>Bando, Yoshiaki</creator><creator>Saruwatari, Hiroshi</creator><creator>Ono, Nobutaka</creator><creator>Makino, Shoji</creator><creator>Itoyama, Katsutoshi</creator><creator>Kitamura, Daichi</creator><creator>Ishimura, Masaru</creator><creator>Takakusaki, Moe</creator><creator>Mae, Narumi</creator><creator>Yamaoka, Kouei</creator><creator>Matsui, Yutaro</creator><creator>Ambe, Yuichi</creator><creator>Konyo, Masashi</creator><creator>Tadokoro, Satoshi</creator><creator>Yoshii, Kazuyoshi</creator><creator>Okuno, Hiroshi G.</creator><general>Fuji Technology Press Co. Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>20170201</creationdate><title>Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot</title><author>Bando, Yoshiaki ; Saruwatari, Hiroshi ; Ono, Nobutaka ; Makino, Shoji ; Itoyama, Katsutoshi ; Kitamura, Daichi ; Ishimura, Masaru ; Takakusaki, Moe ; Mae, Narumi ; Yamaoka, Kouei ; Matsui, Yutaro ; Ambe, Yuichi ; Konyo, Masashi ; Tadokoro, Satoshi ; Yoshii, Kazuyoshi ; Okuno, Hiroshi G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c460t-9be8a9d89640825c0bc40e74c82ef2518358e5214b24f69093fe59caff46208e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Matrix methods</topic><topic>Principal components analysis</topic><topic>Robots</topic><topic>Voice</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bando, Yoshiaki</creatorcontrib><creatorcontrib>Saruwatari, Hiroshi</creatorcontrib><creatorcontrib>Ono, Nobutaka</creatorcontrib><creatorcontrib>Makino, Shoji</creatorcontrib><creatorcontrib>Itoyama, Katsutoshi</creatorcontrib><creatorcontrib>Kitamura, Daichi</creatorcontrib><creatorcontrib>Ishimura, Masaru</creatorcontrib><creatorcontrib>Takakusaki, Moe</creatorcontrib><creatorcontrib>Mae, Narumi</creatorcontrib><creatorcontrib>Yamaoka, Kouei</creatorcontrib><creatorcontrib>Matsui, Yutaro</creatorcontrib><creatorcontrib>Ambe, Yuichi</creatorcontrib><creatorcontrib>Konyo, Masashi</creatorcontrib><creatorcontrib>Tadokoro, Satoshi</creatorcontrib><creatorcontrib>Yoshii, Kazuyoshi</creatorcontrib><creatorcontrib>Okuno, Hiroshi G.</creatorcontrib><creatorcontrib>Graduate School of Informatics, Kyoto University</creatorcontrib><creatorcontrib>Graduate School of Information Science, Tohoku University</creatorcontrib><creatorcontrib>Graduate School of Information Science and Technology, The University of Tokyo</creatorcontrib><creatorcontrib>National Institute of Informatics</creatorcontrib><creatorcontrib>Department of Informatics, School of Multidisciplinary Sciences, SOKENDAI</creatorcontrib><creatorcontrib>Graduate Program for Embodiment Informatics, Waseda University</creatorcontrib><creatorcontrib>Graduate School of Systems and Information Engineering, Tsukuba University</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Journal of robotics and mechatronics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bando, Yoshiaki</au><au>Saruwatari, Hiroshi</au><au>Ono, Nobutaka</au><au>Makino, Shoji</au><au>Itoyama, Katsutoshi</au><au>Kitamura, Daichi</au><au>Ishimura, Masaru</au><au>Takakusaki, Moe</au><au>Mae, Narumi</au><au>Yamaoka, Kouei</au><au>Matsui, Yutaro</au><au>Ambe, Yuichi</au><au>Konyo, Masashi</au><au>Tadokoro, Satoshi</au><au>Yoshii, Kazuyoshi</au><au>Okuno, Hiroshi G.</au><aucorp>Graduate School of Informatics, Kyoto University</aucorp><aucorp>Graduate School of Information Science, Tohoku University</aucorp><aucorp>Graduate School of Information Science and Technology, The University of Tokyo</aucorp><aucorp>National Institute of Informatics</aucorp><aucorp>Department of Informatics, School of Multidisciplinary Sciences, SOKENDAI</aucorp><aucorp>Graduate Program for Embodiment Informatics, Waseda University</aucorp><aucorp>Graduate School of Systems and Information Engineering, Tsukuba University</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot</atitle><jtitle>Journal of robotics and mechatronics</jtitle><date>2017-02-01</date><risdate>2017</risdate><volume>29</volume><issue>1</issue><spage>198</spage><epage>212</epage><pages>198-212</pages><issn>0915-3942</issn><eissn>1883-8049</eissn><abstract>[abstFig src='/00290001/19.jpg' width='300' text='Human-voice enhancement system for a hose-shaped robot' ] This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphone-equipped hose-shaped robot is used to search for a victim under a collapsed building, human-voice enhancement is crucial because the sound captured by a microphone array is contaminated by the ego-noise of the robot. For achieving both low latency and high quality, our system combines online and offline human-voice enhancement, providing
an overview first and then details on demand
. The online enhancement is used for searching for a victim in real time, while the offline one facilitates scrutiny by listening to highly enhanced human voices. Our online enhancement is based on an online robust principal component analysis, and our offline enhancement is based on an independent low-rank matrix analysis. The two enhancement methods are integrated with Robot Operating System (ROS). Experimental results showed that both the online and offline enhancement methods outperformed conventional methods.</abstract><cop>Tokyo</cop><pub>Fuji Technology Press Co. Ltd</pub><doi>10.20965/jrm.2017.p0198</doi><tpages>15</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0915-3942 |
ispartof | Journal of robotics and mechatronics, 2017-02, Vol.29 (1), p.198-212 |
issn | 0915-3942 1883-8049 |
language | eng |
recordid | cdi_proquest_journals_2465811335 |
source | DOAJ Directory of Open Access Journals; J-STAGE (Japan Science & Technology Information Aggregator, Electronic) Freely Available Titles - Japanese; Open Access Titles of Japan; EZB-FREE-00999 freely available EZB journals |
subjects | Matrix methods Principal components analysis Robots Voice |
title | Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T18%3A13%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Low%20Latency%20and%20High%20Quality%20Two-Stage%20Human-Voice-Enhancement%20System%20for%20a%20Hose-Shaped%20Rescue%20Robot&rft.jtitle=Journal%20of%20robotics%20and%20mechatronics&rft.au=Bando,%20Yoshiaki&rft.aucorp=Graduate%20School%20of%20Informatics,%20Kyoto%20University&rft.date=2017-02-01&rft.volume=29&rft.issue=1&rft.spage=198&rft.epage=212&rft.pages=198-212&rft.issn=0915-3942&rft.eissn=1883-8049&rft_id=info:doi/10.20965/jrm.2017.p0198&rft_dat=%3Cproquest_cross%3E2465811335%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2465811335&rft_id=info:pmid/&rfr_iscdi=true |