A Demand-Driven Perspective on Generative Audio AI

To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Oh, Sangshin, Kang, Minsung, Moon, Hyeongi, Choi, Keunwoo, Chon, Ben Sangbae
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Oh, Sangshin
Kang, Minsung
Moon, Hyeongi
Choi, Keunwoo
Chon, Ben Sangbae
description To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes that the availability of datasets is currently the main bottleneck for achieving high-quality audio generation. Finally, we suggest potential solutions for some revealed issues with empirical evidence.
doi_str_mv 10.48550/arxiv.2307.04292
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2307_04292</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2307_04292</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-fd594aaf6032537598e08f113d196259780382056dec930871dbca2d8f1eb48a3</originalsourceid><addsrcrecordid>eNotjs2KwjAURrOZhXR8AFeTF2i9uWmaZFnq-AOCs3Bfrs0tFMZa4ijj26vV1ceBj8MRYqYgy50xMKf4310z1GAzyNHjRGApF3ykPqSL2F25lz8czwM3fw-Qp16uuOdII5WX0J1kufkUHy39nnn63kTsl9_7ap1ud6tNVW5TKiymbTA-J2oL0Gi0Nd4xuFYpHZQv0HjrQDsEUwRuvAZnVTg0hOHx4UPuSCfi66Udo-shdkeKt_oZX4_x-g7ebDzN</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Demand-Driven Perspective on Generative Audio AI</title><source>arXiv.org</source><creator>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</creator><creatorcontrib>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</creatorcontrib><description>To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes that the availability of datasets is currently the main bottleneck for achieving high-quality audio generation. Finally, we suggest potential solutions for some revealed issues with empirical evidence.</description><identifier>DOI: 10.48550/arxiv.2307.04292</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence</subject><creationdate>2023-07</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2307.04292$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2307.04292$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Oh, Sangshin</creatorcontrib><creatorcontrib>Kang, Minsung</creatorcontrib><creatorcontrib>Moon, Hyeongi</creatorcontrib><creatorcontrib>Choi, Keunwoo</creatorcontrib><creatorcontrib>Chon, Ben Sangbae</creatorcontrib><title>A Demand-Driven Perspective on Generative Audio AI</title><description>To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes that the availability of datasets is currently the main bottleneck for achieving high-quality audio generation. Finally, we suggest potential solutions for some revealed issues with empirical evidence.</description><subject>Computer Science - Artificial Intelligence</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotjs2KwjAURrOZhXR8AFeTF2i9uWmaZFnq-AOCs3Bfrs0tFMZa4ijj26vV1ceBj8MRYqYgy50xMKf4310z1GAzyNHjRGApF3ykPqSL2F25lz8czwM3fw-Qp16uuOdII5WX0J1kufkUHy39nnn63kTsl9_7ap1ud6tNVW5TKiymbTA-J2oL0Gi0Nd4xuFYpHZQv0HjrQDsEUwRuvAZnVTg0hOHx4UPuSCfi66Udo-shdkeKt_oZX4_x-g7ebDzN</recordid><startdate>20230709</startdate><enddate>20230709</enddate><creator>Oh, Sangshin</creator><creator>Kang, Minsung</creator><creator>Moon, Hyeongi</creator><creator>Choi, Keunwoo</creator><creator>Chon, Ben Sangbae</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230709</creationdate><title>A Demand-Driven Perspective on Generative Audio AI</title><author>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-fd594aaf6032537598e08f113d196259780382056dec930871dbca2d8f1eb48a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><toplevel>online_resources</toplevel><creatorcontrib>Oh, Sangshin</creatorcontrib><creatorcontrib>Kang, Minsung</creatorcontrib><creatorcontrib>Moon, Hyeongi</creatorcontrib><creatorcontrib>Choi, Keunwoo</creatorcontrib><creatorcontrib>Chon, Ben Sangbae</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Oh, Sangshin</au><au>Kang, Minsung</au><au>Moon, Hyeongi</au><au>Choi, Keunwoo</au><au>Chon, Ben Sangbae</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Demand-Driven Perspective on Generative Audio AI</atitle><date>2023-07-09</date><risdate>2023</risdate><abstract>To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes that the availability of datasets is currently the main bottleneck for achieving high-quality audio generation. Finally, we suggest potential solutions for some revealed issues with empirical evidence.</abstract><doi>10.48550/arxiv.2307.04292</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2307.04292
ispartof
issn
language eng
recordid cdi_arxiv_primary_2307_04292
source arXiv.org
subjects Computer Science - Artificial Intelligence
title A Demand-Driven Perspective on Generative Audio AI
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T07%3A01%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Demand-Driven%20Perspective%20on%20Generative%20Audio%20AI&rft.au=Oh,%20Sangshin&rft.date=2023-07-09&rft_id=info:doi/10.48550/arxiv.2307.04292&rft_dat=%3Carxiv_GOX%3E2307_04292%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true