A Demand-Driven Perspective on Generative Audio AI
To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the...
Main authors: | Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae |
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae |
description | To achieve successful deployment of AI research, it is crucial to understand
the demands of the industry. In this paper, we present the results of a survey
conducted with professional audio engineers, in order to determine research
priorities and define various research tasks. We also summarize the current
challenges in audio quality and controllability based on the survey. Our
analysis emphasizes that the availability of datasets is currently the main
bottleneck for achieving high-quality audio generation. Finally, we suggest
potential solutions for some revealed issues with empirical evidence. |
doi_str_mv | 10.48550/arxiv.2307.04292 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2307_04292</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2307_04292</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-fd594aaf6032537598e08f113d196259780382056dec930871dbca2d8f1eb48a3</originalsourceid><addsrcrecordid>eNotjs2KwjAURrOZhXR8AFeTF2i9uWmaZFnq-AOCs3Bfrs0tFMZa4ijj26vV1ceBj8MRYqYgy50xMKf4310z1GAzyNHjRGApF3ykPqSL2F25lz8czwM3fw-Qp16uuOdII5WX0J1kufkUHy39nnn63kTsl9_7ap1ud6tNVW5TKiymbTA-J2oL0Gi0Nd4xuFYpHZQv0HjrQDsEUwRuvAZnVTg0hOHx4UPuSCfi66Udo-shdkeKt_oZX4_x-g7ebDzN</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Demand-Driven Perspective on Generative Audio AI</title><source>arXiv.org</source><creator>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</creator><creatorcontrib>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</creatorcontrib><description>To achieve successful deployment of AI research, it is crucial to understand
the demands of the industry. In this paper, we present the results of a survey
conducted with professional audio engineers, in order to determine research
priorities and define various research tasks. We also summarize the current
challenges in audio quality and controllability based on the survey. Our
analysis emphasizes that the availability of datasets is currently the main
bottleneck for achieving high-quality audio generation. Finally, we suggest
potential solutions for some revealed issues with empirical evidence.</description><identifier>DOI: 10.48550/arxiv.2307.04292</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence</subject><creationdate>2023-07</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2307.04292$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2307.04292$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Oh, Sangshin</creatorcontrib><creatorcontrib>Kang, Minsung</creatorcontrib><creatorcontrib>Moon, Hyeongi</creatorcontrib><creatorcontrib>Choi, Keunwoo</creatorcontrib><creatorcontrib>Chon, Ben Sangbae</creatorcontrib><title>A Demand-Driven Perspective on Generative Audio AI</title><description>To achieve successful deployment of AI research, it is crucial to understand
the demands of the industry. In this paper, we present the results of a survey
conducted with professional audio engineers, in order to determine research
priorities and define various research tasks. We also summarize the current
challenges in audio quality and controllability based on the survey. Our
analysis emphasizes that the availability of datasets is currently the main
bottleneck for achieving high-quality audio generation. Finally, we suggest
potential solutions for some revealed issues with empirical evidence.</description><subject>Computer Science - Artificial Intelligence</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotjs2KwjAURrOZhXR8AFeTF2i9uWmaZFnq-AOCs3Bfrs0tFMZa4ijj26vV1ceBj8MRYqYgy50xMKf4310z1GAzyNHjRGApF3ykPqSL2F25lz8czwM3fw-Qp16uuOdII5WX0J1kufkUHy39nnn63kTsl9_7ap1ud6tNVW5TKiymbTA-J2oL0Gi0Nd4xuFYpHZQv0HjrQDsEUwRuvAZnVTg0hOHx4UPuSCfi66Udo-shdkeKt_oZX4_x-g7ebDzN</recordid><startdate>20230709</startdate><enddate>20230709</enddate><creator>Oh, Sangshin</creator><creator>Kang, Minsung</creator><creator>Moon, Hyeongi</creator><creator>Choi, Keunwoo</creator><creator>Chon, Ben Sangbae</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230709</creationdate><title>A Demand-Driven Perspective on Generative Audio AI</title><author>Oh, Sangshin ; Kang, Minsung ; Moon, Hyeongi ; Choi, Keunwoo ; Chon, Ben Sangbae</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-fd594aaf6032537598e08f113d196259780382056dec930871dbca2d8f1eb48a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><toplevel>online_resources</toplevel><creatorcontrib>Oh, Sangshin</creatorcontrib><creatorcontrib>Kang, Minsung</creatorcontrib><creatorcontrib>Moon, Hyeongi</creatorcontrib><creatorcontrib>Choi, Keunwoo</creatorcontrib><creatorcontrib>Chon, Ben Sangbae</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Oh, Sangshin</au><au>Kang, Minsung</au><au>Moon, Hyeongi</au><au>Choi, Keunwoo</au><au>Chon, Ben 
Sangbae</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Demand-Driven Perspective on Generative Audio AI</atitle><date>2023-07-09</date><risdate>2023</risdate><abstract>To achieve successful deployment of AI research, it is crucial to understand
the demands of the industry. In this paper, we present the results of a survey
conducted with professional audio engineers, in order to determine research
priorities and define various research tasks. We also summarize the current
challenges in audio quality and controllability based on the survey. Our
analysis emphasizes that the availability of datasets is currently the main
bottleneck for achieving high-quality audio generation. Finally, we suggest
potential solutions for some revealed issues with empirical evidence.</abstract><doi>10.48550/arxiv.2307.04292</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2307.04292 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2307_04292 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence |
title | A Demand-Driven Perspective on Generative Audio AI |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T07%3A01%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Demand-Driven%20Perspective%20on%20Generative%20Audio%20AI&rft.au=Oh,%20Sangshin&rft.date=2023-07-09&rft_id=info:doi/10.48550/arxiv.2307.04292&rft_dat=%3Carxiv_GOX%3E2307_04292%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |