Note segmentation and quantization for music information retrieval

Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as ca...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on audio, speech, and language processing speech, and language processing, 2006-01, Vol.14 (1), p.131-141
Hauptverfasser:	Adams, N.H., Bartsch, M.A., Wakefield, G.H.
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Audio databases Content based retrieval Degradation Estimators Exact sciences and technology Information retrieval Information, signal and communications theory Miscellaneous Multiple signal classification Music Music information retrieval Natural languages pitch pitch quantization Quantization Queries Query processing query-by-example Representations Retrieval Segmentation Signal processing Speech processing Studies Telecommunications and information theory
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	141
container_issue	1
container_start_page	131
container_title	IEEE transactions on audio, speech, and language processing
container_volume	14
creator	Adams, N.H. Bartsch, M.A. Wakefield, G.H.
description	Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as carefully. Common methods for query processing are based on musical intuition and historical momentum rather than specific performance criteria; existing systems often employ rudimentary note segmentation or coarse quantization of note estimates. In this work, we examine several alternative query processing methods as well as quantized melodic representations. One common difficulty with designing query-by-humming systems is the coupling between system components. We address this issue by measuring the performance of the query processing system both in isolation and coupled with a retrieval system. We first measure the segmentation performance of several note estimators. We then compute the retrieval accuracy of an experimental query-by-humming system that uses the various note estimators along with varying degrees of pitch and duration quantization. The results show that more advanced query processing can improve both segmentation performance and retrieval performance, although the best segmentation performance does not necessarily yield the best retrieval performance. Further, coarsely quantizing the melodic representation generally degrades retrieval accuracy.
doi_str_mv	10.1109/TSA.2005.854088
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_1561271</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1561271</ieee_id><sourcerecordid>889383491</sourcerecordid><originalsourceid>FETCH-LOGICAL-c350t-46a9c08c2a112d28cdfacf7c1ac278e1f6eff1112eb388c250641e2758ac6b3f3</originalsourceid><addsrcrecordid>eNpdkM1LAzEQxRdRsFbPHrwsgnjaNpOP3eyxFr-g6MF6Dmk6kZT9aJNdQf96U7YoeJrHvN8bhpckl0AmAKScLt9mE0qImEjBiZRHyQiEkFlRUn78qyE_Tc5C2BDCWc5hlNy9tB2mAT9qbDrdubZJdbNOd71uOvc9LGzr07oPzqSuiboeth477_BTV-fJidVVwIvDHCfvD_fL-VO2eH18ns8WmWGCdBnPdWmINFQD0DWVZm21sYUBbWghEWyO1kL0cMVkxASJDyIthNQmXzHLxsntcHfr212PoVO1CwarSjfY9kFJWTLJeAmRvP5HbtreN_E5VULBaaR4hKYDZHwbgkertt7V2n8pIGrfqIqNqn2jamg0Jm4OZ3UwurJeN8aFv1ghGCNQRu5q4Bwi_tkiB1oA-wEdtH7m</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>917428344</pqid></control><display><type>article</type><title>Note segmentation and quantization for music information retrieval</title><source>IEEE Electronic Library (IEL)</source><creator>Adams, N.H. ; Bartsch, M.A. ; Wakefield, G.H.</creator><creatorcontrib>Adams, N.H. ; Bartsch, M.A. ; Wakefield, G.H.</creatorcontrib><description>Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as carefully. Common methods for query processing are based on musical intuition and historical momentum rather than specific performance criteria; existing systems often employ rudimentary note segmentation or coarse quantization of note estimates. In this work, we examine several alternative query processing methods as well as quantized melodic representations. One common difficulty with designing query-by-humming systems is the coupling between system components. We address this issue by measuring the performance of the query processing system both in isolation and coupled with a retrieval system. We first measure the segmentation performance of several note estimators. We then compute the retrieval accuracy of an experimental query-by-humming system that uses the various note estimators along with varying degrees of pitch and duration quantization. The results show that more advanced query processing can improve both segmentation performance and retrieval performance, although the best segmentation performance does not necessarily yield the best retrieval performance. Further, coarsely quantizing the melodic representation generally degrades retrieval accuracy.</description><identifier>ISSN: 1558-7916</identifier><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 1558-7924</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TSA.2005.854088</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Audio databases ; Content based retrieval ; Degradation ; Estimators ; Exact sciences and technology ; Information retrieval ; Information, signal and communications theory ; Miscellaneous ; Multiple signal classification ; Music ; Music information retrieval ; Natural languages ; pitch ; pitch quantization ; Quantization ; Queries ; Query processing ; query-by-example ; Representations ; Retrieval ; Segmentation ; Signal processing ; Speech processing ; Studies ; Telecommunications and information theory</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2006-01, Vol.14 (1), p.131-141</ispartof><rights>2006 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c350t-46a9c08c2a112d28cdfacf7c1ac278e1f6eff1112eb388c250641e2758ac6b3f3</citedby><cites>FETCH-LOGICAL-c350t-46a9c08c2a112d28cdfacf7c1ac278e1f6eff1112eb388c250641e2758ac6b3f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1561271$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4009,27902,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1561271$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17533019$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Adams, N.H.</creatorcontrib><creatorcontrib>Bartsch, M.A.</creatorcontrib><creatorcontrib>Wakefield, G.H.</creatorcontrib><title>Note segmentation and quantization for music information retrieval</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as carefully. Common methods for query processing are based on musical intuition and historical momentum rather than specific performance criteria; existing systems often employ rudimentary note segmentation or coarse quantization of note estimates. In this work, we examine several alternative query processing methods as well as quantized melodic representations. One common difficulty with designing query-by-humming systems is the coupling between system components. We address this issue by measuring the performance of the query processing system both in isolation and coupled with a retrieval system. We first measure the segmentation performance of several note estimators. We then compute the retrieval accuracy of an experimental query-by-humming system that uses the various note estimators along with varying degrees of pitch and duration quantization. The results show that more advanced query processing can improve both segmentation performance and retrieval performance, although the best segmentation performance does not necessarily yield the best retrieval performance. Further, coarsely quantizing the melodic representation generally degrades retrieval accuracy.</description><subject>Applied sciences</subject><subject>Audio databases</subject><subject>Content based retrieval</subject><subject>Degradation</subject><subject>Estimators</subject><subject>Exact sciences and technology</subject><subject>Information retrieval</subject><subject>Information, signal and communications theory</subject><subject>Miscellaneous</subject><subject>Multiple signal classification</subject><subject>Music</subject><subject>Music information retrieval</subject><subject>Natural languages</subject><subject>pitch</subject><subject>pitch quantization</subject><subject>Quantization</subject><subject>Queries</subject><subject>Query processing</subject><subject>query-by-example</subject><subject>Representations</subject><subject>Retrieval</subject><subject>Segmentation</subject><subject>Signal processing</subject><subject>Speech processing</subject><subject>Studies</subject><subject>Telecommunications and information theory</subject><issn>1558-7916</issn><issn>2329-9290</issn><issn>1558-7924</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkM1LAzEQxRdRsFbPHrwsgnjaNpOP3eyxFr-g6MF6Dmk6kZT9aJNdQf96U7YoeJrHvN8bhpckl0AmAKScLt9mE0qImEjBiZRHyQiEkFlRUn78qyE_Tc5C2BDCWc5hlNy9tB2mAT9qbDrdubZJdbNOd71uOvc9LGzr07oPzqSuiboeth477_BTV-fJidVVwIvDHCfvD_fL-VO2eH18ns8WmWGCdBnPdWmINFQD0DWVZm21sYUBbWghEWyO1kL0cMVkxASJDyIthNQmXzHLxsntcHfr212PoVO1CwarSjfY9kFJWTLJeAmRvP5HbtreN_E5VULBaaR4hKYDZHwbgkertt7V2n8pIGrfqIqNqn2jamg0Jm4OZ3UwurJeN8aFv1ghGCNQRu5q4Bwi_tkiB1oA-wEdtH7m</recordid><startdate>200601</startdate><enddate>200601</enddate><creator>Adams, N.H.</creator><creator>Bartsch, M.A.</creator><creator>Wakefield, G.H.</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>200601</creationdate><title>Note segmentation and quantization for music information retrieval</title><author>Adams, N.H. ; Bartsch, M.A. ; Wakefield, G.H.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c350t-46a9c08c2a112d28cdfacf7c1ac278e1f6eff1112eb388c250641e2758ac6b3f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Applied sciences</topic><topic>Audio databases</topic><topic>Content based retrieval</topic><topic>Degradation</topic><topic>Estimators</topic><topic>Exact sciences and technology</topic><topic>Information retrieval</topic><topic>Information, signal and communications theory</topic><topic>Miscellaneous</topic><topic>Multiple signal classification</topic><topic>Music</topic><topic>Music information retrieval</topic><topic>Natural languages</topic><topic>pitch</topic><topic>pitch quantization</topic><topic>Quantization</topic><topic>Queries</topic><topic>Query processing</topic><topic>query-by-example</topic><topic>Representations</topic><topic>Retrieval</topic><topic>Segmentation</topic><topic>Signal processing</topic><topic>Speech processing</topic><topic>Studies</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adams, N.H.</creatorcontrib><creatorcontrib>Bartsch, M.A.</creatorcontrib><creatorcontrib>Wakefield, G.H.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Adams, N.H.</au><au>Bartsch, M.A.</au><au>Wakefield, G.H.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Note segmentation and quantization for music information retrieval</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2006-01</date><risdate>2006</risdate><volume>14</volume><issue>1</issue><spage>131</spage><epage>141</epage><pages>131-141</pages><issn>1558-7916</issn><issn>2329-9290</issn><eissn>1558-7924</eissn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Much research in music information retrieval has focused on query-by-humming systems, which search melodic databases using sung queries. The database retrieval aspect of such systems has received considerable attention, but query processing and the melodic representation have not been examined as carefully. Common methods for query processing are based on musical intuition and historical momentum rather than specific performance criteria; existing systems often employ rudimentary note segmentation or coarse quantization of note estimates. In this work, we examine several alternative query processing methods as well as quantized melodic representations. One common difficulty with designing query-by-humming systems is the coupling between system components. We address this issue by measuring the performance of the query processing system both in isolation and coupled with a retrieval system. We first measure the segmentation performance of several note estimators. We then compute the retrieval accuracy of an experimental query-by-humming system that uses the various note estimators along with varying degrees of pitch and duration quantization. The results show that more advanced query processing can improve both segmentation performance and retrieval performance, although the best segmentation performance does not necessarily yield the best retrieval performance. Further, coarsely quantizing the melodic representation generally degrades retrieval accuracy.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TSA.2005.854088</doi><tpages>11</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1558-7916
ispartof	IEEE transactions on audio, speech, and language processing, 2006-01, Vol.14 (1), p.131-141
issn	1558-7916 2329-9290 1558-7924 2329-9304
language	eng
recordid	cdi_ieee_primary_1561271
source	IEEE Electronic Library (IEL)
subjects	Applied sciences Audio databases Content based retrieval Degradation Estimators Exact sciences and technology Information retrieval Information, signal and communications theory Miscellaneous Multiple signal classification Music Music information retrieval Natural languages pitch pitch quantization Quantization Queries Query processing query-by-example Representations Retrieval Segmentation Signal processing Speech processing Studies Telecommunications and information theory
title	Note segmentation and quantization for music information retrieval
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T13%3A23%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Note%20segmentation%20and%20quantization%20for%20music%20information%20retrieval&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Adams,%20N.H.&rft.date=2006-01&rft.volume=14&rft.issue=1&rft.spage=131&rft.epage=141&rft.pages=131-141&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TSA.2005.854088&rft_dat=%3Cproquest_RIE%3E889383491%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=917428344&rft_id=info:pmid/&rft_ieee_id=1561271&rfr_iscdi=true