On the use of audio events for improving video scene segmentation

This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the propose...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sidiropoulos, P, Mezaris, V, Kompatsiaris, I, Meinedo, H, Bugalho, M, Trancoso, I
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4
container_issue
container_start_page 1
container_title
container_volume
creator Sidiropoulos, P
Mezaris, V
Kompatsiaris, I
Meinedo, H
Bugalho, M
Trancoso, I
description This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the proposed technique is built upon a recently proposed audio-visual scene segmentation approach that involves the construction of multiple scene transition graphs (STGs) that separately exploit information coming from different modalities. In the extension of the latter approach presented in this work, audio event detection results are introduced to the definition of an audio-based scene transition graph, while a visual-based scene transition graph is also defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5617686</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5617686</ieee_id><sourcerecordid>5617686</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-dde9ace44ce1ce5a8e18af023f008342b8d3c5e5003f307832b11c92ef70d4223</originalsourceid><addsrcrecordid>eNotjstqwzAQRVXaQtPUX9CNfsAwelnjZQh9QSCb7IMijVKVWgqWY-jf19Cu7uacw71hj4g9GCUR7C1reotCS60tatR3bCWFwdagVQ-sqfULAJRYBCVXbLPPfPokfq3ES-TuGlLhNFOeKo9l5Gm4jGVO-cznFKjw6ikTr3QeFsRNqeQndh_dd6Xmf9fs8Ppy2L63u_3bx3aza1MPUxsC9c6T1p6EJ-OQBLoIUkUAVFqeMChvyCzXogKLSp6E8L2kaCFoKdWaPf9lExEdL2Ma3PhzNJ2wHXbqF1kIRyo</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>On the use of audio events for improving video scene segmentation</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Sidiropoulos, P ; Mezaris, V ; Kompatsiaris, I ; Meinedo, H ; Bugalho, M ; Trancoso, I</creator><creatorcontrib>Sidiropoulos, P ; Mezaris, V ; Kompatsiaris, I ; Meinedo, H ; Bugalho, M ; Trancoso, I</creatorcontrib><description>This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the proposed technique is built upon a recently proposed audio-visual scene segmentation approach that involves the construction of multiple scene transition graphs (STGs) that separately exploit information coming from different modalities. In the extension of the latter approach presented in this work, audio event detection results are introduced to the definition of an audio-based scene transition graph, while a visual-based scene transition graph is also defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.</description><identifier>ISSN: 2158-5873</identifier><identifier>ISBN: 9781424478484</identifier><identifier>ISBN: 1424478480</identifier><identifier>EISBN: 8890532807</identifier><identifier>EISBN: 9788890532801</identifier><language>eng</language><publisher>IEEE</publisher><subject>Event detection ; Explosions ; Feature extraction ; Merging ; Semantics ; Streaming media ; Visualization</subject><ispartof>11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10, 2010, p.1-4</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5617686$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,54898</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5617686$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Sidiropoulos, P</creatorcontrib><creatorcontrib>Mezaris, V</creatorcontrib><creatorcontrib>Kompatsiaris, I</creatorcontrib><creatorcontrib>Meinedo, H</creatorcontrib><creatorcontrib>Bugalho, M</creatorcontrib><creatorcontrib>Trancoso, I</creatorcontrib><title>On the use of audio events for improving video scene segmentation</title><title>11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10</title><addtitle>WIAMIS</addtitle><description>This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the proposed technique is built upon a recently proposed audio-visual scene segmentation approach that involves the construction of multiple scene transition graphs (STGs) that separately exploit information coming from different modalities. In the extension of the latter approach presented in this work, audio event detection results are introduced to the definition of an audio-based scene transition graph, while a visual-based scene transition graph is also defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.</description><subject>Event detection</subject><subject>Explosions</subject><subject>Feature extraction</subject><subject>Merging</subject><subject>Semantics</subject><subject>Streaming media</subject><subject>Visualization</subject><issn>2158-5873</issn><isbn>9781424478484</isbn><isbn>1424478480</isbn><isbn>8890532807</isbn><isbn>9788890532801</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotjstqwzAQRVXaQtPUX9CNfsAwelnjZQh9QSCb7IMijVKVWgqWY-jf19Cu7uacw71hj4g9GCUR7C1reotCS60tatR3bCWFwdagVQ-sqfULAJRYBCVXbLPPfPokfq3ES-TuGlLhNFOeKo9l5Gm4jGVO-cznFKjw6ikTr3QeFsRNqeQndh_dd6Xmf9fs8Ppy2L63u_3bx3aza1MPUxsC9c6T1p6EJ-OQBLoIUkUAVFqeMChvyCzXogKLSp6E8L2kaCFoKdWaPf9lExEdL2Ma3PhzNJ2wHXbqF1kIRyo</recordid><startdate>201004</startdate><enddate>201004</enddate><creator>Sidiropoulos, P</creator><creator>Mezaris, V</creator><creator>Kompatsiaris, I</creator><creator>Meinedo, H</creator><creator>Bugalho, M</creator><creator>Trancoso, I</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201004</creationdate><title>On the use of audio events for improving video scene segmentation</title><author>Sidiropoulos, P ; Mezaris, V ; Kompatsiaris, I ; Meinedo, H ; Bugalho, M ; Trancoso, I</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-dde9ace44ce1ce5a8e18af023f008342b8d3c5e5003f307832b11c92ef70d4223</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Event detection</topic><topic>Explosions</topic><topic>Feature extraction</topic><topic>Merging</topic><topic>Semantics</topic><topic>Streaming media</topic><topic>Visualization</topic><toplevel>online_resources</toplevel><creatorcontrib>Sidiropoulos, P</creatorcontrib><creatorcontrib>Mezaris, V</creatorcontrib><creatorcontrib>Kompatsiaris, I</creatorcontrib><creatorcontrib>Meinedo, H</creatorcontrib><creatorcontrib>Bugalho, M</creatorcontrib><creatorcontrib>Trancoso, I</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sidiropoulos, P</au><au>Mezaris, V</au><au>Kompatsiaris, I</au><au>Meinedo, H</au><au>Bugalho, M</au><au>Trancoso, I</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>On the use of audio events for improving video scene segmentation</atitle><btitle>11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10</btitle><stitle>WIAMIS</stitle><date>2010-04</date><risdate>2010</risdate><spage>1</spage><epage>4</epage><pages>1-4</pages><issn>2158-5873</issn><isbn>9781424478484</isbn><isbn>1424478480</isbn><eisbn>8890532807</eisbn><eisbn>9788890532801</eisbn><abstract>This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the proposed technique is built upon a recently proposed audio-visual scene segmentation approach that involves the construction of multiple scene transition graphs (STGs) that separately exploit information coming from different modalities. In the extension of the latter approach presented in this work, audio event detection results are introduced to the definition of an audio-based scene transition graph, while a visual-based scene transition graph is also defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.</abstract><pub>IEEE</pub><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2158-5873
ispartof 11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10, 2010, p.1-4
issn 2158-5873
language eng
recordid cdi_ieee_primary_5617686
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Event detection
Explosions
Feature extraction
Merging
Semantics
Streaming media
Visualization
title On the use of audio events for improving video scene segmentation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T23%3A57%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=On%20the%20use%20of%20audio%20events%20for%20improving%20video%20scene%20segmentation&rft.btitle=11th%20International%20Workshop%20on%20Image%20Analysis%20for%20Multimedia%20Interactive%20Services%20WIAMIS%2010&rft.au=Sidiropoulos,%20P&rft.date=2010-04&rft.spage=1&rft.epage=4&rft.pages=1-4&rft.issn=2158-5873&rft.isbn=9781424478484&rft.isbn_list=1424478480&rft_id=info:doi/&rft_dat=%3Cieee_6IE%3E5617686%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=8890532807&rft.eisbn_list=9788890532801&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5617686&rfr_iscdi=true