Detection and representation of scenes in videos

This paper presents a method to perform a high-level segmentation of videos into scenes. A scene can be defined as a subdivision of a play in which either the setting is fixed, or when it presents continuous action in one place. We exploit this fact and propose a novel approach for clustering shots...

Detailed description

Bibliographic details
Published in:	IEEE transactions on multimedia 2005-12, Vol.7 (6), p.1097-1105
Main authors:	Rasheed, Z.; Shah, M.
Format:	Article
Language:	English
Subjects:	Algorithms; Applied sciences; Artificial intelligence; Cable TV; Cameras; Clustering; Computer science; control theory; systems; Digital libraries; Exact sciences and technology; Graph partitioning; Graphs; Image segmentation; Internet; key-frames; Layout; Motion pictures; normalized cuts; Partitioning; Partitioning algorithms; Pattern recognition. Digital image processing. Computational geometry; scene; Shot; Similarity; Software libraries; video segmentation; Videos

container_end_page	1105
container_issue	6
container_start_page	1097
container_title	IEEE transactions on multimedia
container_volume	7
creator	Rasheed, Z.; Shah, M.
description	This paper presents a method to perform a high-level segmentation of videos into scenes. A scene can be defined as a subdivision of a play in which either the setting is fixed, or when it presents continuous action in one place. We exploit this fact and propose a novel approach for clustering shots into scenes by transforming this task into a graph partitioning problem. This is achieved by constructing a weighted undirected graph called a shot similarity graph (SSG), where each node represents a shot and the edges between the shots are weighted by their similarity based on color and motion information. The SSG is then split into subgraphs by applying the normalized cuts for graph partitioning. The partitions so obtained represent individual scenes in the video. When clustering the shots, we consider the global similarities of shots rather than the individual shot pairs. We also propose a method to describe the content of each scene by selecting one representative image from the video as a scene key-frame. Recently, DVDs have become available with a chapter selection option where each chapter is represented by one image. Our algorithm automates this objective which is useful for applications such as video-on-demand, digital libraries, and the Internet. Experiments are presented with promising results on several Hollywood movies and one sitcom.
doi_str_mv	10.1109/TMM.2005.858392
format	Article
coden	ITMUF8
creationdate	2005-12-01
ieee_id	1542086
linktohtml	https://ieeexplore.ieee.org/document/1542086
publisher	New York, NY: IEEE
rights	2006 INIST-CNRS; Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005
toplevel	peer_reviewed; online_resources
tpages	9
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-9210
ispartof	IEEE transactions on multimedia, 2005-12, Vol.7 (6), p.1097-1105
issn	1520-9210 1941-0077
language	eng
recordid	cdi_pascalfrancis_primary_17295724
source	IEEE Electronic Library (IEL)
subjects	Algorithms; Applied sciences; Artificial intelligence; Cable TV; Cameras; Clustering; Computer science; control theory; systems; Digital libraries; Exact sciences and technology; Graph partitioning; Graphs; Image segmentation; Internet; key-frames; Layout; Motion pictures; normalized cuts; Partitioning; Partitioning algorithms; Pattern recognition. Digital image processing. Computational geometry; scene; Shot; Similarity; Software libraries; video segmentation; Videos
title	Detection and representation of scenes in videos
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T07%3A50%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Detection%20and%20representation%20of%20scenes%20in%20videos&rft.jtitle=IEEE%20transactions%20on%20multimedia&rft.au=Rasheed,%20Z.&rft.date=2005-12-01&rft.volume=7&rft.issue=6&rft.spage=1097&rft.epage=1105&rft.pages=1097-1105&rft.issn=1520-9210&rft.eissn=1941-0077&rft.coden=ITMUF8&rft_id=info:doi/10.1109/TMM.2005.858392&rft_dat=%3Cproquest_RIE%3E896202626%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=869095734&rft_id=info:pmid/&rft_ieee_id=1542086&rfr_iscdi=true
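
Illustrative sketch (not part of the catalog record): the description field above says that shots are nodes of a weighted undirected shot similarity graph (SSG) and that scenes are obtained by partitioning it with normalized cuts. The following minimal Python example shows what a single normalized-cut bipartition of such a graph could look like. Everything specific in it is an assumption made for illustration: the function names, the hand-written similarity values, and the restriction to splits that keep shots in temporal order. The record does not state how the paper computes its color and motion similarities or how it searches for the cut, so this is not the authors' implementation.

```python
def ncut_value(w, k):
    """Normalized-cut cost of splitting shots [0, k) from shots [k, n).

    w is a symmetric n x n list of lists of non-negative similarities
    (hypothetical values standing in for color/motion similarity).
    """
    n = len(w)
    part_a = range(k)
    part_b = range(k, n)
    # Total weight of edges crossing the split.
    cut = sum(w[i][j] for i in part_a for j in part_b)
    # Total weight from each part to every node in the graph.
    assoc_a = sum(w[i][j] for i in part_a for j in range(n))
    assoc_b = sum(w[i][j] for i in part_b for j in range(n))
    if assoc_a == 0 or assoc_b == 0:
        return float("inf")
    return cut / assoc_a + cut / assoc_b


def best_contiguous_split(w):
    """Return the split index k (1 <= k < n) with the smallest Ncut cost.

    Assumption: only temporally contiguous partitions are considered.
    """
    n = len(w)
    return min(range(1, n), key=lambda k: ncut_value(w, k))


# Hypothetical 5-shot similarity matrix: shots 0-2 resemble each other,
# shots 3-4 resemble each other, and cross-group similarity is low.
toy_w = [
    [1.0, 0.9, 0.8, 0.1, 0.1],
    [0.9, 1.0, 0.9, 0.1, 0.1],
    [0.8, 0.9, 1.0, 0.2, 0.1],
    [0.1, 0.1, 0.2, 1.0, 0.9],
    [0.1, 0.1, 0.1, 0.9, 1.0],
]

split = best_contiguous_split(toy_w)
print(split)  # index of the first shot of the second scene
```

Applying the same split recursively to each resulting subgraph, with a stopping threshold on the cut cost, would yield more than two scenes; the threshold and the recursion strategy are likewise left open here because the record does not specify them.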