A comprehensive system for image scene classification

In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human percepti...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Multimedia tools and applications 2020-07, Vol.79 (25-26), p.18033-18058
Hauptverfasser:	Sorkhi, Ali Ghanbari, Hassanpour, Hamid, Fateh, Mansoor
Format:	Artikel
Sprache:	eng
Schlagworte:	Bayesian analysis Classification Computer Communication Networks Computer Science Data mining Data Structures and Information Theory Datasets Feature extraction Image classification Image processing Labels Modelling Multimedia Information Systems Object recognition Semantics Special Purpose and Application-Based Systems
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	18058
container_issue	25-26
container_start_page	18033
container_title	Multimedia tools and applications
container_volume	79
creator	Sorkhi, Ali Ghanbari Hassanpour, Hamid Fateh, Mansoor
description	In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.
doi_str_mv	10.1007/s11042-019-08264-y
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2420901801</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2420901801</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2420901801</pqid></control><display><type>article</type><title>A comprehensive system for image scene classification</title><source>SpringerLink (Online service)</source><creator>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creator><creatorcontrib>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creatorcontrib><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-019-08264-y</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bayesian analysis ; Classification ; Computer Communication Networks ; Computer Science ; Data mining ; Data Structures and Information Theory ; Datasets ; Feature extraction ; Image classification ; Image processing ; Labels ; Modelling ; Multimedia Information Systems ; Object recognition ; Semantics ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</citedby><cites>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-019-08264-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-019-08264-y$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><title>A comprehensive system for image scene classification</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Feature extraction</subject><subject>Image classification</subject><subject>Image processing</subject><subject>Labels</subject><subject>Modelling</subject><subject>Multimedia Information Systems</subject><subject>Object recognition</subject><subject>Semantics</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Sorkhi, Ali Ghanbari</creator><creator>Hassanpour, Hamid</creator><creator>Fateh, Mansoor</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20200701</creationdate><title>A comprehensive system for image scene classification</title><author>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Feature extraction</topic><topic>Image classification</topic><topic>Image processing</topic><topic>Labels</topic><topic>Modelling</topic><topic>Multimedia Information Systems</topic><topic>Object recognition</topic><topic>Semantics</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI-INFORM Complete</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global (ProQuest)</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sorkhi, Ali Ghanbari</au><au>Hassanpour, Hamid</au><au>Fateh, Mansoor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A comprehensive system for image scene classification</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>79</volume><issue>25-26</issue><spage>18033</spage><epage>18058</epage><pages>18033-18058</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-019-08264-y</doi><tpages>26</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1380-7501
ispartof	Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058
issn	1380-7501 1573-7721
language	eng
recordid	cdi_proquest_journals_2420901801
source	SpringerLink (Online service)
subjects	Bayesian analysis Classification Computer Communication Networks Computer Science Data mining Data Structures and Information Theory Datasets Feature extraction Image classification Image processing Labels Modelling Multimedia Information Systems Object recognition Semantics Special Purpose and Application-Based Systems
title	A comprehensive system for image scene classification
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T08%3A45%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20comprehensive%20system%20for%20image%20scene%20classification&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Sorkhi,%20Ali%20Ghanbari&rft.date=2020-07-01&rft.volume=79&rft.issue=25-26&rft.spage=18033&rft.epage=18058&rft.pages=18033-18058&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-019-08264-y&rft_dat=%3Cproquest_cross%3E2420901801%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2420901801&rft_id=info:pmid/&rfr_iscdi=true