A comprehensive system for image scene classification

In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human percepti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2020-07, Vol.79 (25-26), p.18033-18058
Hauptverfasser: Sorkhi, Ali Ghanbari, Hassanpour, Hamid, Fateh, Mansoor
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 18058
container_issue 25-26
container_start_page 18033
container_title Multimedia tools and applications
container_volume 79
creator Sorkhi, Ali Ghanbari
Hassanpour, Hamid
Fateh, Mansoor
description In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.
doi_str_mv 10.1007/s11042-019-08264-y
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2420901801</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2420901801</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2420901801</pqid></control><display><type>article</type><title>A comprehensive system for image scene classification</title><source>SpringerLink (Online service)</source><creator>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creator><creatorcontrib>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creatorcontrib><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-019-08264-y</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bayesian analysis ; Classification ; Computer Communication Networks ; Computer Science ; Data mining ; Data Structures and Information Theory ; Datasets ; Feature extraction ; Image classification ; Image processing ; Labels ; Modelling ; Multimedia Information Systems ; Object recognition ; Semantics ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</citedby><cites>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-019-08264-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-019-08264-y$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><title>A comprehensive system for image scene classification</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Feature extraction</subject><subject>Image classification</subject><subject>Image processing</subject><subject>Labels</subject><subject>Modelling</subject><subject>Multimedia Information Systems</subject><subject>Object recognition</subject><subject>Semantics</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Sorkhi, Ali Ghanbari</creator><creator>Hassanpour, Hamid</creator><creator>Fateh, Mansoor</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20200701</creationdate><title>A comprehensive system for image scene classification</title><author>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Feature extraction</topic><topic>Image classification</topic><topic>Image processing</topic><topic>Labels</topic><topic>Modelling</topic><topic>Multimedia Information Systems</topic><topic>Object recognition</topic><topic>Semantics</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI-INFORM Complete</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global (ProQuest)</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sorkhi, Ali Ghanbari</au><au>Hassanpour, Hamid</au><au>Fateh, Mansoor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A comprehensive system for image scene classification</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>79</volume><issue>25-26</issue><spage>18033</spage><epage>18058</epage><pages>18033-18058</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-019-08264-y</doi><tpages>26</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058
issn 1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_2420901801
source SpringerLink (Online service)
subjects Bayesian analysis
Classification
Computer Communication Networks
Computer Science
Data mining
Data Structures and Information Theory
Datasets
Feature extraction
Image classification
Image processing
Labels
Modelling
Multimedia Information Systems
Object recognition
Semantics
Special Purpose and Application-Based Systems
title A comprehensive system for image scene classification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T08%3A45%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20comprehensive%20system%20for%20image%20scene%20classification&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Sorkhi,%20Ali%20Ghanbari&rft.date=2020-07-01&rft.volume=79&rft.issue=25-26&rft.spage=18033&rft.epage=18058&rft.pages=18033-18058&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-019-08264-y&rft_dat=%3Cproquest_cross%3E2420901801%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2420901801&rft_id=info:pmid/&rfr_iscdi=true