A comprehensive system for image scene classification
In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human percepti...
Gespeichert in:
Veröffentlicht in: | Multimedia tools and applications 2020-07, Vol.79 (25-26), p.18033-18058 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 18058 |
---|---|
container_issue | 25-26 |
container_start_page | 18033 |
container_title | Multimedia tools and applications |
container_volume | 79 |
creator | Sorkhi, Ali Ghanbari Hassanpour, Hamid Fateh, Mansoor |
description | In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets. |
doi_str_mv | 10.1007/s11042-019-08264-y |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2420901801</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2420901801</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2420901801</pqid></control><display><type>article</type><title>A comprehensive system for image scene classification</title><source>SpringerLink (Online service)</source><creator>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creator><creatorcontrib>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</creatorcontrib><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-019-08264-y</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bayesian analysis ; Classification ; Computer Communication Networks ; Computer Science ; Data mining ; Data Structures and Information Theory ; Datasets ; Feature extraction ; Image classification ; Image processing ; Labels ; Modelling ; Multimedia Information Systems ; Object recognition ; Semantics ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</citedby><cites>FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-019-08264-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-019-08264-y$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><title>A comprehensive system for image scene classification</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</description><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Feature extraction</subject><subject>Image classification</subject><subject>Image processing</subject><subject>Labels</subject><subject>Modelling</subject><subject>Multimedia Information Systems</subject><subject>Object recognition</subject><subject>Semantics</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC5-hM_m6OpagVCl70HHazSd3S7tZkK-y3N3UFb55mGN578_gRcotwjwD6ISGCYBTQUCiZEnQ8IzOUmlOtGZ7nnZdAtQS8JFcpbQFQSSZmRC4K1-8P0X_4LrVfvkhjGvy-CH0s2n21yQfnO1-4XZVSG1pXDW3fXZOLUO2Sv_mdc_L-9Pi2XNH16_PLcrGmjqMZqCglGIVOYFmbujG8FkGcLsEbwZVyuRsLjRdNqUWtVMW806UJQjLdCMn5nNxNuYfYfx59Guy2P8Yuv7RMMDCAJWBWsUnlYp9S9MEeYu4eR4tgT3jshMdmPPYHjx2ziU-mlMXdxse_6H9c31BUZvk</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Sorkhi, Ali Ghanbari</creator><creator>Hassanpour, Hamid</creator><creator>Fateh, Mansoor</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20200701</creationdate><title>A comprehensive system for image scene classification</title><author>Sorkhi, Ali Ghanbari ; Hassanpour, Hamid ; Fateh, Mansoor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-4850961c418b9bd93b4f40961fe94366c7212fde4d874b66a2ec789f4527d4533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Feature extraction</topic><topic>Image classification</topic><topic>Image processing</topic><topic>Labels</topic><topic>Modelling</topic><topic>Multimedia Information Systems</topic><topic>Object recognition</topic><topic>Semantics</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sorkhi, Ali Ghanbari</creatorcontrib><creatorcontrib>Hassanpour, Hamid</creatorcontrib><creatorcontrib>Fateh, Mansoor</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI-INFORM Complete</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global (ProQuest)</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sorkhi, Ali Ghanbari</au><au>Hassanpour, Hamid</au><au>Fateh, Mansoor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A comprehensive system for image scene classification</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>79</volume><issue>25-26</issue><spage>18033</spage><epage>18058</epage><pages>18033-18058</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In recent years, image scene classification based on low/high-level features has been considered as one of the most important and challenging problems faced in image processing research. The high-level features based on semantic concepts present a more accurate and closer model to the human perception of the image scene content. This paper presents a new multi-stage approach for image scene classification based on high-level semantic features extracted from image content. In the first stage, the object boundaries and their labels that represent the content are extracted. For this purpose, a combined method of a fully convolutional deep network and a combined network of a two-class SVM-fuzzy and SVR are used. Topic modeling is used to represent the latent relationships between the objects. Hence in the second stage, a new combination of methods consisting of the bag of visual words, and supervised document neural autoregressive distribution estimator is used to extract the latent topics (topic modeling) in the image. Finally, classification based on Bayesian method is performed according to the extracted features of the deep network, objects labels and the latent topics in the image. The proposed method has been evaluated on three datasets: Scene15, UIUC Sports, and MIT-67 Indoor. The experimental results show that the proposed approach achieves average performance improvement of 12%, 11% and 14% in the accuracy of object detection, and 0.5%, 0.6% and 1.8% in the mean average precision criteria of the image scene classification, compared to the previous state-of-the-art methods on these three datasets.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-019-08264-y</doi><tpages>26</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1380-7501 |
ispartof | Multimedia tools and applications, 2020-07, Vol.79 (25-26), p.18033-18058 |
issn | 1380-7501 1573-7721 |
language | eng |
recordid | cdi_proquest_journals_2420901801 |
source | SpringerLink (Online service) |
subjects | Bayesian analysis Classification Computer Communication Networks Computer Science Data mining Data Structures and Information Theory Datasets Feature extraction Image classification Image processing Labels Modelling Multimedia Information Systems Object recognition Semantics Special Purpose and Application-Based Systems |
title | A comprehensive system for image scene classification |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T08%3A45%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20comprehensive%20system%20for%20image%20scene%20classification&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Sorkhi,%20Ali%20Ghanbari&rft.date=2020-07-01&rft.volume=79&rft.issue=25-26&rft.spage=18033&rft.epage=18058&rft.pages=18033-18058&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-019-08264-y&rft_dat=%3Cproquest_cross%3E2420901801%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2420901801&rft_id=info:pmid/&rfr_iscdi=true |