Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation

Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed f...

Bibliographic details
Published in:	Information technology and control 2021-12, Vol.50 (4), p.686-705
Main authors:	Maheswari, B. Uma; Sonia, R.; Rajakumar, M. P.; Ramya, J.
Format:	Article
Language:	eng
Subjects:	Automation & Control Systems; Computer Science; Computer Science, Artificial Intelligence; Computer Science, Information Systems; Science & Technology; Technology
Online access:	Full text

container_end_page	705
container_issue	4
container_start_page	686
container_title	Information technology and control
container_volume	50
creator	Maheswari, B. Uma; Sonia, R.; Rajakumar, M. P.; Ramya, J.
description	Recognition of human actions is an active research topic because it can support important medical applications such as life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are first partitioned into several temporal segments of a predefined length. From these temporal segments, key-cuboids are extracted at the locations having maximum variation in orientation. From the key-cuboids, Histogram of Oriented Gradients (HOG) features are extracted. This new descriptor can express the dynamic features in the action videos. Using these features, a single shared dictionary is created during the training phase from the videos belonging to different classes using the K-Singular Value Decomposition (K-SVD) algorithm. This dictionary combines the features of all the action classes. During the testing phase, the features belonging to a test class are classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) algorithm that uses Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated on popular benchmark action recognition datasets: the KTH, Olympic and Hollywood datasets. The results obtained on these datasets were represented in the form of a confusion matrix. Evaluation was performed using overall classification accuracy, specificity, precision, recall and F-score, all obtained from the confusion matrix. The system achieved a specificity of 99.52%, 99.16% and 96.15% for the KTH, Olympic and Hollywood datasets, respectively. Similarly, the proposed framework attained a precision of 97.64%, 90.46% and 73.39% for the KTH, Olympic and Hollywood datasets, respectively. The average recall was 97.58%, 90.86% and 74.09% for the KTH, Olympic and Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm compared favorably with the existing state-of-the-art human action recognition frameworks in the literature.
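
The description states that accuracy, specificity, precision, recall and F-score were all obtained from a confusion matrix. As a rough illustration of how such metrics are commonly derived for a multi-class problem, the sketch below uses the usual one-vs-rest definitions and macro-averaging. The record does not confirm that the authors used exactly these definitions, and the function names (`per_class_metrics`, `macro_average`, `overall_accuracy`) and the 3x3 matrix are hypothetical; the numbers are not taken from the paper.

```python
from typing import Dict, List


def per_class_metrics(cm: List[List[int]]) -> List[Dict[str, float]]:
    """One-vs-rest metrics per class.

    cm[i][j] is the number of samples whose true class is i and whose
    predicted class is j (assumed convention, not from the record).
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    results = []
    for k in range(n):
        tp = cm[k][k]                               # class k predicted as k
        fn = sum(cm[k]) - tp                        # class k predicted as something else
        fp = sum(cm[i][k] for i in range(n)) - tp   # other classes predicted as k
        tn = total - tp - fn - fp                   # everything not involving class k
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        results.append({
            "precision": precision,
            "recall": recall,
            "specificity": specificity,
            "f_score": f_score,
        })
    return results


def macro_average(metrics: List[Dict[str, float]], key: str) -> float:
    """Unweighted mean of one metric over all classes."""
    return sum(m[key] for m in metrics) / len(metrics)


def overall_accuracy(cm: List[List[int]]) -> float:
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total if total else 0.0


# Illustrative 3-class confusion matrix (made-up counts).
toy_cm = [
    [8, 2, 0],
    [1, 7, 2],
    [0, 1, 9],
]
toy_metrics = per_class_metrics(toy_cm)
toy_accuracy = overall_accuracy(toy_cm)
toy_macro_recall = macro_average(toy_metrics, "recall")
```

Macro-averaging weights every action class equally, which is one plausible reading of "the average value of recall" in the description; a sample-weighted average would be an equally valid alternative when class sizes differ.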
doi_str_mv	10.5755/j01.itc.50.4.27845
format	Article
fullrecord	<record><control><sourceid>webofscience_cross</sourceid><recordid>TN_cdi_webofscience_primary_000766012000006</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>000766012000006</sourcerecordid><originalsourceid>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</originalsourceid><addsrcrecordid>eNqNkE9LAzEQxYMoWLRfwFPusmv-Z3uURVuhWlALvYVsNlsjbVKSrcVvb7YVz85l5g3vDcwPgBuMSi45v_tEuHS9KTkqWUlkxfgZGBFKeVFVbHUORphOSIEJW12CcUqfCCHCEacMj8DhJXzZDXzW5sN5C-dWR-_8GnYhwtl-qz28N70LPsF6o1NynTN60HCZBtvMpT6so97C0MFFdNb3toXTqNthTFD7Fr7tdEwWvtpdtClvj_lrcNHpTbLj334Flo8P7_WsmC-mT_X9vDCEyb7QljFCqqYRLbWtbiglhIuGMcQk4ZIRQVrZNEzwStLJpLGCoaxshYXuKGL0CpDTXRNDStF2ahfdVsdvhZEa6KlMT2V6iiPF1JFeDt2eQgfbhC6Z_Iuxf8GMTwqBMEFDieyu_u-u3en_Oux9T38AOdSFtQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /></source><creator>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</creator><creatorcontrib>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</creatorcontrib><description>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. 
From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. 
It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</description><identifier>ISSN: 1392-124X</identifier><identifier>EISSN: 2335-884X</identifier><identifier>DOI: 10.5755/j01.itc.50.4.27845</identifier><language>eng</language><publisher>KAUNAS: Kaunas Univ Technology</publisher><subject>Automation & Control Systems ; Computer Science ; Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Science & Technology ; Technology</subject><ispartof>Information technology and control, 2021-12, Vol.50 (4), p.686-705</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>1</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000766012000006</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</citedby><cites>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</cites><orcidid>0000-0003-2821-7677</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,865,2115,27929,27930,39263</link.rule.ids></links><search><creatorcontrib>Maheswari, B. Uma</creatorcontrib><creatorcontrib>Sonia, R.</creatorcontrib><creatorcontrib>Rajakumar, M. 
P.</creatorcontrib><creatorcontrib>Ramya, J.</creatorcontrib><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><title>Information technology and control</title><addtitle>INF TECHNOL CONTROL</addtitle><description>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. 
Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</description><subject>Automation & Control Systems</subject><subject>Computer Science</subject><subject>Computer Science, Artificial Intelligence</subject><subject>Computer Science, Information Systems</subject><subject>Science & Technology</subject><subject>Technology</subject><issn>1392-124X</issn><issn>2335-884X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>HGBXW</sourceid><recordid>eNqNkE9LAzEQxYMoWLRfwFPusmv-Z3uURVuhWlALvYVsNlsjbVKSrcVvb7YVz85l5g3vDcwPgBuMSi45v_tEuHS9KTkqWUlkxfgZGBFKeVFVbHUORphOSIEJW12CcUqfCCHCEacMj8DhJXzZDXzW5sN5C-dWR-_8GnYhwtl-qz28N70LPsF6o1NynTN60HCZBtvMpT6so97C0MFFdNb3toXTqNthTFD7Fr7tdEwWvtpdtClvj_lrcNHpTbLj334Flo8P7_WsmC-mT_X9vDCEyb7QljFCqqYRLbWtbiglhIuGMcQk4ZIRQVrZNEzwStLJpLGCoaxshYXuKGL0CpDTXRNDStF2ahfdVsdvhZEa6KlMT2V6iiPF1JFeDt2eQgfbhC6Z_Iuxf8GMTwqBMEFDieyu_u-u3en_Oux9T38AOdSFtQ</recordid><startdate>20211216</startdate><enddate>20211216</enddate><creator>Maheswari, B. Uma</creator><creator>Sonia, R.</creator><creator>Rajakumar, M. 
P.</creator><creator>Ramya, J.</creator><general>Kaunas Univ Technology</general><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-2821-7677</orcidid></search><sort><creationdate>20211216</creationdate><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><author>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Automation & Control Systems</topic><topic>Computer Science</topic><topic>Computer Science, Artificial Intelligence</topic><topic>Computer Science, Information Systems</topic><topic>Science & Technology</topic><topic>Technology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Maheswari, B. Uma</creatorcontrib><creatorcontrib>Sonia, R.</creatorcontrib><creatorcontrib>Rajakumar, M. P.</creatorcontrib><creatorcontrib>Ramya, J.</creatorcontrib><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>CrossRef</collection><jtitle>Information technology and control</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Maheswari, B. Uma</au><au>Sonia, R.</au><au>Rajakumar, M. 
P.</au><au>Ramya, J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</atitle><jtitle>Information technology and control</jtitle><stitle>INF TECHNOL CONTROL</stitle><date>2021-12-16</date><risdate>2021</risdate><volume>50</volume><issue>4</issue><spage>686</spage><epage>705</epage><pages>686-705</pages><issn>1392-124X</issn><eissn>2335-884X</eissn><abstract>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. 
The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</abstract><cop>KAUNAS</cop><pub>Kaunas Univ Technology</pub><doi>10.5755/j01.itc.50.4.27845</doi><tpages>20</tpages><orcidid>https://orcid.org/0000-0003-2821-7677</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1392-124X
ispartof	Information technology and control, 2021-12, Vol.50 (4), p.686-705
issn	1392-124X 2335-884X
language	eng
recordid	cdi_webofscience_primary_000766012000006
source	DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Web of Science - Science Citation Index Expanded - 2021
subjects	Automation & Control Systems; Computer Science; Computer Science, Artificial Intelligence; Computer Science, Information Systems; Science & Technology; Technology
title	Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T02%3A41%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-webofscience_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Novel%20Machine%20Learning%20for%20Human%20Actions%20Classification%20Using%20Histogram%20of%20Oriented%20Gradients%20and%20Sparse%20Representation&rft.jtitle=Information%20technology%20and%20control&rft.au=Maheswari,%20B.%20Uma&rft.date=2021-12-16&rft.volume=50&rft.issue=4&rft.spage=686&rft.epage=705&rft.pages=686-705&rft.issn=1392-124X&rft.eissn=2335-884X&rft_id=info:doi/10.5755/j01.itc.50.4.27845&rft_dat=%3Cwebofscience_cross%3E000766012000006%3C/webofscience_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true