Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation

Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information technology and control 2021-12, Vol.50 (4), p.686-705
Hauptverfasser: Maheswari, B. Uma, Sonia, R., Rajakumar, M. P., Ramya, J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 705
container_issue 4
container_start_page 686
container_title Information technology and control
container_volume 50
creator Maheswari, B. Uma
Sonia, R.
Rajakumar, M. P.
Ramya, J.
description Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.
doi_str_mv 10.5755/j01.itc.50.4.27845
format Article
fullrecord <record><control><sourceid>webofscience_cross</sourceid><recordid>TN_cdi_webofscience_primary_000766012000006</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>000766012000006</sourcerecordid><originalsourceid>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</originalsourceid><addsrcrecordid>eNqNkE9LAzEQxYMoWLRfwFPusmv-Z3uURVuhWlALvYVsNlsjbVKSrcVvb7YVz85l5g3vDcwPgBuMSi45v_tEuHS9KTkqWUlkxfgZGBFKeVFVbHUORphOSIEJW12CcUqfCCHCEacMj8DhJXzZDXzW5sN5C-dWR-_8GnYhwtl-qz28N70LPsF6o1NynTN60HCZBtvMpT6so97C0MFFdNb3toXTqNthTFD7Fr7tdEwWvtpdtClvj_lrcNHpTbLj334Flo8P7_WsmC-mT_X9vDCEyb7QljFCqqYRLbWtbiglhIuGMcQk4ZIRQVrZNEzwStLJpLGCoaxshYXuKGL0CpDTXRNDStF2ahfdVsdvhZEa6KlMT2V6iiPF1JFeDt2eQgfbhC6Z_Iuxf8GMTwqBMEFDieyu_u-u3en_Oux9T38AOdSFtQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Web of Science - Science Citation Index Expanded - 2021&lt;img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /&gt;</source><creator>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</creator><creatorcontrib>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</creatorcontrib><description>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</description><identifier>ISSN: 1392-124X</identifier><identifier>EISSN: 2335-884X</identifier><identifier>DOI: 10.5755/j01.itc.50.4.27845</identifier><language>eng</language><publisher>KAUNAS: Kaunas Univ Technology</publisher><subject>Automation &amp; Control Systems ; Computer Science ; Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Science &amp; Technology ; Technology</subject><ispartof>Information technology and control, 2021-12, Vol.50 (4), p.686-705</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>1</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000766012000006</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</citedby><cites>FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</cites><orcidid>0000-0003-2821-7677</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,865,2115,27929,27930,39263</link.rule.ids></links><search><creatorcontrib>Maheswari, B. Uma</creatorcontrib><creatorcontrib>Sonia, R.</creatorcontrib><creatorcontrib>Rajakumar, M. P.</creatorcontrib><creatorcontrib>Ramya, J.</creatorcontrib><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><title>Information technology and control</title><addtitle>INF TECHNOL CONTROL</addtitle><description>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</description><subject>Automation &amp; Control Systems</subject><subject>Computer Science</subject><subject>Computer Science, Artificial Intelligence</subject><subject>Computer Science, Information Systems</subject><subject>Science &amp; Technology</subject><subject>Technology</subject><issn>1392-124X</issn><issn>2335-884X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>HGBXW</sourceid><recordid>eNqNkE9LAzEQxYMoWLRfwFPusmv-Z3uURVuhWlALvYVsNlsjbVKSrcVvb7YVz85l5g3vDcwPgBuMSi45v_tEuHS9KTkqWUlkxfgZGBFKeVFVbHUORphOSIEJW12CcUqfCCHCEacMj8DhJXzZDXzW5sN5C-dWR-_8GnYhwtl-qz28N70LPsF6o1NynTN60HCZBtvMpT6so97C0MFFdNb3toXTqNthTFD7Fr7tdEwWvtpdtClvj_lrcNHpTbLj334Flo8P7_WsmC-mT_X9vDCEyb7QljFCqqYRLbWtbiglhIuGMcQk4ZIRQVrZNEzwStLJpLGCoaxshYXuKGL0CpDTXRNDStF2ahfdVsdvhZEa6KlMT2V6iiPF1JFeDt2eQgfbhC6Z_Iuxf8GMTwqBMEFDieyu_u-u3en_Oux9T38AOdSFtQ</recordid><startdate>20211216</startdate><enddate>20211216</enddate><creator>Maheswari, B. Uma</creator><creator>Sonia, R.</creator><creator>Rajakumar, M. P.</creator><creator>Ramya, J.</creator><general>Kaunas Univ Technology</general><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-2821-7677</orcidid></search><sort><creationdate>20211216</creationdate><title>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</title><author>Maheswari, B. Uma ; Sonia, R. ; Rajakumar, M. P. ; Ramya, J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c247t-ae44228bb6d3edab332256b440472574262d7bb46587399be640b46e816af3043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Automation &amp; Control Systems</topic><topic>Computer Science</topic><topic>Computer Science, Artificial Intelligence</topic><topic>Computer Science, Information Systems</topic><topic>Science &amp; Technology</topic><topic>Technology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Maheswari, B. Uma</creatorcontrib><creatorcontrib>Sonia, R.</creatorcontrib><creatorcontrib>Rajakumar, M. P.</creatorcontrib><creatorcontrib>Ramya, J.</creatorcontrib><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>CrossRef</collection><jtitle>Information technology and control</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Maheswari, B. Uma</au><au>Sonia, R.</au><au>Rajakumar, M. P.</au><au>Ramya, J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation</atitle><jtitle>Information technology and control</jtitle><stitle>INF TECHNOL CONTROL</stitle><date>2021-12-16</date><risdate>2021</risdate><volume>50</volume><issue>4</issue><spage>686</spage><epage>705</epage><pages>686-705</pages><issn>1392-124X</issn><eissn>2335-884X</eissn><abstract>Recognition of human actions is a trending research topic as it can be used for crucial medical applications like life care and healthcare. In this research, we propose a novel machine learning algorithm for the classification of human actions based on sparse representation theory. In the proposed framework, the input videos are initially partitioned into several temporal segments of a predefined length. From these temporal segments, the key-cuboids are then obtained. These cuboids are obtained based on the locations having maximum variation in orientation. From these regions, key-cuboids are extracted. From the key-cuboids, Histogram of Oriented Gradient (HOG) features are extracted. This new descriptor has the capability to express the dynamic features in the action videos. Using these features, a single shared dictionary is created from the videos belonging to different classes using K-Singular Value Decomposition (K-SVD) algorithm. This dictionary has the combined features of all the action classes. This shared dictionary is generated during the training phase. During the testing phase, the features belonging to a test class is classified using a novel Sparse Representation Modeling based Action Recognition (SRMAR) Algorithm using Orthogonal Matching Pursuit (OMP) and the shared dictionary. The proposed framework was evaluated using popular benchmark action recognition datasets like KTH dataset, Olympic dataset and the Hollywood dataset. The results obtained using these datasets were represented in the form of a confusion matrix. Evaluation was performed using metrics like overall classification accuracy, specificity, precision, recall and F-score that were obtained from the confusion matrix. This system achieved a high specificity of about 99.52%, 99.16% and 96.15% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Similarly, the proposed framework attained very good precision of 97.64%, 90.46% and 73.39% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. Also, the average value of recall achieved was 97.58%, 90.86% and 74.09% for the KTH dataset, Olympic dataset and the Hollywood datasets, respectively. It was also observed that the proposed machine learning algorithm achieved outstanding results compared to the existing state-of-the-art human action recognition frameworks in the literature.</abstract><cop>KAUNAS</cop><pub>Kaunas Univ Technology</pub><doi>10.5755/j01.itc.50.4.27845</doi><tpages>20</tpages><orcidid>https://orcid.org/0000-0003-2821-7677</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1392-124X
ispartof Information technology and control, 2021-12, Vol.50 (4), p.686-705
issn 1392-124X
2335-884X
language eng
recordid cdi_webofscience_primary_000766012000006
source DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />
subjects Automation & Control Systems
Computer Science
Computer Science, Artificial Intelligence
Computer Science, Information Systems
Science & Technology
Technology
title Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T02%3A41%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-webofscience_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Novel%20Machine%20Learning%20for%20Human%20Actions%20Classification%20Using%20Histogram%20of%20Oriented%20Gradients%20and%20Sparse%20Representation&rft.jtitle=Information%20technology%20and%20control&rft.au=Maheswari,%20B.%20Uma&rft.date=2021-12-16&rft.volume=50&rft.issue=4&rft.spage=686&rft.epage=705&rft.pages=686-705&rft.issn=1392-124X&rft.eissn=2335-884X&rft_id=info:doi/10.5755/j01.itc.50.4.27845&rft_dat=%3Cwebofscience_cross%3E000766012000006%3C/webofscience_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true