A Deep Learning Based System for the Detection of Human Violence in Video Data

The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Traitement du signal 2021-12, Vol.38 (6), p.1623-1635
Hauptverfasser:	Shoaib, Muhammad, Sayed, Nasir
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1635
container_issue	6
container_start_page	1623
container_title	Traitement du signal
container_volume	38
creator	Shoaib, Muhammad Sayed, Nasir
description	The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.
doi_str_mv	10.18280/ts.380606
format	Article
fullrecord	<record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_18280_ts_380606</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_18280_ts_380606</sourcerecordid><originalsourceid>FETCH-LOGICAL-c226t-d7d8dc20ca480ef163fa27bcc6b4729efc9b065f6ce629895d254cbf29d83df3</originalsourceid><addsrcrecordid>eNotkLFOwzAURS0EElHpwhd4Rkp5tmPHHksLLVIEAxVr5NjPENQklW2G_j2Fcpd7h6M7HEJuGSyY5hruc1oIDQrUBSmYkbqUCvQlKaBWsgRg5prMU_qCUwSrlBIFeVnSNeKBNmjj2I8f9MEm9PTtmDIONEyR5k88IRld7qeRToFuvwc70vd-2uPokPa_2-NE1zbbG3IV7D7h_L9nZPf0uFtty-Z187xaNqXjXOXS1157x8HZSgMGpkSwvO6cU11Vc4PBmQ6UDMqh4kYb6bmsXBe48Vr4IGbk7nzr4pRSxNAeYj_YeGwZtH8u2pzaswvxA32lUQc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Shoaib, Muhammad ; Sayed, Nasir</creator><creatorcontrib>Shoaib, Muhammad ; Sayed, Nasir</creatorcontrib><description>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</description><identifier>ISSN: 0765-0019</identifier><identifier>EISSN: 1958-5608</identifier><identifier>DOI: 10.18280/ts.380606</identifier><language>eng</language><ispartof>Traitement du signal, 2021-12, Vol.38 (6), p.1623-1635</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Shoaib, Muhammad</creatorcontrib><creatorcontrib>Sayed, Nasir</creatorcontrib><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><title>Traitement du signal</title><description>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</description><issn>0765-0019</issn><issn>1958-5608</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNotkLFOwzAURS0EElHpwhd4Rkp5tmPHHksLLVIEAxVr5NjPENQklW2G_j2Fcpd7h6M7HEJuGSyY5hruc1oIDQrUBSmYkbqUCvQlKaBWsgRg5prMU_qCUwSrlBIFeVnSNeKBNmjj2I8f9MEm9PTtmDIONEyR5k88IRld7qeRToFuvwc70vd-2uPokPa_2-NE1zbbG3IV7D7h_L9nZPf0uFtty-Z187xaNqXjXOXS1157x8HZSgMGpkSwvO6cU11Vc4PBmQ6UDMqh4kYb6bmsXBe48Vr4IGbk7nzr4pRSxNAeYj_YeGwZtH8u2pzaswvxA32lUQc</recordid><startdate>20211201</startdate><enddate>20211201</enddate><creator>Shoaib, Muhammad</creator><creator>Sayed, Nasir</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20211201</creationdate><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><author>Shoaib, Muhammad ; Sayed, Nasir</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c226t-d7d8dc20ca480ef163fa27bcc6b4729efc9b065f6ce629895d254cbf29d83df3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shoaib, Muhammad</creatorcontrib><creatorcontrib>Sayed, Nasir</creatorcontrib><collection>CrossRef</collection><jtitle>Traitement du signal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shoaib, Muhammad</au><au>Sayed, Nasir</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Deep Learning Based System for the Detection of Human Violence in Video Data</atitle><jtitle>Traitement du signal</jtitle><date>2021-12-01</date><risdate>2021</risdate><volume>38</volume><issue>6</issue><spage>1623</spage><epage>1635</epage><pages>1623-1635</pages><issn>0765-0019</issn><eissn>1958-5608</eissn><abstract>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</abstract><doi>10.18280/ts.380606</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0765-0019
ispartof	Traitement du signal, 2021-12, Vol.38 (6), p.1623-1635
issn	0765-0019 1958-5608
language	eng
recordid	cdi_crossref_primary_10_18280_ts_380606
source	EZB-FREE-00999 freely available EZB journals
title	A Deep Learning Based System for the Detection of Human Violence in Video Data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T13%3A47%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Deep%20Learning%20Based%20System%20for%20the%20Detection%20of%20Human%20Violence%20in%20Video%20Data&rft.jtitle=Traitement%20du%20signal&rft.au=Shoaib,%20Muhammad&rft.date=2021-12-01&rft.volume=38&rft.issue=6&rft.spage=1623&rft.epage=1635&rft.pages=1623-1635&rft.issn=0765-0019&rft.eissn=1958-5608&rft_id=info:doi/10.18280/ts.380606&rft_dat=%3Ccrossref%3E10_18280_ts_380606%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true