A Deep Learning Based System for the Detection of Human Violence in Video Data

The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Traitement du signal 2021-12, Vol.38 (6), p.1623-1635
Hauptverfasser: Shoaib, Muhammad, Sayed, Nasir
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1635
container_issue 6
container_start_page 1623
container_title Traitement du signal
container_volume 38
creator Shoaib, Muhammad
Sayed, Nasir
description The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.
doi_str_mv 10.18280/ts.380606
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_18280_ts_380606</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_18280_ts_380606</sourcerecordid><originalsourceid>FETCH-LOGICAL-c226t-d7d8dc20ca480ef163fa27bcc6b4729efc9b065f6ce629895d254cbf29d83df3</originalsourceid><addsrcrecordid>eNotkLFOwzAURS0EElHpwhd4Rkp5tmPHHksLLVIEAxVr5NjPENQklW2G_j2Fcpd7h6M7HEJuGSyY5hruc1oIDQrUBSmYkbqUCvQlKaBWsgRg5prMU_qCUwSrlBIFeVnSNeKBNmjj2I8f9MEm9PTtmDIONEyR5k88IRld7qeRToFuvwc70vd-2uPokPa_2-NE1zbbG3IV7D7h_L9nZPf0uFtty-Z187xaNqXjXOXS1157x8HZSgMGpkSwvO6cU11Vc4PBmQ6UDMqh4kYb6bmsXBe48Vr4IGbk7nzr4pRSxNAeYj_YeGwZtH8u2pzaswvxA32lUQc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Shoaib, Muhammad ; Sayed, Nasir</creator><creatorcontrib>Shoaib, Muhammad ; Sayed, Nasir</creatorcontrib><description>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</description><identifier>ISSN: 0765-0019</identifier><identifier>EISSN: 1958-5608</identifier><identifier>DOI: 10.18280/ts.380606</identifier><language>eng</language><ispartof>Traitement du signal, 2021-12, Vol.38 (6), p.1623-1635</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Shoaib, Muhammad</creatorcontrib><creatorcontrib>Sayed, Nasir</creatorcontrib><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><title>Traitement du signal</title><description>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</description><issn>0765-0019</issn><issn>1958-5608</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNotkLFOwzAURS0EElHpwhd4Rkp5tmPHHksLLVIEAxVr5NjPENQklW2G_j2Fcpd7h6M7HEJuGSyY5hruc1oIDQrUBSmYkbqUCvQlKaBWsgRg5prMU_qCUwSrlBIFeVnSNeKBNmjj2I8f9MEm9PTtmDIONEyR5k88IRld7qeRToFuvwc70vd-2uPokPa_2-NE1zbbG3IV7D7h_L9nZPf0uFtty-Z187xaNqXjXOXS1157x8HZSgMGpkSwvO6cU11Vc4PBmQ6UDMqh4kYb6bmsXBe48Vr4IGbk7nzr4pRSxNAeYj_YeGwZtH8u2pzaswvxA32lUQc</recordid><startdate>20211201</startdate><enddate>20211201</enddate><creator>Shoaib, Muhammad</creator><creator>Sayed, Nasir</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20211201</creationdate><title>A Deep Learning Based System for the Detection of Human Violence in Video Data</title><author>Shoaib, Muhammad ; Sayed, Nasir</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c226t-d7d8dc20ca480ef163fa27bcc6b4729efc9b065f6ce629895d254cbf29d83df3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shoaib, Muhammad</creatorcontrib><creatorcontrib>Sayed, Nasir</creatorcontrib><collection>CrossRef</collection><jtitle>Traitement du signal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shoaib, Muhammad</au><au>Sayed, Nasir</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Deep Learning Based System for the Detection of Human Violence in Video Data</atitle><jtitle>Traitement du signal</jtitle><date>2021-12-01</date><risdate>2021</risdate><volume>38</volume><issue>6</issue><spage>1623</spage><epage>1635</epage><pages>1623-1635</pages><issn>0765-0019</issn><eissn>1958-5608</eissn><abstract>The number of security cameras positioned within the surrounding area has expanded, increasing the demand for automatic activity recognition systems. In addition to offline assessment and the issuance of an ongoing alarm in the case of aberrant behaviour, automatic activity detection systems can be employed in conjunction with human operators. In the proposed research framework, an ensemble of Mask Region-based Convolutional Neural Networks for key-point detection scheme, and LSTM based Recurrent Neural Network is used to create a deep neural network model (Mask RCNN) for recognizing violent activities (i.e. kicking, punching, etc.) of a single person. First of all, the key-points locations and ground-truth masks of humans in an image are selected using the selected region; the temporal information is extracted. Experimental results show that the ensemble model outperforms individual models. The proposed technique has a reasonable accuracy rate of 77.4 percent, 95.7 percent, and 88.2 percent, respectively, on the Weizmann, KTH, and our custom datasets. As the proposed effort applies to industry and in terms of security, it is beneficial to society.</abstract><doi>10.18280/ts.380606</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0765-0019
ispartof Traitement du signal, 2021-12, Vol.38 (6), p.1623-1635
issn 0765-0019
1958-5608
language eng
recordid cdi_crossref_primary_10_18280_ts_380606
source EZB-FREE-00999 freely available EZB journals
title A Deep Learning Based System for the Detection of Human Violence in Video Data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T13%3A47%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Deep%20Learning%20Based%20System%20for%20the%20Detection%20of%20Human%20Violence%20in%20Video%20Data&rft.jtitle=Traitement%20du%20signal&rft.au=Shoaib,%20Muhammad&rft.date=2021-12-01&rft.volume=38&rft.issue=6&rft.spage=1623&rft.epage=1635&rft.pages=1623-1635&rft.issn=0765-0019&rft.eissn=1958-5608&rft_id=info:doi/10.18280/ts.380606&rft_dat=%3Ccrossref%3E10_18280_ts_380606%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true