Your browser doesn't support javascript.
loading
Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network.
Ullah, Fath U Min; Ullah, Amin; Muhammad, Khan; Haq, Ijaz Ul; Baik, Sung Wook.
Afiliación
  • Ullah FUM; Intelligent Media Laboratory, Digital Contents Research Institute, Sejong University, Seoul 143-747, Korea. fath3797@gmail.com.
  • Ullah A; Intelligent Media Laboratory, Digital Contents Research Institute, Sejong University, Seoul 143-747, Korea. qamin3797@gmail.com.
  • Muhammad K; Department of Software, Sejong University, Seoul 143-747, Korea. Khan.muhammad@ieee.org.
  • Haq IU; Intelligent Media Laboratory, Digital Contents Research Institute, Sejong University, Seoul 143-747, Korea. hijaz3797@gmail.com.
  • Baik SW; Intelligent Media Laboratory, Digital Contents Research Institute, Sejong University, Seoul 143-747, Korea. sbaik@sejong.ac.kr.
Sensors (Basel) ; 19(11)2019 May 30.
Article en En | MEDLINE | ID: mdl-31151184
ABSTRACT
The worldwide utilization of surveillance cameras in smart cities has enabled researchers to analyze a gigantic volume of data to ensure automatic monitoring. An enhanced security system in smart cities, schools, hospitals, and other surveillance domains is mandatory for the detection of violent or abnormal activities to avoid any casualties which could cause social, economic, and ecological damages. Automatic detection of violence for quick actions is very significant and can efficiently assist the concerned departments. In this paper, we propose a triple-staged end-to-end deep learning violence detection framework. First, persons are detected in the surveillance video stream using a light-weight convolutional neural network (CNN) model to reduce and overcome the voluminous processing of useless frames. Second, a sequence of 16 frames with detected persons is passed to 3D CNN, where the spatiotemporal features of these sequences are extracted and fed to the Softmax classifier. Furthermore, we optimized the 3D CNN model using an open visual inference and neural networks optimization toolkit developed by Intel, which converts the trained model into intermediate representation and adjusts it for optimal execution at the end platform for the final prediction of violent activity. After detection of a violent activity, an alert is transmitted to the nearest police station or security department to take prompt preventive actions. We found that our proposed method outperforms the existing state-of-the-art methods for different benchmark datasets.
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Sensors (Basel) Año: 2019 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Sensors (Basel) Año: 2019 Tipo del documento: Article