Enhanced Real-Time Surveillance and Suspect Identification Using CNN-LSTM Based Body Language Analysis

International Journal of Electronics and Communication Engineering
© 2024 by SSRG - IJECE Journal
Volume 11 Issue 9
Year of Publication : 2024
Authors : M. Archana, S. Kavitha, A.Vani Vathsala
pdf
How to Cite?

M. Archana, S. Kavitha, A.Vani Vathsala, "Enhanced Real-Time Surveillance and Suspect Identification Using CNN-LSTM Based Body Language Analysis," SSRG International Journal of Electronics and Communication Engineering, vol. 11,  no. 9, pp. 36-46, 2024. Crossref, https://doi.org/10.14445/23488549/IJECE-V11I9P104

Abstract:

The exponential rise in criminal activities necessitates advanced methods for suspect identification and surveillance. This research aims to tackle this issue through the development of a sophisticated video analytics system leveraging computer vision and deep learning. The primary objective is to accurately identify suspects based on body language patterns extracted from video inputs. We propose a CNN-LSTM based Body Language Rule System (BLRS) that integrates Convolutional Neural Networks (CNNs) for spatial feature extraction and Long Short-Term Memory (LSTM) networks for temporal sequence learning. The system processes video frames to identify key body language indicators such as gestures, postures, and facial expressions. Extensive evaluations using the UCF-Crime Dataset demonstrate the model's high accuracy, with a precision of 95.5%, recall of 95.7%, and overall accuracy of 95.3%. The results indicate that the BLRS significantly outperforms traditional human action recognition models, providing robust and reliable identification of suspicious activities. This research concludes that integrating CNN and LSTM networks within a unified framework enhances real-time surveillance capabilities. The proposed system holds substantial potential for improving public safety and security by enabling more effective monitoring and identification of suspects through advanced body language analysis.

Keywords:

Computer vision, Video analytics, Neural networks, Data analytics,  Deep learning.

References:

[1] Muhammad Shoaib Akhtar, and Tao Feng, “Detection of Malware by Deep Learning as CNN-LSTM Machine Learning Techniques in Real Time,” Symmetry, vol. 14, no. 11, pp. 1-12, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Hyun Bin Kwon et al., “Hybrid CNN-LSTM Network for Real-Time Apnea-Hypopnea Event Detection Based on IR-UWB Radar,” IEEE Access, vol. 10, pp. 17556-17564, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[3] B. Pannalal et al., “Optimizing Pedestrian Analysis at Crosswalks: An Edge-Federated Learning Approach,” International Journal of Computer Engineering in Research Trends, vol. 10, no. 7, pp. 39-48, 2023.
[CrossRef] [Publisher Link]
[4] Madhuri Agrawal, and Shikha Agrawal, “Enhanced Deep Learning for Detecting Suspicious Fall Event in Video Data,” Intelligent Automation & Soft Computing, vol. 36, no. 3, pp. 2653-2667, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[5] M. Bhavsingh, B. Pannalal, and K. Samunnisa, “Review: Pedestrian Behavior Analysis and Trajectory Prediction with Deep Learning,” International Journal of Computer Engineering in Research Trends, vol. 9, no. 12, pp. 263-268, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Waseem Ullah et al., “An Efficient Anomaly Recognition Framework Using an Attention Residual LSTM in Surveillance Videos,” Sensors, vol. 21, no. 8, pp. 1-17, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[7] B. Pannalal, Maloth Bhavsingh, and Y. Ramadevi, “Enhancing Zebra Crossing Safety with Edge-Enabled Deep Learning for Pedestrian Dynamics Prediction,” International Journal of Computer Engineering in Research Trends, vol. 10, no. 10, pp. 71-79, 2023.
[Publisher Link]
[8] Amin Ullah et al., “Efficient Activity Recognition Using Lightweight CNN and DS-GRU Network for Surveillance Applications,” Applied Soft Computing, vol. 103, pp. 1-13, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Qianqian Xiong et al., “Transferable Two-Stream Convolutional Neural Network for Human Action Recognition,” Journal of Manufacturing Systems, vol. 56, pp. 605-614, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Chen Chen et al., “Action Recognition from Depth Sequences Using Weighted Fusion of 2D and 3D Autocorrelation of Gradients Features,” Multimedia Tools and Applications, vol. 76, no. 3, pp. 4651-4669, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[11] N. Kumaran, A. Vadivel, and S. Saravana Kumar, “Recognition of Human Actions Using CNN-GWO: A Novel Modeling of CNN for Enhancement of Classification Performance,” Multimedia Tools and Applications, vol. 77, no. 18, pp. 23115-23147, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Neziha Jaouedi, Noureddine Boujnah, and Med Salim Bouhlel, “A New Hybrid Deep Learning Model for Human Action Recognition,” Journal of King Saud University - Computer and Information Sciences, vol. 32, no. 4, pp. 447-453, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Gül Varol, Ivan Laptev, and Cordelia Schmid, “Long-Term Temporal Convolutions for Action Recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 6, pp. 1510-1517, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Muhammad Zahid et al., “Pedestrian Identification Using Motion-Controlled Deep Neural Network in Real-Time Visual Surveillance,” Soft Computing, vol. 27, pp. 453-469, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Uppalapati Vamsi Krishna et al., “Enhancing Airway Assessment with a Secure Hybrid Network-Blockchain System for CT & CBCT Image Evaluation,” International Research Journal of Multidisciplinary Technovation, vol. 6, no. 2, pp. 51-69, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Massinissa Hamidi, and Aomar Osmani, “Human Activity Recognition: A Dynamic Inductive Bias Selection Perspective,” Sensors, vol. 21, no. 21, pp. 1-42, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Halil İbrahim Öztürk, and Ahmet Burak Can, “ADNet: Temporal Anomaly Detection in Surveillance Videos,” Pattern Recognition. ICPR International Workshops and Challenges, pp. 88-101, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[18] E.V.N. Jyothi et al., “A Graph Neural Network-Based Traffic Flow Prediction System with Enhanced Accuracy and Urban Efficiency,” Journal of Electrical Systems, vol. 19, no. 4, pp. 336-349, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Maringanti Venkata Anirudh Kumar, Rohan Adithyaa Nandedapu, and K. Venkatesh Sharma, “Real-Time Abdominal Trauma Detection Using LSTM Neural Networks with MediaPipe and OpenCV Integration,” Macaw International Journal of Advanced Research in Computer Science and Engineering (MIJARCSE), vol. 10, no. 1, pp. 36-48, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Paolo Dini, Mykola Makhortykh, and Maryna Sydorova, “DataStreamAdapt: Unified Detection Framework for Gradual and Abrupt Concept Drifts,” Synthesis: A Multidisciplinary Research Journal, vol. 1, no. 4, pp. 1-9, 2024.
[Google Scholar] [Publisher Link]
[21] Rick Scholte et al., “Truncated Aperture Extrapolation for Fourier-Based Near-Field Acoustic Holography by Means of Border-Padding,” The Journal of the Acoustical Society of America, vol. 125, no. 6, pp. 3844-3854, 2009.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Ohoud Nafea, Wadood Abdul, and Ghulam Muhammad, “Multi-Sensor Human Activity Recognition Using CNN and GRU,” International Journal of Multimedia Information Retrieval, vol. 11, no. 2, pp. 135-147, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Tongtong Yuan et al., “Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges,” arXiv, pp. 1-19, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Mohannad Elhamod, and Martin D. Levine, “Automated Real-Time Detection of Potentially Suspicious Behavior in Public Transport Areas,” IEEE Transactions on Intelligent Transportation Systems, vol. 14, no. 2, pp. 688-699, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Ahmed Saaudi et al., “Insider Threats Detection Using CNN-LSTM Model,” 2018 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, pp. 94-99, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Feng-Ping An, “Human Action Recognition Algorithm Based on Adaptive Initialization of Deep Learning Model Parameters and Support Vector Machine,” IEEE Access, vol. 6, pp. 59405-59421, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[27] V. Radhika, Ch. Rajendra Prasad, and A. Chakradhar, “Smartphone-Based Human Activities Recognition System Using Random Forest Algorithm,” 2022 International Conference for Advancement in Technology (ICONAT), Goa, India, pp. 1-4, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Pengbo Wang, Yongqiang Zhang, and Wenting Jiang, “Application of K-Nearest Neighbor (KNN) Algorithm for Human Action Recognition,” 2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Chongqing, China, pp. 492-496, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Khan Muhammad et al., “Human Action Recognition Using Attention-Based LSTM Network with Dilated CNN Features,” Future Generation Computer Systems, vol. 125, pp. 820-830, 2021.
[CrossRef] [Google Scholar] [Publisher Link]