Restricted access

Research article

First published online December 8, 2025

Request permissions

Mixed learning based multi-head attention convolutional network for abnormal event detection in video surveillance systems

Tukaram Ugile https://orcid.org/0000-0002-2441-3702 [email protected] and Nilesh Uke https://orcid.org/0000-0002-3281-5429View all authors and affiliations

Volume 21, Issue 3-4

https://doi.org/10.1177/15741702251401521

View article versions

Abstract
References
Author Biographies

Abstract

Video Surveillance is generally utilized in highways, residential zones, schools, and other public areas to monitor events happening in those areas, where detecting abnormal events in video surveillance effectively contributes to guaranteeing the safety of public areas. Although various methods have been created in this field, many unsolved issues remain, such as higher computational complexity, irrelevant features, and low learning capability, are exist in the existing methods, which limit them from obtaining an accurate abnormal event detection. Hence, a Supervised Incremental Learning based Multihead Attention Convolutional Network (SIL-MACoN) model is proposed in this research to detect the abnormal events accurately by eliminating the existing drawbacks. The unification of the Multihead Attention (MA) mechanism helps to increase the ability of the SIL-MACoN model to understand complex features by capturing the variances among the features by multiple heads. Moreover, the utilization of incremental and supervised contrastive learning mechanisms improves the MACoN model's learning capability and performance through updating its knowledge without forgetting the previously learned features and producing similar and dissimilar set features for training, respectively. The SIL-MACoN model attains 97.34% accuracy, 97.36% specificity, and 97.33% sensitivity with 90% of training data using the ShanghaiTech Campus Dataset, respectively.

Get full access to this article

View all access and purchase options for this article.

References

1. Natha S, Ahmed F, Siraj M, et al. Deep BiLSTM attention model for spatial and temporal anomaly detection in video surveillance. Sensors 2025; 25: 251.

2. Ullah W, Ullah A, Haq IU, et al. CNN Features with bi-directional LSTM for real-time anomaly detection in surveillance networks. Multimed Tools Appl 2021; 80: 16979–16995.

3. Georgescu MI, Ionescu RT, Khan FS, et al. A background-agnostic framework with adversarial training for abnormal event detection in video. IEEE Trans Pattern Anal Mach Intell 2021; 44: 4505–4523.

4. Kalshetty R, Parveen A. Abnormal event detection model using an improved ResNet101 in context context-aware surveillance system. Cognit Comput Syst 2023; 5: 153–167.

5. Islam M, Dukyil AS, Alyahya S, et al. An IoT-enabled anomaly detection system for smart city surveillance. Sensors 2023; 23: 2358.

6. Chu W, Xue H, Yao C, et al. Sparse coding guided spatiotemporal feature learning for abnormal event detection in large videos. IEEE Trans Multimedia 2017; 14: 1–14.

7. Merlin RT, Karthick R, Babu AA, et al. Abnormal events detection using spatio-temporal saliency descriptor and fuzzy representation analysis. Sci Rep 2024; 14: 29818.

8. Zhang Q, Wei H, Chen J, et al. Video anomaly detection based on an attention mechanism. Symmetry (Basel) 2023; 15: 528.

9. Zhou JT, Du J, Zhu H, et al. Anomalynet: an anomaly detection network for video surveillance. IEEE Trans Inf Forensics Secur 2019; 14: 2537–2550.

10. Yu J, Lee Y, Yow KC, et al. Abnormal event detection and localization via adversarial event prediction. IEEE Trans Neural Networks Learn Syst 2021; 33: 3572–3586.

11. Huang H, Zhao B, Gao F, et al. A novel unsupervised video anomaly detection framework based on optical flow reconstruction and erased frame prediction. Sensors 2023; 23: 4828.

12. Elmetwally A, Eldeeb R, Elmougy S. Deep learning based anomaly detection in real-time video. Multimed Tools Appl 2025; 84: 9555–9571.

13. Rezaee K, Rezakhani SM, Khosravi MR, et al. A survey on deep learning-based real-time crowd anomaly detection for secure distributed video surveillance. Pers Ubiquitous Comput 2024; 28: 135–151.

14. Pelvan SÖ, Can B, Ozkan H. A hierarchical approach for improved anomaly detection in video surveillance. IEEE Access 2023; 11: 101644–101665.

15. Fang MT, Chen ZJ, Przystupa K, et al. Examination of abnormal behavior detection based on improved YOLOv3. Electronics (Basel) 2021; 10: 197.

16. Wan B, Jiang W, Fang Y, et al. Anomaly detection in video sequences: a benchmark and computational model. IET Image Proc 2021; 15: 3454–3465.

17. Yang Y, Xie L, Fu Z, et al. Pose-oriented scene-adaptive matching for abnormal event detection. Neurocomputing 2025; 611: 128673.

18. Mathieu M, Couprie C, LeCun Y. Deep multiscale video prediction beyond mean square error. CoRR, abs/1511.05440, 2015.

19. Liu W, Luo W, Lian D, et al. Future frame prediction for anomaly detection: a new baseline. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp.6536–6545. 2018.

20. Inayathulla M, Karthikeyan C. Object detection in video summarization for video surveillance applications. Yugoslav J Oper Res 2025; 35: 10.

21. Tran CH, Kong SG. An iterative learning scheme with binary classifier for improved event detection in surveillance video. Electronics (Basel) 2023; 12: 3275.

22. Alafif T, Hadi A, Allahyani M, et al. Hybrid classifiers for spatio-temporal abnormal behavior detection, tracking, and recognition in massive Hajj crowds. Electronics (Basel) 2023; 12: 1165.

23. Zeng X, Wen L, Liu B, et al. Deep learning for ultrasound image caption generation based on object detection. Neurocomputing 2020; 392: 132–141.

24. Paulraj S, Vairavasundaram S. Transformer-enabled weakly supervised abnormal event detection in intelligent video surveillance systems. Eng Appl Artif Intell 2025; 139: 109496.

25. Sengonul E, Samet R, Abu Al-Haija Q, et al. Abnormal event detection in surveillance videos through LSTM auto-encoding and local minima assistance. Discover Internet Things 2025; 5: 32.

26. Ghauri MS, Bajwa UI, Saleem G, et al. Surveillancenet: spatio-temporal anomaly identification in surveillance videos using two-stream CNN and LSTM. Multimed Tools Appl 2025: 1–25.

27. UCSD Anomaly Detection Dataset. http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm (accessed June 2025).

28. Avenue Dataset. https://www.cse.cuhk.edu.hk/leojia/projects/detectabnormal/dataset.html (accessed June 2025).

29. Shanghai Tech Dataset. https://svip-lab.github.io/dataset/campus_dataset.html (accessed June 2025).

Author Biographies

Tukaram Ugile is a PhD Research Scholar in the Computer Engineering Department at VIIT Pune. He has around 20 years of experience in Software Quality Assurance. He is an intacs-certified ASPICE Principal Assessor with extensions in Machine Learning and Cybersecurity. His research interests include Computer Vision, Machine Learning, Cybersecurity, and Software Process Improvement.

Nilesh Uke received the BE degree in Computer Science and Engineering from Amaravati University, India, in 1995, and the ME from Bharathi Vidhyapeeth in 2005 and PhD degrees in Computer Science, from SRTM University, Nanded India, in 2014. He is currently a Principal and Professor at Indira College of Engineering and Management, Pune, India, affiliated to Savitribai Phule Pune University. His current research interest includes Visual Computing, Artificial Intelligence, Human Computer Interface and Multimedia. He is member of IEEE, ACM and Life Member of the Indian Society for Technical Education (ISTE), Computer Society of India (CSI) and Fellow of Institute of Engineers. He has guided 4 PhD scholar and 8 candidates are pursuing PhD.

|

You currently have no access to this content. Visit the access options page to authenticate.