IEEE Access, vol. 13, pp. 182430-182443, 2025 (SCI-Expanded, Scopus)
Information on crowd density, crowd type, and anomalies in crowd videos is critical for smart city and campus applications. Existing crowd-analysis approaches generally focus on either high-performance counting or anomaly detection: crowd-counting studies generate density maps with regression-based neural architectures and count people from these maps, while anomaly-detection approaches address narrow crime-classification tasks that do not generalize well, particularly to dense crowds. In this study, high-performance neural models are developed to classify the density, type, and anomalies of crowd images and videos. A CNN-based multi-task model is developed for density classification, which both generates the density map and classifies the density level. Type classification is performed with a frame-by-frame ViT model that identifies crowd scene categories such as gathering, concert, sports, and protest. Finally, a Swin Transformer model performs multi-class classification of dynamic video segments with respect to anomalies such as running, falling, panic, and violence. The developed models are integrated through Apache Kafka, and each module achieves an F1-score above 90% (density classification: 91.66%, anomaly classification: 96.63%, type classification: 90.9%).
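As a rough illustration of the multi-task density model described above (one shared CNN encoder with a density-map regression head and a density-class head), a minimal PyTorch sketch is given below. The abstract does not specify the paper's backbone, head design, or number of density classes, so the ResNet-18 encoder, head layers, and three-class setup here are assumptions for illustration only.

```python
# Minimal, illustrative sketch of a multi-task density model: a shared CNN
# backbone with (a) a density-map regression head and (b) a density-class head.
# Backbone choice, head sizes, and class count are assumptions, not the paper's
# exact architecture.
import torch
import torch.nn as nn
from torchvision.models import resnet18

class MultiTaskDensityNet(nn.Module):
    def __init__(self, num_density_classes: int = 3):
        super().__init__()
        backbone = resnet18(weights=None)
        # Keep the convolutional stages only (drop avgpool/fc) as a shared encoder.
        self.encoder = nn.Sequential(*list(backbone.children())[:-2])
        # Head 1: 1-channel density map regressed from the shared features.
        self.density_head = nn.Sequential(
            nn.Conv2d(512, 128, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(128, 1, kernel_size=1),
        )
        # Head 2: density-level classifier (e.g. sparse / medium / dense).
        self.class_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(512, num_density_classes),
        )

    def forward(self, x: torch.Tensor):
        feats = self.encoder(x)
        return self.density_head(feats), self.class_head(feats)

# Training would combine a pixel-wise loss on the density map with a
# cross-entropy loss on the density class; shapes shown for a quick check.
model = MultiTaskDensityNet()
images = torch.randn(4, 3, 224, 224)
density_maps, class_logits = model(images)
print(density_maps.shape, class_logits.shape)  # (4, 1, 7, 7), (4, 3)
```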