A Systematic Review of Event-Matching Methods for Complex Event Detection in Video Streams
Abstract
:1. Introduction
- Reviewing the literature on complex events matching in videos over the past 12 years (2012–2024);
- Discussing how these studies have addressed gaps in CED in videos and outline considerations for future research in this field;
- Extracting the most critical challenges of complex event matching from these studies.
2. Research Method
2.1. Eligibility Criteria
2.2. Information Sources
2.3. Search Strategy
2.4. Selection Process
2.5. Data Collection Process
2.6. Data Items
2.7. Study Risk of Bias Assessment
2.8. Synthesis Methods
2.9. Reporting Bias Assessment
2.10. Certainty Assessment
3. Complex Event Detection
3.1. Definitions
3.2. Applications and Frameworks
3.3. Complex Event Frameworks Challenges
4. Event-Matching Methods
4.1. Multi-Source Solutions
4.2. Approaches Based on Training and Predicting of Videos
4.3. Object Detection and Spatio–Temporal Matching
4.4. Other Solutions
5. Challenges of Complex Event Matching
5.1. Composite Metric for Method Evaluation
- -
- Accuracy: How effectively the method detects and matches events.
- -
- Robustness: The ability of the method to handle noisy or incomplete data.
- -
- Low Latency: Efficiency of the method in real-time applications.
- -
- Scalability: How well the method adapts to larger datasets or complex events.
- -
- Ease of Implementation: Practicality in terms of computational resources and required expertise.
5.2. Intra-Category Ranking
- Multi-source Solutions
- IoT Framework: The IoT framework ranks highly across accuracy, robustness, scalability, and latency, although it scores lower on ease of implementation. Its strengths lie in integrating data from multiple sensors, which enhances accuracy and robustness while allowing for a moderate reduction in latency.
- Multimodal Feature Fusion: This method scores high in accuracy but falls short in scalability and ease of implementation. It ranks moderately due to its balanced performance across most criteria but lacks consistency in latency optimization.
- Low-Level Feature Integration and Hierarchical Event Fusion: Although this method ranks well in accuracy, its limitations in robustness and scalability lead to a lower overall ranking.
- Event Matching by Training and Predicting
- Merging Frames and Vector Encoders: This method ranks highly across accuracy, latency, and moderately in robustness. It is particularly effective in scenarios where timely event matching is critical.
- Multiple Instances Learning: MIL ranks well in accuracy and robustness but lacks in scalability and ease of implementation. It is a strong candidate for environments where robustness is prioritized.
- Spatial and Temporal Key Evidence: This approach shows consistent performance across all criteria, making it well-rounded but without excelling in any single dimension.
- Cell-by-Cell Basis Feature Fusion: Despite high accuracy and robustness, this method’s significant challenges with latency and scalability limit its overall ranking.
- Object Detection and Spatio–temporal Matching
- Event Knowledge Graph and Dividing Objects into Static and Dynamic Categories: This method excels in accuracy, robustness, and scalability, making it ideal for scenarios requiring high accuracy and adaptable event matching.
- Event Query Optimization: With high rankings in low latency and scalability, this method performs well in real-time applications where speed is prioritized over accuracy.
- Other Solutions
- Zero Examplers and NLP-Aided Feature Extraction: This approach performs highly across robustness, scalability, and ease of implementation, making it valuable for environments where innovative feature extraction is needed.
- Feature Extraction and Bag-of-Words (BOW) Descriptors: With moderate scores across all criteria, this method is balanced but lacks standout strengths.
5.3. Inter-Category Ranking
- Top Performers
- Balanced Approaches
5.4. Event-Matching Challenges
- Object Detection and Tracking
- Small Training Dataset Size for Visual Examples
- Optical Flow Failure
- Noisy and Unreliable Labels
- Latency Problems in Event Recognition
- Designing Ontology of Objects and Relationships for Complex Events
- Spatial Relationships Extraction
6. Discussion
6.1. Research Trends
6.2. Bias Assessment
6.3. Certainty Assessment
- -
- Risk of Bias: The level of bias in individual studies can reduce certainty. If the studies included in a category show high risk of bias (due to flaws in study design, selective reporting, or conflicts of interest), certainty is downgraded.
- -
- Inconsistency: If the results across studies within a category are highly variable or inconsistent (e.g., differing effect sizes), the certainty is reduced. Consistency indicates reliable evidence, while large differences in outcomes suggest uncertainty.
- -
- Indirectness: Certainty decreases if the studies do not directly address the specific research question or population. For instance, studies that only indirectly address complex event matching in videos would lower certainty.
- -
- Imprecision: If studies have wide confidence intervals or include small sample sizes, they provide less precise estimates of the effect, leading to lower certainty.
- -
- Publication Bias: Evidence certainty is reduced if there is suspected or confirmed reporting bias (i.e., studies with negative or non-significant findings are missing). Funnel plots and other tools help assess this bias.
6.4. Comparative Analysis of Methods by Application Domains
- Security Surveillance
- Smart City Analytics
- Sports Event Analysis
- New Researchers
- Focused Development
6.5. Categories Correlation Analysis
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
Abbreviations
Abbreviations | Definitions |
CED | Complex event detection |
DSMS | Data stream management system |
MRFR | Market Research Future |
CEP | Complex event processing |
PRISMA | Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
GRADE | Grading of Recommendations Assessment, Development and Evaluation |
CER | Complex event recognition |
IoT | Internet of Things |
EPL | Event Processing Language |
SQL | Structured Query Language |
BOW | Bag-of-Words |
MAFnet | Multimodal attentive fusion network |
FiLM | Feature-wise linear modulation |
NLP | Natural language processing |
AVRL | Audio–video representation learning |
CNN | Convolutional neural network |
MCR | Multi-level collaborative regression |
MIL | Multiple instances learning |
IDT | Improved dense trajectories |
DCNN | Deep convolutional neural network |
S2L | Single-StreamLine |
LSTM | Long short-term memory |
VEQL | Video Event Query Language |
VEKG | Video Event Knowledge Graph |
MERN | Multimedia event relation network |
TAG | Time-aggregated graph |
NOP | Notification- Oriented Paradigm |
SQC | Static query chain |
DQC | Dynamic query chain |
SIN | Semantic Index dataset |
EACI | Event-adaptive concept integration |
ASUC | Area score under curve |
CCTV | Closed-circuit television |
SVM | Support vector machine |
SIFT | Scale-invariant feature transform |
GITS | Gini-Index Text |
MoSIFT | Motion SIFT |
References
- Liang, S.H.; Saeedi, S.; Ojagh, S.; Honarparvar, S.; Kiaei, S.; Jahromi, M.M.; Squires, J. An Interoperable Architecture for the Internet of COVID-19 Things (IoCT) Using Open Geospatial Standards—Case Study: Workplace Reopening. Sensors 2021, 21, 50. [Google Scholar] [CrossRef] [PubMed]
- Luckham, D. The Power of Events; Addison-Wesley Reading: Boston, MA, USA, 2002; Volume 204. [Google Scholar]
- Yao, W.; Chu, C.-H.; Li, Z. Leveraging complex event processing for smart hospitals using RFID. J. Netw. Comput. Appl. 2011, 34, 799–810. [Google Scholar] [CrossRef]
- Wu, E.; Diao, Y.; Rizvi, S. High-performance complex event processing over streams. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, Chicago, IL, USA, 27–29 June 2006; pp. 407–418. [Google Scholar]
- Wang, D. Extending Complex Event Processing for Advanced Applications; Worcester Polytechnic Institute: Worcester, MA, USA, 2013. [Google Scholar]
- Cugola, G.; Margara, A. Processing flows of information: From data stream to complex event processing. ACM Comput. Surv. (CSUR) 2012, 44, 1–62. [Google Scholar] [CrossRef]
- Coppola, J. Announcing the 2021 State of Video Report; Wistia: Cambridge, MA, USA, 2021. [Google Scholar]
- MRFR (Market Research Future). AI Camera Market Research Report: By Type (Smartphone Cameras, Surveillance Cameras, DSLRs, others), By Technology (Image/Face Recognition, Speech/Voice Recognition, Computer Vision, others) and by Region (North America, Europe, Asia-Pacific, Middle East & Africa and South America)—Forecast till 2027; Market Research Future: Pune, India, 2021; p. 111. [Google Scholar]
- Bazhenov, N.; Korzun, D. Event-driven video services for monitoring in edge-centric internet of things environments. In Proceedings of the 2019 25th Conference of Open Innovations Association (FRUCT), Helsinki, Finland, 5–8 November 2019; pp. 47–56. [Google Scholar]
- Vu, V.-T.; Brémond, F.; Davini, G.; Thonnat, M.; Pham, Q.-C.; Allezard, N.; Sayd, P.; Rouas, J.-L.; Ambellouis, S.; Flancquart, A. Audio-video event recognition system for public transport security. In Proceedings of the 2006 IET Conference on Crime and Security, London, UK, 13–14 June 2006; pp. 414–419. [Google Scholar]
- Knoch, S.; Ponpathirkoottam, S.; Schwartz, T. Video-to-model: Unsupervised trace extraction from videos for process discovery and conformance checking in manual assembly. In Proceedings of the Business Process Management: 18th International Conference, BPM 2020, Seville, Spain, 13–18 September 2020; pp. 291–308. [Google Scholar]
- Li, Z.; Katsifodimos, A.; Bozzon, A.; Houben, G. Complex Event Processing on Real-time Video Streams. In Proceedings of the VLDB 2020 PhD Workshop, Tokyo, Japan, 31 August 2020. [Google Scholar]
- Yadav, P.; Curry, E. VidCEP: Complex Event Processing Framework to Detect Spatiotemporal Patterns in Video Streams. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 2513–2522. [Google Scholar]
- Bhattacharya, S.; Yu, F.X.; Chang, S.-F. Minimally needed evidence for complex event recognition in unconstrained videos. In Proceedings of the International Conference on Multimedia Retrieval, Glasgow, UK, 1–4 April 2014; pp. 105–112. [Google Scholar]
- Giatrakos, N.; Alevizos, E.; Artikis, A.; Deligiannakis, A.; Garofalakis, M. Complex event recognition in the big data era: A survey. VLDB J. 2020, 29, 313–352. [Google Scholar] [CrossRef]
- Vandenhouten, R.; Holland-Moritz, R. A software architecture for intelligent facility management based on complex event processing. Wiss. Beiträge 2012, 16, 57–62. [Google Scholar] [CrossRef]
- Chuanfei, X.; Shukuan, L.; Lei, W.; Jianzhong, Q. Complex event detection in probabilistic stream. In Proceedings of the 2010 12th International Asia-Pacific Web Conference, Busan, Republic of Korea, 6–8 April 2010; pp. 361–363. [Google Scholar]
- Wang, J.; Ji, B.; Lin, F.; Lu, S.; Lan, Y.; Cheng, L. A multiple pattern complex event detection scheme based on decomposition and merge sharing for massive event streams. Int. J. Distrib. Sens. Netw. 2020, 16, 1550147720961336. [Google Scholar] [CrossRef]
- Artikis, A.; Sergot, M.; Paliouras, G. An event calculus for event recognition. IEEE Trans. Knowl. Data Eng. 2014, 27, 895–908. [Google Scholar] [CrossRef]
- Sokha, Y.; Jeong, K.; Lee, J.; Joe, W. A complex event processing system approach to real time road traffic event detection. J. Converg. Inf. Technol. 2013, 8, 142. [Google Scholar]
- Wang, W.; Sung, J.; Kim, D. Complex event processing in epc sensor network middleware for both rfid and wsn. In Proceedings of the 2008 11th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC), Orlando, FL, USA, 5–7 May 2008; pp. 165–169. [Google Scholar]
- Keskisärkkä, R.; Blomqvist, E. Semantic complex event processing for social media monitoring-a survey. In Proceedings of the Social Media and Linked Data for Emergency Response (SMILE) Co-located with the 10th Extended Semantic Web Conference, Montpellier, France, 26–30 May 2013. [Google Scholar]
- Megargel, A.; Shankararaman, V.; Reddy, S.K. Real-time inbound marketing: A use case for digital banking. In Handbook of Blockchain, Digital Finance, and Inclusion, Volume 1; Elsevier: Amsterdam, The Netherlands, 2018; pp. 311–328. [Google Scholar]
- Michelson, B.M. Event-driven architecture overview. Patricia Seybold Group 2006, 2, 10–1571. [Google Scholar]
- Suntinger, M.; Obweger, H.; Schiefer, J.; Groller, M.E. The event tunnel: Interactive visualization of complex event streams for business process pattern analysis. In Proceedings of the 2008 IEEE Pacific Visualization Symposium, Kyoto, Japan, 5–7 March 2008; pp. 111–118. [Google Scholar]
- Chang, X.; Yu, Y.-L.; Yang, Y.; Xing, E.P. Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1617–1632. [Google Scholar] [CrossRef]
- Khazael, B.; Malazi, H.T.; Clarke, S. Complex event processing in smart city monitoring applications. IEEE Access 2021, 9, 143150–143165. [Google Scholar] [CrossRef]
- Dhillon, A.S.; Majumdar, S.; St-Hilaire, M.; El-Haraki, A. A mobile complex event processing system for remote patient monitoring. In Proceedings of the 2018 IEEE International Congress on Internet of Things (ICIOT), San Francisco, CA, USA, 2–7 July 2018; pp. 180–183. [Google Scholar]
- Widder, A.; von Ammon, R.; Schaeffer, P.; Wolff, C. Combining discriminant analysis and neural networks for fraud detection on the base of complex event processing. In Proceedings of the Second International Conference on Distributed Event-Based Systems, Rome, Italy, 1–4 July 2008; p. 6830. [Google Scholar]
- Fülöp, L.J.; Beszédes, Á.; Tóth, G.; Demeter, H.; Vidács, L.; Farkas, L. Predictive complex event processing: A conceptual framework for combining complex event processing and predictive analytics. In Proceedings of the Fifth Balkan Conference in Informatics, Novi Sad, Serbia, 16–20 September 2012; pp. 26–31. [Google Scholar]
- Liu, X.; Cao, J.; Tang, S.; Guo, P. Fault tolerant complex event detection in WSNs: A case study in structural health monitoring. IEEE Trans. Mob. Comput. 2015, 14, 2502–2515. [Google Scholar] [CrossRef]
- Naseri, M.M.; Tabibian, S.; Homayounvala, E. Adaptive and personalized user behavior modeling in complex event processing platforms for remote health monitoring systems. Artif. Intell. Med. 2022, 134, 102421. [Google Scholar] [CrossRef]
- Wang, D.; Rundensteiner, E.A.; Wang, H.; Ellison III, R.T. Active complex event processing: Applications in real-time health care. Proc. VLDB Endow. 2010, 3, 1545–1548. [Google Scholar] [CrossRef]
- Honarparvar, S.; Saeedi, S.; Liang, S.; Squires, J. Design and Development of an Internet of Smart Cameras Solution for Complex Event Detection in COVID-19 Risk Behaviour Recognition. ISPRS Int. J. Geo-Inf. 2021, 10, 81. [Google Scholar] [CrossRef]
- de Moura, I.R.; e Silva, F.J.d.S.; Coutinho, L.R.; Teles, A.S. Mental health ubiquitous monitoring: Detecting context-enriched sociability patterns through complex event processing. In Proceedings of the 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), Rochester, MN, USA, 28–30 July 2020; pp. 239–244. [Google Scholar]
- Wang, Y.; Cao, K. A proactive complex event processing method for large-scale transportation internet of things. Int. J. Distrib. Sens. Netw. 2014, 10, 159052. [Google Scholar] [CrossRef]
- Wang, Y.; Cao, K.; Zhang, X. Complex event processing over distributed probabilistic event streams. Comput. Math. Appl. 2013, 66, 1808–1821. [Google Scholar] [CrossRef]
- Hussonnois, F. Kafka Streams CEP. Available online: https://github.com/fhussonnois/kafkastreams-cep (accessed on 8 November 2024).
- Lu, J. Eagle: Real Time Data Processing System Based on Flink and CEP. Available online: https://github.com/luxiaoxun/eagle (accessed on 8 November 2024).
- Kolarov, V.; Sedlacek, J.; Badger, T.G. Rivus CEP. Available online: https://github.com/vascokk/rivus_cep (accessed on 8 November 2024).
- Itsubaki. Gostream. Available online: https://github.com/itsubaki/gostream (accessed on 8 November 2024).
- Kougioumtzi, E.; Kontaxakis, A.; Deligiannakis, A.; Kotidis, Y. Towards creating a generalized complex event processing operator using FlinkCEP: Architecture & benchmark. In Proceedings of the 15th ACM International Conference on Distributed and Event-Based Systems, Virtual, 28 June–2 July 2021; pp. 188–189. [Google Scholar]
- Bernhardt, T.; Kaicao; Choly, I.; Yang, G.; Shelton, M.; Leitschuh, J. Esper—Complex Event Processing, Streaming SQL and Event Series Analysis for Java. Available online: https://github.com/espertechinc/esper (accessed on 8 November 2024).
- Alevizos, E.; Artikis, A. Being logical or going with the flow? A comparison of complex event processing systems. In Proceedings of the Hellenic Conference on Artificial Intelligence, Ioannina, Greece, 15–17 May 2014; pp. 460–474. [Google Scholar]
- Albek, E.; Bax, E.; Billock, G.; Chandy, K.M.; Swett, I. An event processing language (epl) for building sense and respond applications. In Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium, Denver, CO, USA, 4–8 April 2005; p. 5. [Google Scholar]
- Suhothayan, S.; Gajasinghe, K.; Loku Narangoda, I.; Chaturanga, S.; Perera, S.; Nanayakkara, V. Siddhi: A second look at complex event processing architectures. In Proceedings of the 2011 ACM Workshop on Gateway Computing Environments, Seattle WA, USA, 18 November 2011; pp. 43–50. [Google Scholar]
- Cardinale, Y.; Freites, G.; Valderrama, E.; Aguilera, A.; Angsuchotmetee, C. Semantic framework of event detection in emergency situations for smart buildings. Digit. Commun. Netw. 2022, 8, 64–79. [Google Scholar] [CrossRef]
- Barbieri, D.F.; Braga, D.; Ceri, S.; Della Valle, E.; Grossniklaus, M. C-SPARQL: SPARQL for continuous querying. In Proceedings of the 18th International Conference on World Wide Web, Madrid Spain, 20–24 April 2009; pp. 1061–1062. [Google Scholar]
- Anicic, D.; Fodor, P.; Rudolph, S.; Stojanovic, N. EP-SPARQL: A unified language for event processing and stream reasoning. In Proceedings of the 20th International Conference on World Wide Web, Hyderabad India, 28 March–1 April 2011; pp. 635–644. [Google Scholar]
- Hasan, S.; Curry, E. Approximate semantic matching of events for the internet of things. ACM Trans. Internet Technol. (TOIT) 2014, 14, 1–23. [Google Scholar] [CrossRef]
- Teymourian, K.; Coskun, G.; Paschke, A. Modular Upper-Level Ontologies for Semantic Complex Event Processing. In Modular Ontologies; IOS Press: Amsterdam, The Netherlands, 2010; pp. 81–93. [Google Scholar]
- Binnewies, S.; Stantic, B. OECEP: Enriching complex event processing with domain knowledge from ontologies. In Proceedings of the Fifth Balkan Conference in Informatics, Novi Sad, Serbia, 16–20 September 2012; pp. 20–25. [Google Scholar]
- Doulamis, N.D.; Kokkinos, P.; Varvarigos, E. Resource selection for tasks with time requirements using spectral clustering. IEEE Trans. Comput. 2012, 63, 461–474. [Google Scholar] [CrossRef]
- Akdere, M.; Çetintemel, U.; Tatbul, N. Plan-based complex event detection across distributed sources. Proc. VLDB Endow. 2008, 1, 66–77. [Google Scholar] [CrossRef]
- Jones, M.T. Process Real-Time Big Data with Twitter Storm; DeveloperWorks IBM: Armonk, NY, USA, 2013; pp. 83–84. [Google Scholar]
- Zhou, Q.; Simmhan, Y.; Prasanna, V. Knowledge-infused and consistent Complex Event Processing over real-time and persistent streams. Future Gener. Comput. Syst. 2017, 76, 391–406. [Google Scholar] [CrossRef]
- Esposito, C.; Ficco, M.; Palmieri, F.; Castiglione, A. A knowledge-based platform for big data analytics based on publish/subscribe services and stream processing. Knowl.-Based Syst. 2015, 79, 3–17. [Google Scholar] [CrossRef]
- Skarlatidis, A.; Paliouras, G.; Artikis, A.; Vouros, G.A. Probabilistic event calculus for event recognition. ACM Trans. Comput. Log. (TOCL) 2015, 16, 1–37. [Google Scholar] [CrossRef]
- Liu, F.; Deng, D.; Li, P. Dynamic context-aware event recognition based on Markov logic networks. Sensors 2017, 17, 491. [Google Scholar] [CrossRef]
- Rincé, R.; Kervarc, R.; Leray, P. Complex event processing under uncertainty using Markov chains, constraints, and sampling. In Proceedings of the International Joint Conference on Rules and Reasoning, Luxembourg, 18–21 September 2018; pp. 147–163. [Google Scholar]
- Wang, J.; Cheng, L.; Liu, J. A Complex Event Detection Method formulti-probability RFID Event Stream. J. Softw. 2014, 9, 834. [Google Scholar]
- Cugola, G.; Margara, A.; Matteucci, M.; Tamburrelli, G. Introducing uncertainty in complex event processing: Model, implementation, and validation. Computing 2015, 97, 103–144. [Google Scholar] [CrossRef]
- Natarajan, P.; Wu, S.; Vitaladevuni, S.; Zhuang, X.; Tsakalidis, S.; Park, U.; Prasad, R.; Natarajan, P. Multimodal feature fusion for robust event detection in web videos. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 1298–1305. [Google Scholar]
- Jhuo, I.-H.; Lee, D. Video event detection via multi-modality deep learning. In Proceedings of the 2014 22nd International Conference on Pattern Recognition, Stockholm, Sweden, 24–28 August 2014; pp. 666–671. [Google Scholar]
- Oh, S.; McCloskey, S.; Kim, I.; Vahdat, A.; Cannons, K.J.; Hajimirsadeghi, H.; Mori, G.; Perera, A.A.; Pandey, M.; Corso, J.J. Multimedia event detection with multimodal feature fusion and temporal concept localization. Mach. Vis. Appl. 2014, 25, 49–69. [Google Scholar] [CrossRef]
- Shahad, R.A.; Bein, L.G.; Saad, M.H.M.; Hussain, A. Complex event detection in an intelligent surveillance system using CAISER platform. In Proceedings of the 2016 International Conference on Advances in Electrical, Electronic and Systems Engineering (ICAEES), Putrajaya, Malaysia, 14–16 November 2016; pp. 129–133. [Google Scholar]
- Brousmiche, M.; Rouat, J.; Dupont, S. Multimodal Attentive Fusion Network for audio-visual event recognition. Inf. Fusion 2022, 85, 52–59. [Google Scholar] [CrossRef]
- Gao, J.; Yang, H.; Gong, M.; Li, X. Audio–visual representation learning for anomaly events detection in crowds. Neurocomputing 2024, 582, 127489. [Google Scholar] [CrossRef]
- Ma, Z.; Yang, Y.; Xu, Z.; Yan, S.; Sebe, N.; Hauptmann, A.G. Complex event detection via multi-source video attributes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 2627–2633. [Google Scholar]
- Tang, K.; Fei-Fei, L.; Koller, D. Learning latent temporal structure for complex event detection. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 1250–1257. [Google Scholar]
- Yang, Y.; Ma, Z.; Xu, Z.; Yan, S.; Hauptmann, A.G. How related exemplars help complex event detection in web videos? In Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia, 1–8 December 2013; pp. 2104–2111. [Google Scholar]
- Lai, K.-T.; Yu, F.X.; Chen, M.-S.; Chang, S.-F. Video event detection by inferring temporal instance labels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 2243–2250. [Google Scholar]
- Xu, Z.; Tsang, I.W.; Yang, Y.; Ma, Z.; Hauptmann, A.G. Event detection using multi-level relevance labels and multiple features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 97–104. [Google Scholar]
- Gan, C.; Wang, N.; Yang, Y.; Yeung, D.-Y.; Hauptmann, A.G. Devnet: A deep event network for multimedia event detection and evidence recounting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 2568–2577. [Google Scholar]
- Abbasnejad, I.; Sridharan, S.; Denman, S.; Fookes, C.; Lucey, S. Complex event detection using joint max margin and semantic features. In Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, Australia, 30 November–2 December 2016; pp. 1–8. [Google Scholar]
- Liu, H.; Zheng, Q.; Li, Z.; Qin, T.; Zhu, L. An efficient multi-feature SVM solver for complex event detection. Multimed. Tools Appl. 2018, 77, 3509–3532. [Google Scholar] [CrossRef]
- Dao, M.-S.; Zettsu, K. Complex event analysis of urban environmental data based on deep CNN of spatiotemporal raster images. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018; pp. 2160–2169. [Google Scholar]
- Xu, Z.; Su, L.; Wang, S.; Huang, Q.; Zhang, Y. S2L: Single-streamline for complex video event detection. In Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), San Diego, CA, USA, 23–27 July 2018; pp. 1–6. [Google Scholar]
- Luo, M.; Chang, X.; Gong, C. Reliable shot identification for complex event detection via visual-semantic embedding. Comput. Vis. Image Underst. 2021, 213, 103300. [Google Scholar] [CrossRef]
- Alanazi, T.; Muhammad, G. Human fall detection using 3D multi-stream convolutional neural networks with fusion. Diagnostics 2022, 12, 3060. [Google Scholar] [CrossRef] [PubMed]
- Vilamala, M.R.; Xing, T.; Taylor, H.; Garcia, L.; Srivastava, M.; Kaplan, L.; Preece, A.; Kimmig, A.; Cerutti, F. DeepProbCEP: A neuro-symbolic approach for complex event processing in adversarial settings. Expert Syst. Appl. 2023, 215, 119376. [Google Scholar] [CrossRef]
- Fei, M.; Jiang, W.; Mao, W. Creating personalized video summaries via semantic event detection. J. Ambient Intell. Humaniz. Comput. 2023, 14, 14931–14942. [Google Scholar] [CrossRef]
- Chakraborty, I.; Cheng, H.; Javed, O. Entity Centric Feature Pooling for Complex Event Detection. In Proceedings of the 1st ACM International Workshop on Human Centered Event Understanding from Multimedia, Orlando, FL, USA, 7 November 2014; pp. 1–5. [Google Scholar]
- Ye, G.; Li, Y.; Xu, H.; Liu, D.; Chang, S.-F. Eventnet: A large scale structured concept library for complex event detection in video. In Proceedings of the 23rd ACM International Conference on Multimedia, New York, NY, USA, 26–30 October 2015; pp. 471–480. [Google Scholar]
- Chen, C.Y.; Fu, J.H.; Sung, T.; Wang, P.-F.; Jou, E.; Feng, M.-W. Complex event processing for the internet of things and its applications. In Proceedings of the 2014 IEEE International Conference on Automation Science and Engineering (CASE), Taipei, Taiwan, 18–22 August 2014; pp. 1144–1149. [Google Scholar]
- Ke, J.; Chen, X.-J.; Chen, B.-D.; Xu, H.; Zhang, J.-G.; Jiang, X.-M.; Wang, M.-R.; Chen, X.-B.; Zhang, Q.-Q.; Cai, W.-H. Complex Event Detection in Video Streams. In Proceedings of the 2016 IEEE Symposium on Service-Oriented System Engineering (SOSE), Oxford, UK, 29 March–2 April 2016; pp. 172–179. [Google Scholar]
- Coşar, S.; Donatiello, G.; Bogorny, V.; Garate, C.; Alvares, L.O.; Brémond, F. Toward abnormal trajectory and event detection in video surveillance. IEEE Trans. Circuits Syst. Video Technol. 2016, 27, 683–695. [Google Scholar] [CrossRef]
- Li, C.; Huang, Z.; Yang, Y.; Cao, J.; Sun, X.; Shen, H.T. Hierarchical latent concept discovery for video event detection. IEEE Trans. Image Process. 2017, 26, 2149–2162. [Google Scholar] [CrossRef]
- Zhang, H.; Ananthanarayanan, G.; Bodik, P.; Philipose, M.; Bahl, P.; Freedman, M.J. Live video analytics at scale with approximation and {Delay-Tolerance}. In Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17), Boston, MA, USA, 27–29 March 2017; pp. 377–392. [Google Scholar]
- Kang, D.; Bailis, P.; Zaharia, M. BlazeIt: Optimizing declarative aggregation and limit queries for neural network-based video analytics. arXiv 2018, arXiv:1805.01046. [Google Scholar] [CrossRef]
- Khan, A.; Serafini, L.; Bozzato, L.; Lazzerini, B. Event Detection from Video Using Answer Set Programing. In Proceedings of the CILC, Pescaia, Italy, 16–19 June 2019; pp. 48–58. [Google Scholar]
- Yadav, P.; Curry, E. VEKG: Video Event Knowledge Graph to Represent Video Streams for Complex Event Pattern Matching. In Proceedings of the 2019 First International Conference on Graph Computing (GC), Laguna Hills, CA, USA, 25–27 September 2019; pp. 13–20. [Google Scholar]
- Yadav, P. High-performance complex event processing framework to detect event patterns over video streams. In Proceedings of the 20th International Middleware Conference Doctoral Symposium, Davis, CA, USA, 9–13 December 2019; pp. 47–50. [Google Scholar]
- Yadav, P.; Curry, E. Visual Semantic Multimedia Event Model for Complex Event Detection in Video Streams. arXiv 2020, arXiv:2009.14525. [Google Scholar]
- Yadav, P.; Salwala, D.; Das, D.P.; Curry, E. Knowledge Graph Driven Approach to Represent Video Streams for Spatiotemporal Event Pattern Matching in Complex Event Processing. Int. J. Semant. Comput. 2020, 14, 423–455. [Google Scholar] [CrossRef]
- Patel, A.S.; Merlino, G.; Puliafito, A.; Vyas, R.; Vyas, O.; Ojha, M.; Tiwari, V. An NLP-guided ontology development and refinement approach to represent and query visual information. Expert Syst. Appl. 2023, 213, 118998. [Google Scholar] [CrossRef]
- Kossoski, C.; Simão, J.M.; Lopes, H.S. Modeling and Performance Analysis of a Notification-Based Method for Processing Video Queries on the Fly. Appl. Sci. 2024, 14, 3566. [Google Scholar] [CrossRef]
- Tamrakar, A.; Ali, S.; Yu, Q.; Liu, J.; Javed, O.; Divakaran, A.; Cheng, H.; Sawhney, H. Evaluation of low-level features and their combinations for complex event detection in open source videos. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 3681–3688. [Google Scholar]
- Yan, Y.; Yang, Y.; Meng, D.; Liu, G.; Tong, W.; Hauptmann, A.G.; Sebe, N. Event oriented dictionary learning for complex event detection. IEEE Trans. Image Process. 2015, 24, 1867–1878. [Google Scholar] [CrossRef] [PubMed]
- Chang, X.; Yang, Y.; Long, G.; Zhang, C.; Hauptmann, A. Dynamic concept composition for zero-example event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016. [Google Scholar]
- de Boer, M.; Schutte, K.; Kraaij, W. Knowledge based query expansion in complex multimedia event detection. Multimed. Tools Appl. 2016, 75, 9025–9043. [Google Scholar] [CrossRef]
- Li, Z.; Yao, L.; Chang, X.; Zhan, K.; Sun, J.; Zhang, H. Zero-shot event detection via event-adaptive concept relevance mining. Pattern Recognit. 2019, 88, 595–603. [Google Scholar] [CrossRef]
- Li, Z.; Chang, X.; Yao, L.; Pan, S.; Zongyuan, G.; Zhang, H. Grounding visual concepts for zero-shot event detection and event captioning. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Virtual, 6–10 July 2020; pp. 297–305. [Google Scholar]
- Jin, Y.; Jiang, W.; Yang, Y.; Mu, Y. Zero-Shot Video Event Detection with High-Order Semantic Concept Discovery and Matching. IEEE Trans. Multimed. 2021, 24, 1896–1908. [Google Scholar] [CrossRef]
- Mazon-Olivo, B.; Hernández-Rojas, D.; Maza-Salinas, J.; Pan, A. Rules engine and complex event processor in the context of internet of things for precision agriculture. Comput. Electron. Agric. 2018, 154, 347–360. [Google Scholar] [CrossRef]
- Mousheimish, R. Combining the Internet of Things, Complex Event Processing, and Time Series Classification for a Proactive Business Process Management. Ph.D. Thesis, Université Paris Saclay (COmUE), Gif-sur-Yvette, France, 2017. [Google Scholar]
Research Method Items | Assessment Approach |
---|---|
Eligibility Criteria | Inclusion of 2012 to 2024, written in English, and providing quantitative results related papers |
Information Sources | IEEE, Google Scholar |
Search Strategy | Boolean and proximity search of keywords |
Selection Process | Abstract and title screening by single reviewer |
Data items | Accuracy of event matching, computational efficiency, and applicability to real-time analysis |
Study Risk of Bias Assessment | Quantifying risks of reporting outcomes, selection, data input, and evaluation metrics biases by low, medium, and high scores |
Synthesis Methods | Grouping the studies based on the type of event detection method and statistical and comparative analysis |
Certainty Assessment | GRADE framework |
CED Framework | Processing Method |
---|---|
FlinkCEP | Powerful parallel processing to resolve expensive computer cluster solutions |
Esper | Upside-down traditional relational database |
Siddhi | Microservices and event query API for streaming |
KafkaStreamsCEP | Handling state with local RocksDB stores and stateful conditions for pattern matching |
Eagle | Uses a distributed stream processing framework and dynamic rule-based detection |
RivusCEP | Pattern matching based on a directed-graph finite state machine and streaming data using a declarative SQL-like engine |
GoStream | Running event aggregation, temporal constraints, and deriving insights from data streams. |
Event-Matching Method | Advantages | Disadvantages |
---|---|---|
Multi-source solutions |
|
|
Training and predicting of videos |
|
|
Object detection and Spatio-temporal matching |
|
|
Other solutions |
|
|
Inter-Category | Intra-Category | Accuracy | Robustness | LowLatency | Scalability | Ease of Implementation | Composite Rank |
---|---|---|---|---|---|---|---|
Multi-source solutions | IoT framework | high | high | moderate | high | low | high |
Multi-source solutions | Multimodal feature fusion | high | moderate | moderate | low | moderate | moderate |
Multi-source solutions | Low level feature integration and hierarchical event fusion | high | low | moderate | moderate | Low | low |
Training and predicting of videos | Multi-level collaborative regression | moderate | moderate | high | low | moderate | moderate |
Training and predicting of videos | Multiple instances learning | high | moderate | moderate | low | moderate | moderate |
Training and predicting of videos | Spatial and temporal key evidence | high | moderate | moderate | moderate | moderate | moderate |
Training and predicting of videos | Cell-by-cell basis feature fusion | high | high | low | low | low | low |
Training and predicting of videos | Merging frames and vector encoders | high | moderate | high | low | moderate | high |
Object detection and spatio–temporal matching | Abnormal behavior detection | high | moderate | moderate | low | low | moderate |
Object detection and spatio–temporal matching | Event query optimization | moderate | low | high | high | moderate | moderate |
Object detection and spatio–temporal matching | Event Knowledge Graph and dividing objects to static and dynamic categories | high | high | moderate | high | moderate | high |
Other solutions | Feature extraction and BOW descriptors | moderate | moderate | moderate | moderate | moderate | moderate |
Other solutions | Zero examplers and NLP-aided feature extraction | moderate | high | moderate | high | moderate | high |
Challenge | Event Matching Method | Top Rank Intra-Category Solutions |
---|---|---|
Object Detection and Tracking | Object Detection and Spatio–temporal Matching | Event Knowledge Graph and IoT Framework |
Small Training Dataset Size for Visual Examples | Event Matching by Training and Predicting | Zero-Shot techniques |
Optical Flow Failure | Object Detection and Spatio–temporal Matching | Event Knowledge Graph and IoT Framework |
Noisy and Unreliable Labels | Event Matching by Training and Predicting and Other Solutions | Multiple Instances Learning |
Latency Problem of Event Recognition | Multi-source Solutions | Serverless IoT framework |
Designing Ontology of Objects and Relationships for Complex Events | Object Detection and Spatio–temporal Matching | Event Knowledge Graph and IoT Framework |
Spatial Relationships Extraction | Object Detection and Spatio–temporal Matching | Event Knowledge Graph and IoT Framework |
Category | Number of Studies | Risk of Bias | Inconsistency | Indirectness | Imprecision | Publication Bias | Overall Certainty |
---|---|---|---|---|---|---|---|
Object detection and spatio–temporal matching | 30 | Low | Low | Low | Low | Low | High |
Approaches based on training and predicting of videos | 34 | Low | Low | Low | Low | Low | High |
Others | 17 | Moderate | Moderate | Low | Moderate | Low | Moderate |
Multi-source solutions | 13 | High | Moderate | Moderate | High | Moderate | Low |
Category | Object Detection and Spatio–Temporal Matching | Approaches Based on Training and Predicting | Others | Multi-Source Solutions |
---|---|---|---|---|
Object Detection and Spatio–temporal Matching | 30 | 6 | 3 | 2 |
Approaches Based on Training and Predicting | 6 | 34 | 4 | 2 |
Others | 3 | 4 | 17 | 1 |
Multi-source Solutions | 2 | 2 | 1 | 13 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Honarparvar, S.; Ashena, Z.B.; Saeedi, S.; Liang, S. A Systematic Review of Event-Matching Methods for Complex Event Detection in Video Streams. Sensors 2024, 24, 7238. https://doi.org/10.3390/s24227238
Honarparvar S, Ashena ZB, Saeedi S, Liang S. A Systematic Review of Event-Matching Methods for Complex Event Detection in Video Streams. Sensors. 2024; 24(22):7238. https://doi.org/10.3390/s24227238
Chicago/Turabian StyleHonarparvar, Sepehr, Zahra Bagheri Ashena, Sara Saeedi, and Steve Liang. 2024. "A Systematic Review of Event-Matching Methods for Complex Event Detection in Video Streams" Sensors 24, no. 22: 7238. https://doi.org/10.3390/s24227238
APA StyleHonarparvar, S., Ashena, Z. B., Saeedi, S., & Liang, S. (2024). A Systematic Review of Event-Matching Methods for Complex Event Detection in Video Streams. Sensors, 24(22), 7238. https://doi.org/10.3390/s24227238