The surge in internet usage has sparked new demands. Historically, specialized web crawlers were devised to retrieve pages pertaining to specific subjects. However, contemporary needs such as event identification and extraction have gained significance. Conventional web crawlers prove inadequate for these tasks, necessitating exploration of novel techniques for event identification, extraction, and utilization. This study presents an innovative approach for detecting and extracting events using the Whale Optimization Algorithm (WOA) for feature extraction and classification. By integrating this method with machine learning algorithms, the proposed technique exhibits improvements in experiments, including decreased execution time and enhancements in metrics such as Root Mean Square Error (RMSE) and accuracy score. Comparative analysis reveals that the proposed method outperformed alternative models. Nevertheless, when tested across various data models and datasets, the WOA model consistently demonstrated superior performance, albeit exhibiting reduced evaluation metrics for Wikipedia text data.
Oliveira and C. Teixeira Lopes, “The Evolution of Web Search User Interfaces-An Archaeological Analysis of Google Search Engine Result Pages,” in Proceedings of the 2023 Conference on Human Information Interaction and Retrieval, 2023, pp. 55-68. https://doi.org/10.1145/3576840.3578320.
Liu, R. Yahyapour, H. Liu and Y. Hu, “A novel combining method of dynamic and static web crawler with parallel computing,” Multimedia Tools and Applications, pp. 1-22, 2024. https://doi.org/10.1007/s11042-023-17925-y.
Kumar and D. Aggarwal, “LEARNING-based focused WEB crawler,” IETE Journal of Research, vol. 69, no. 4, pp. 2037-2045, 2023. https://doi.org/10.1080/03772063.2021.1885312.
Wu and D. Hou, “A Focused Event Crawler with Temporal Intent,” Applied Sciences, vol. 13, no. 7, pp. 4149, 2023. https://doi.org/10.3390/app13074149.
Xu, B. Wang, Q. Peng and W. Li, “Key-frame reference selection for error resilient video coding using low-delay hierarchical coding structure,” Signal, Image and Video Processing, vol. 18, no. 1, pp. 215-222, 2024. https://doi.org/10.1007/s11760-023-02742-5.
Sebastiani, “Machine learning in automated text categorization,” ACM Computing Surveys (CSUR), vol. 34, no. 1, pp. 1-47, 2002. https://doi.org/10.1145/505282.505283.
C. Aggarwal and C. Zhai, “A survey of text classification algorithms,” in Mining Text Data, MA: Springer, 2012, pp. 163-222. https://doi.org/10.1007/978-1-4614-3223-4_6.
D. M. Rennie and A. McCallum, “Using reinforcement learning to spider the web efficiently,” in Proceedings of the 16th International Conference on Machine Learning (ICML'99), 1999, pp. 335-343.
U. Demirezen and T. S. Navruz, “Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection,” International Journal of Information Security Science, vol. 12, no. 3, pp. 29-59, 2023. https://doi.org/10.55859/ijiss.1338813.
K. N. Dang, D. Bucur, B. Atil, G. Pitel, F. Ruis, H. Kadkhodaei, and Litvak, “Look back, look around: A systematic analysis of effective predictors for new outlinks in focused Web crawling,” Knowledge-Based Systems, vol. 260, p. 110126, 2023. https://doi.org/10.1016/j.knosys.2022.110126.
Wei and U. Lihua, “UCrawler: A learning-based web crawler using a URL knowledge base,” Journal of Computational Methods in Sciences and Engineering, vol. 21, no. 2, pp. 461-474, 2021.
Rajiv and C. Navaneethan, “Keyword weight optimization using gradient strategies in event focused web crawling,” Pattern Recognition Letters, vol. 142, pp. 3-10, 2021. https://doi.org/10.1016/j.patrec.2020.12.003.
Ho, E. Liang, X. Chen, I. Stoica and P. Abbeel, “Population based augmentation: Efficient learning of augmentation policy schedules,” in International conference on machine learning, 2019, pp. 2731-2741. https://proceedings.mlr.press/v97/ho19b.html.
Rios, L. S. Ochi, C. Boeres, V. N. Coelho, I. M. Coelho and R. Farias, “Exploring parallel multi-GPU local search strategies in a metaheuristic framework,” Journal of Parallel and Distributed Computing, vol. 111, pp. 39-55, 2018. https://doi.org/10.1016/j.jpdc.2017.06.011.
Faris, I. Aljarah, M. A. Al-Betar and S. Mirjalili, “Grey wolf optimizer: a review of recent variants and applications,” Neural Computing and Applications, vol. 30, pp. 413-435, 2018. https://doi.org/10.1007/s00521-017-3272-5.
A. Heidari, H. Faris, S. Mirjalili, I. Aljarah and M. Mafarja, “Ant lion optimizer: theory, literature review, and application in multi-layer perceptron neural networks,” Nature-Inspired Optimizers: Theories, Literature Reviews and Applications, pp. 23-46, 2020. https://doi.org/10.1007/978-3-030-12127-3_3.
Abd Elaziz and S. Mirjalili, “A hyper-heuristic for improving the initial population of whale optimization algorithm,” Knowledge-Based Systems, vol. 172, pp. 42-63, 2019. https://doi.org/10.1016/j.knosys.2019.02.010.
Nagpal, P. Singh and B. P. Garg, “Concurrent bacterial foraging with emotional intelligence for global optimization,” International Journal of Information Technology, vol. 11, pp. 313-320, 2019. https://doi.org/10.1007/s41870-018-0215-z.
Aggarwal, P. Chatterjee, R. P. Bhagat, K. K. Purbey and S. J. Nanda, “A social spider optimization algorithm with chaotic initialization for robust clustering,” Procedia Computer Science, vol. 143, pp. 450-457, 2018. https://doi.org/10.1016/j.procs.2018.10.417.
Mirjalili and S. M. Saremi, “Whale optimization algorithm: theory, literature review, and application in designing photonic crystal filters,” Nature-Inspired Optimizers: Theories, Literature Reviews and Applications, pp. 219-238, 2020. https://doi.org/10.1007/978-3-030-12127-3_13.
Mirjalili, Evolutionary Algorithms and Neural Networks, Cham: Springer, 2019, pp. 43-60.
C. Montgomery, E. A. Peck and G. G. Vining, Introduction to Linear Regression Analysis, John Wiley & Sons, 2012.
C. Aggarwal and C. Zhai, “A survey of text classification algorithms,” in Mining Text Data, Boston, MA: Springer, 2012, pp. 163-22. https://doi.org/10.1007/978-1-4614-3223-4_6..
Moradi,H. and Azimzadeh,F. (2023). Focused Crawler for Event Detection Using Metaheuristic Algorithms and Knowledge Extraction. International Journal of Web Research, 6(2), 143-150. doi: 10.22133/ijwr.2024.454772.1215
MLA
Moradi,H. , and Azimzadeh,F. . "Focused Crawler for Event Detection Using Metaheuristic Algorithms and Knowledge Extraction", International Journal of Web Research, 6, 2, 2023, 143-150. doi: 10.22133/ijwr.2024.454772.1215
HARVARD
Moradi H., Azimzadeh F. (2023). 'Focused Crawler for Event Detection Using Metaheuristic Algorithms and Knowledge Extraction', International Journal of Web Research, 6(2), pp. 143-150. doi: 10.22133/ijwr.2024.454772.1215
CHICAGO
H. Moradi and F. Azimzadeh, "Focused Crawler for Event Detection Using Metaheuristic Algorithms and Knowledge Extraction," International Journal of Web Research, 6 2 (2023): 143-150, doi: 10.22133/ijwr.2024.454772.1215
VANCOUVER
Moradi H., Azimzadeh F. Focused Crawler for Event Detection Using Metaheuristic Algorithms and Knowledge Extraction. International Journal of Web Research, 2023; 6(2): 143-150. doi: 10.22133/ijwr.2024.454772.1215