Whale Optimization and AutoML for Precise Phishing Detection

Divya  Singhal; Ankit  Verma; Ganesh V.  Radhakrishnan; Jyoti  Parashar; Saroj S.  Date; Kamal  Upreti

doi:10.13052/jmm1550-4646.2153

Authors

Divya Singhal Department of Computer Science, Noida Institute of Engineering & Technology, Greater Noida, India
Ankit Verma Department of Computer Applications, KIET Group of Institutions, Delhi-NCR, Ghaziabad, India
Ganesh V. Radhakrishnan Department of Economics and Finance, KIIT School of Management (KSOM), KIIT University, Bhubaneswar, India
Jyoti Parashar Department of Computer Applications, Panipat Institute of Engineering &Technology College, Panipat, Haryana, India
Saroj S. Date Department of Artificial Intelligence and Data Science, Chh.Shahu College of Engineering, Kanchanwadi, Paithan Road, Chhatrapati Sambhajinagar (Aurangabad), MS, India
Kamal Upreti Department of Computer Science, Christ University, Delhi NCR, Ghaziabad, India

DOI:

https://doi.org/10.13052/jmm1550-4646.2153

Keywords:

Phishing Attack, Optimization Algorithm, Whale Optimization Algorithm, AutoML Framework, AutoML H2O, Regression Analysis, Random Forest Algorithm

Abstract

Online fraud and social engineering tactics frequently use phishing websites as platforms. Phishers often modify the source code of the web pages they exploit in their attacks to create the illusion that alterations were made to authentic websites. A solitary response is insufficient to mitigate phishing due to the many methods employed in its execution. This study examines machine learning algorithms and evaluates their efficacy when trained on datasets including attributes that differentiate secure websites from phishing sites. Automated algorithms facilitate real-time fraud protection by swiftly detecting suspicious URLs, domain names, and website content. This study aims to identify the optimal method for detecting a prevalent category of cyberattacks. This would enhance the security and privacy of all internet users by facilitating the identification and blocking of malicious websites. Nonetheless, there is an urgent desire for automated models that provide rapid and precise detection. This research introduces a regression-based assessment method for phishing detection to address this demand. Our approach employs a whale optimization algorithm for feature selection. An AutoML framework subsequently utilizes the selected feature subsets as input. The model showed good accuracy in its predictions with very small errors on the test data, shown by an RMSE of 0.1079, an MSE of 0.0116, and an R2 value of 0.9534. These results demonstrate the reliability of our feature selection and modeling methods.

Downloads

Download data is not yet available.

Author Biographies

Divya Singhal, Department of Computer Science, Noida Institute of Engineering & Technology, Greater Noida, India

Divya Singhal is working as an assistant professor in Noida Institute of Engineering and Technology. She has been in academics from last 8 years. She has completed her doctorate from Amity University in smart grid & information security. She has published various papers in reputed journals & conferences.

Ankit Verma, Department of Computer Applications, KIET Group of Institutions, Delhi-NCR, Ghaziabad, India

Ankit Verma is currently serving as an Associate Professor in the Department of Computer Applications at KIET Group of Institutions. He brings with him over 18 years of experience in teaching and research. He regularly serves as a reviewer for reputed journals and international conferences and has been invited as a keynote speaker and session chair at several prestigious IEEE conferences. His research interests include artificial intelligence, IoT, fuzzy logic, and web technologies. He has strong technical expertise in programming languages and technologies. In December 2023, he successfully organized the Scopus-indexed International Conference on Recent Advancements in Computing Technologies & Engineering (RACTE-2023). He has also published books on web technologies. Dr. Ankit Verma is a seasoned faculty member with a focus on AI, IoT, fuzzy logic, and web technologies. He is deeply engaged in research, peer-reviewed publications, and academic leadership, such as organizing conferences and supervising student research.

Ganesh V. Radhakrishnan, Department of Economics and Finance, KIIT School of Management (KSOM), KIIT University, Bhubaneswar, India

Ganesh V. Radhakrishnan is a senior academic, interdisciplinary researcher, and policy consultant with over forty years of experience spanning academia, industry, and government. He holds a Ph.D. in Public Systems from IIM Ahmedabad and an MBA in Operations and Finance from IIM Kozhikode. His research spans economic regulation, infrastructure finance, maritime strategy, and the application of artificial intelligence in complex systems, including supply chains and digital public services.

Currently Senior Professor at KIIT University, Dr. Radhakrishnan has also served as Dean of Faculty Affairs at MIT World Peace University and Associate Dean at Jindal Global Business School. He has led the development of forward-looking academic programs in analytics, financial technology, and digital transformation. He has delivered lectures on emerging topics such as AI applications in logistics, predictive analytics, and digital governance at leading institutions across India and Europe.

Jyoti Parashar, Department of Computer Applications, Panipat Institute of Engineering &Technology College, Panipat, Haryana, India

Jyoti Parashar is currently working as a Professor in the Panipat Institute of Engineering & Technology college (AICTE-approved multidisciplinary institution, PIET is affiliated to Kurukshetra University) Haryana, India. With a strong academic background and expertise in computer science, she plays a pivotal role in shaping the knowledge and skills of engineering students.in the Delhi. She has done her Ph.D. in Computer Science from Maharishi Markedeshwar University, Ambala, Haryana with A++ Grade in India. She has research and teaching experience of more than 10 years. Her research interests span various domains, including artificial intelligence, data science, and emerging technologies in computing. Through her dedication to teaching and research, she actively contributes to the academic community by mentoring students, publishing research papers, and participating in conferences and workshops.

Saroj S. Date, Department of Artificial Intelligence and Data Science, Chh.Shahu College of Engineering, Kanchanwadi, Paithan Road, Chhatrapati Sambhajinagar (Aurangabad), MS, India

Saroj S. Date is an accomplished academician and researcher with over 18+ years of teaching experience in Computer Science & Engineering. Currently she is working as an Associate Professor in the Department of Artificial Intelligence and Data Science at CSMSS Chh. Shahu College of Engineering, Chh. Sambhajinagar. She has an extensive background in Computer Engineering, holding a Ph.D. from Dr. Babasaheb Ambedkar Marathwada University, Chh. Sambhajinagar. She pursued Bachelor of Engg. from SGGS College of Engg. & Tech, Swami Ramanand Teerth Marathwada University, Nanded and Master of Engg. from Dr. Babasaheb Ambedkar Marathwada University, Chh. Sambhajinagar. Her expertise spans diverse subjects, including Machine Learning, Theory of Computation, Compiler Design, and Big Data Analytics. Dr. Date is proficient in programming languages such as Python, Java, C, C++. She has contributed significantly to research with publications on sentiment analysis, natural language processing, and machine learning, including Scopus-indexed journal articles, International journals/conferences and book chapters. Her main research work focuses on Sentiment Analysis, Natural Language Processing, Data Mining, Text Mining, Artificial Intelligence, Machine learning, Deep Learning, Mobile Computing, Big Data Analytics, etc.

Kamal Upreti, Department of Computer Science, Christ University, Delhi NCR, Ghaziabad, India

Kamal Upreti is currently working as an Associate Professor in Department of Computer Science, CHRIST (Deemed to be University), Delhi NCR, Ghaziabad, India. He completed is B. Tech (Hons) Degree from UPTU, M. Tech (Gold Medalist), PGDM(Executive) from IMT Ghaziabad and PhD in Department of Computer Science by & Engineering. He has completed Postdoc from National Taipei University of Business, TAIWAN.

He has published 50+ Patents, 45+ Books, 32+ Magazine issues and 185+ Research papers in in various reputed Journals and international Conferences. His areas of Interest such Artificial Intelligence, Machine Learning, Data Analytics, Cyber Security, Machine Learning, Health Care, Embedded System and Cloud Computing. He has published more than 45+ authored and edited books under CRC Press, IGI Global, Oxford Press and Arihant Publication.

He is the main guest editor of more than 10 special issues of journals including Springer, Taylor and Francis, Inderscience, IGI Global, and Elsevier. He is the main guest associate editor in Frontier Journal Convergence of Artificial Intelligence and Cognitive Systems which is SCIE and SCOPUS having impact factor: 4.7 and cite score: 7.3. He is having enriched years’ experience in corporate and teaching experience in Engineering Colleges.

He worked with HCL, NECHCL, Hindustan Times, Dehradun Institute of Technology and Delhi Institute of Advanced Studies, with more than 15+ years of enrich experience in research, Academics and Corporate.

Currently, he has completed work with Joint collaboration with GB PANT & AIIMS Delhi, under funded project of ICMR Scheme on Cardiovascular diseases prediction strokes using Machine Learning Techniques from year 2017–2020.

He got fund from DST SERB for conducting International Conference, ICSCPS-2024, 13–14 Sept 2024. Recently, he got fund from AICTE – Inter-Institutional Biomedical Innovations and Entrepreneurship Program (AICTE-IBIP) for 2024–2026. He has attended as a Session Chair Person in National, International conference and key note speaker in various platforms such as Skill based training, Corporate Trainer, Guest faculty and faculty development Programme. He awarded as best teacher, best researcher, extra academic performer and Gold Medalist in M.Tech programme.

References

M. Nanda, M. Saraswat, and P. K. Sharma, “Enhancing cybersecurity: A review and comparative analysis of convolutional neural network approaches for detecting URL-based phishing attacks,” e-Prime – Adv. Electr. Eng. Electron. Energy, vol. 8, no. March, p. 100533, 2024, doi: 10.1016/j.prime.2024.100533.

E. S. Shombot, G. Dusserre, R. Bestak, and N. B. Ahmed, “An application for predicting phishing attacks: A case of implementing a support vector machine learning model,” Cyber Secur. Appl., vol. 2, no. January, 2024, doi: 10.1016/j.csa.2024.100036.

V. Dixit and D. Kaur, “Development of Two-Factor Authentication to Mitigate Phishing Attack,” no. July 2019, pp. 787–802, 2024, doi: 10.4236/jsea.2024.1711043.

R. Basnet and A. H. Sung, “Rule-Based Phishing Attack Detection Rule-Based Phishing Attack Detection,” no. October, 2016.

S. Hossain, D. Sarma, and R. J. Chakma, “Machine learning-based phishing attack detection,” Int. J. Adv. Comput. Sci. Appl., vol. 11, no. 9, pp. 378–388, 2020, doi: 10.14569/IJACSA.2020.0110945.

P. E. Reports, P. S. Trends, B. P. Measurement, E. P. Attacks, M. Targeted, and I. Sectors, “Peter Cassidy, PHISHING ACTIVITY TRENDS REPOR, 2024,” no. March, pp. 1–11, 2024.

P. Kalaharsha and B. M. Mehtre, “Detecting Phishing Sites – An Overview,” pp. 1–13, 2021, [Online]. Available: http://arxiv.org/abs/2103.12739.

M. Dadkhah, M. D. Jazi, M. S. Mobarakeh, S. Shamshirband, X. Wang, and S. Raste, “An overview of phishing attacks and their detection techniques,” Int. J. Internet Protoc. Technol., vol. 9, no. 4, 2016, doi: 10.1504/IJIPT.2016.081319.

S. Hawa Apandi, J. Sallim, and R. Mohd Sidek, “Types of anti-phishing solutions for phishing attack,” in IOP Conference Series: Materials Science and Engineering, 2020. doi: 10.1088/1757-899X/769/1/012072.

G. J. W. Kathrine, P. M. Praise, A. A. Rose, and E. C. Kalaivani, “Variants of phishing attacks and their detection techniques,” in Proceedings of the International Conference on Trends in Electronics and Informatics, ICOEI 2019, 2019. doi: 10.1109/ICOEI.2019.8862697.

Z. Salah, H. Abu Owida, E. Abu Elsoud, E. Alhenawi, S. Abuowaida, and N. Alshdaifat, “An Effective Ensemble Approach for Preventing and Detecting Phishing Attacks in Textual Form,” Futur. Internet, vol. 16, no. 11, pp. 1–24, 2024, doi: 10.3390/fi16110414.

K. H. Chy, “Securing the web: Machine learning’s role in predicting and preventing phishing attacks Securing the web: Machine learning’s role in predicting and preventing phishing attacks,” no. September, 2024, doi: 10.30574/ijsra.2024.13.1.1770.

Jain, A. K., and Gupta, B. B. (2021). A survey of phishing attack techniques, defence mechanisms and open research challenges. Enterprise Information Systems, 16(4), 527–565. https://doi.org/10.1080/17517575.2021.1896786.

M. Baykara and Z. Z. Gürel, “Detection of phishing attacks,” 6th Int. Symp. Digit. Forensic Secur. ISDFS 2018 – Proceeding, vol. 2018-January, pp. 1–5, May 2018, doi: 10.1109/ISDFS.2018.8355389.

“Web Phishing Detection Using Web Crawling, Cloud Infrastructure and Deep Learning Framework”, JASTT, vol. 4, no. 01, pp. 54–71, Mar. 2023, doi: 10.38094/jastt401144.

T. Peng, I. Harris, and Y. Sawa, “Detecting Phishing Attacks Using Natural Language Processing and Machine Learning,” in Proceedings – 12th IEEE International Conference on Semantic Computing, ICSC 2018, 2018. doi: 10.1109/ICSC.2018.00056.

A. Basit, M. Zafar, X. Liu, A. R. Javed, Z. Jalil, and K. Kifayat, “A comprehensive survey of AI-enabled phishing attacks detection techniques,” Telecommun. Syst., vol. 76, no. 1, pp. 139–154, Jan. 2021, doi: 10.1007/S11235-020-00733-2/TABLES/5.

R. O. Akinyede and J. A. Adelakun, “Detection and Prevention of Phishing Attack Using Linkguard Algorithm,” J. Inf., vol. 4, no. 1, 2018, doi: 10.18488/journal.104.2018.41.10.23.

N. Q. Do, A. Selamat, O. Krejcar, E. Herrera-Viedma and H. Fujita, “Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions,” in IEEE Access, vol. 10, pp. 36429–36463, 2022, doi: 10.1109/ACCESS.2022.3151903.

Aljofey, A., Jiang, Q., Rasool, A. et al. An effective detection approach for phishing websites using URL and HTML features. Sci Rep 12, 8842 (2022). https://doi.org/10.1038/s41598-022-10841-5.

B. Espinoza, J. Simba, W. Fuertes, E. Benavides, R. Andrade, and T. Toulkeridis, “Phishing attack detection: A solution based on the typical machine learning modeling cycle,” in Proceedings – 6th Annual Conference on Computational Science and Computational Intelligence, CSCI 2019, 2019. doi: 10.1109/CSCI49370.2019.00041.

L. M. Abdulrahman, S. H. Ahmed, Z. N. Rashid, Y. S. Jghef, T. M. Ghazi, and U. H. Jader, “Web Phishing Detection Using Web Crawling, Cloud Infrastructure and Deep Learning Framework,” J. Appl. Sci. Technol. Trends, vol. 4, no. 01, 2023, doi: 10.38094/jastt401144.

Tan, Choon Lin (2018), “Phishing Dataset for Machine Learning: Feature Evaluation”, Mendeley Data, V1, doi: 10.17632/h3cgnj8hft.1.

H2O.ai, “H2O AutoML: Automatic Machine Learning,” H2O.ai, 2024, [Online]. Available: https://docs.h2o.ai/h2o/latest-stable/h2o-docs/automl.html.

R. O. Akinyede and J. A. Adelakun, “Detection and Prevention of Phishing Attack Using Linkguard Algorithm,” J. Inf., vol. 4, no. 1, pp. 10–23, 2018, doi: 10.18488/journal.104.2018.41.10.23.

K. L. Chiew, C. L. Tan, K. S. Wong, K. S. C. Yong, and W. K. Tiong, “A new hybrid ensemble feature selection framework for machine learning-based phishing detection system,” Inf. Sci. (Ny)., vol. 484, pp. 153–166, 2019, doi: 10.1016/j.ins.2019.01.064.

Whale Optimization and AutoML for Precise Phishing Detection

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biographies

Divya Singhal, Department of Computer Science, Noida Institute of Engineering & Technology, Greater Noida, India

Ankit Verma, Department of Computer Applications, KIET Group of Institutions, Delhi-NCR, Ghaziabad, India

Ganesh V. Radhakrishnan, Department of Economics and Finance, KIIT School of Management (KSOM), KIIT University, Bhubaneswar, India

Jyoti Parashar, Department of Computer Applications, Panipat Institute of Engineering &Technology College, Panipat, Haryana, India

Saroj S. Date, Department of Artificial Intelligence and Data Science, Chh.Shahu College of Engineering, Kanchanwadi, Paithan Road, Chhatrapati Sambhajinagar (Aurangabad), MS, India

Kamal Upreti, Department of Computer Science, Christ University, Delhi NCR, Ghaziabad, India

References

Downloads

Published

How to Cite

Issue

Section

Most read articles by the same author(s)

IEEE Xplore

interview

splissue

award

2020 Best Paper Award

issn

cover

Open Access

Make a Submission

subreq

indexed

logo