ARCHIVES

Original Article

AI-Driven Phishing Detection Using Natural Language Processing and Machine Learning

Sarveena S1 Dhanusiya R2 A. Raja3
1 2 Department of Computer Science and Engineering (Cyber Security), United Institute of Technology, Coimbatore, Tamil Nadu, India. 3 Head of the Department, Department of Computer Science and Engineering (Cyber Security), United Institute of Technology, Coimbatore, Tamil Nadu, India.

Published Online: May-June 2026

Pages: 120-128

References

1. A. K. Jain and B. B. Gupta, "Phishing detection: Analysis of visual similarity-based approaches," Security and Communication
Networks, vol. 2017, Article ID 5421046, 2017.
2. S. Marchal, J. Francois, R. State, and T. Engel, "PhishStorm: Detecting phishing with streaming analytics," IEEE Transactions on
Network and Service Management, vol. 11, no. 4, pp. 458-471, 2014.
3. R. Verma and N. Hossain, "Semantic feature selection for text with application to phishing email detection," in Proceedings of the
IEEE ICC, 2017.
4. J. Ma, L. K. Saul, S. Savage, and G. M. Voelker, "Learning to detect malicious URLs," ACM Transactions on Intelligent Systems, vol.
2, no. 3, Art. 30, 2011.
5. M. Aburrous, M. A. Hossain, K. Dahal, and F. Thabtah, "Intelligent phishing detection system for e-banking using fuzzy data mining,"
Expert Systems with Applications, vol. 37, no. 12, pp. 7913-7921, 2010.
6. I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Cambridge, MA: MIT Press, 2016.7. T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean, "Distributed representations of words and phrases and their
compositionality," in Advances in Neural Information Processing Systems (NeurIPS), 2013, pp. 3111-3119.
8. J. Pennington, R. Socher, and C. D. Manning, "GloVe: Global vectors for word representation," in Proc. EMNLP, Doha, Qatar, 2014,
pp. 1532-1543.
9. Y. Kim, "Convolutional neural networks for sentence classification," in Proc. EMNLP, Doha, Qatar, 2014, pp. 1746-1751.
10. A. Vaswani et al., "Attention is all you need," in Advances in Neural Information Processing Systems (NeurIPS), 2017, pp. 5998-6008.
11. J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of deep bidirectional transformers for language
understanding," in Proc. NAACL-HLT, 2019, pp. 4171-4186.
12. F. Pedregosa et al., "Scikit-learn: Machine learning in Python," Journal of Machine Learning Research, vol. 12, pp. 2825-2830, 2011.
13. D. Jurafsky and J. H. Martin, Speech and Language Processing, 3rd ed. Upper Saddle River, NJ: Pearson, 2020.
14. S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
15. Anti-Phishing Working Group (APWG), "Phishing Activity Trends Report, Q4 2023," APWG, Tech. Rep., 2024.
16. R. Basnet, S. Mukkamala, and A. H. Sung, "Detection of phishing attacks: A machine learning approach," in Soft Computing
Applications in Industry, vol. 226, 2008, pp. 373-383.
17. T. Moore and R. Clayton, "Examining the impact of website take-down on phishing," in Proc. APWG eCrime Researchers Summit,
2007.
18. G. Salton and M. J. McGill, Introduction to Modern Information Retrieval. New York: McGraw-Hill, 1983.
19. Z. Zhang, J. Ma, J. Han, and X. Niu, "Email phishing detection by analysing natural language characteristics," in Proc. IEEE CCECE,
2007.
20. K. L. Chiew, K. S. C. Yong, and C. L. Tan, "A survey of phishing attacks: Their types, vectors and technical approaches," Expert
Systems with Applications, vol. 106, pp. 1-20, 2018.

Related Articles

2026

AI-Based Stomach Cancer Detection Using Biomarkers, Medical Images, and Voice Analysis

2026

Hydrogen-Efficient Eco-Driving and Route Planning for Fuel-Cell Electric Vehicles Using Multi-Objective Optimization Under Traffic and Terrain Uncertainty

2026

A Data-Driven Machine Learning Framework for Assessing Patent Commercial Value and Technological Significance

2026

Evaluating Student Academic Performance Through a Benchmark of Fuzzy Reasoning Models

2026

A Hybrid Soft Computing Approach for Managing Uncertainty in Data Analytics

2026

Soft Computing Approaches for Robust Analysis of Imbalanced and Noisy Data

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://test.theijire.com/archives/10.59256/ijire.20260703013

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.