
[17]
Ozgur Koray Sahingoz, Ebubekir Buber, Onder Demir, and Banu Diri. Machine learning based
phishing detection from urls. Expert Systems with Applications, 117:345–357, 2019.
[18] Apache Software Foundation. Spamassassin public corpus, 2004.
[19] Bryan Klimt and Yiming Yang. The enron email dataset.
[20] Jose Joseph. Phishing research resources, 2025.
[21] Rachael Tatman. Fraudulent e-mail corpus, 2019.
[22]
Ion Androutsopoulos, John Koutsias, Konstantinos V. Chandrinos, Georgios Paliouras, and Con-
stantine D. Spyropoulos. The Ling-Spam dataset, 2000. Accessed: 2025-02-08.
[23]
Georgios Sakkis, Ion Androutsopoulos, Georgios Paliouras, Vangelis Karkaletsis, Constantine D
Spyropoulos, and Panagiotis Stamatopoulos. A memory-based approach to anti-spam filtering for
mailing lists. Information retrieval, 6:49–73, 2003.
[24]
Abdulla Al-Subaiey, Mohammed Al-Thani, Naser Abdullah Alam, Kaniz Fatema Antora, Amith
Khandakar, and SM Ashfaq Uz Zaman. Novel interpretable and robust web-based ai platform for
phishing email detection. Computers and Electrical Engineering, 120:109625, 2024.
[25] Ebubekir Buber. Phishing detection dataset (pdd), 2023.
[26]
Mahmoud Khonji, Youssef Iraqi, and Andrew Jones. Phishing detection: a literature survey. IEEE
Communications Surveys & Tutorials, 15(4):2091–2121, 2013.
[27] Microsoft. Learn about safe links in microsoft defender for office 365, 2025. Accessed: 2025-05-24.
[28]
Dinil Mon Divakaran and Adam Oest. Phishing detection leveraging machine learning and deep
learning: A review. IEEE Security & Privacy, 20(5):86–95, 2022.
[29]
Pawan Prakash, Manish Kumar, Ramana Rao Kompella, and Minaxi Gupta. Phishnet: Predictive
blacklisting to detect phishing attacks. In 2010 Proceedings IEEE INFOCOM, pages 1–5, 2010.
[30] GeeksforGeeks. Difference between random forest and xgboost, 2023. Accessed: 2025-05-19.
[31]
Yongjie Huang, Qiping Yang, Jinghui Qin, and Wushao Wen. Phishing url detection via cnn and
attention-based hierarchical rnn. In 2019 18th IEEE International Conference On Trust, Security
And Privacy In Computing And Communications/13th IEEE International Conference On Big Data
Science And Engineering (TrustCom/BigDataSE), pages 112–119. IEEE, 2019.
[32]
Jacob Devlin. Bert: Pre-training of deep bidirectional transformers for language understanding.
arXiv preprint arXiv:1810.04805, 2018.
[33]
Nafiz Rifat, Mostofa Ahsan, Md Chowdhury, and Rahul Gomes. Bert against social engineering
attack: Phishing text detection. In 2022 IEEE International Conference on Electro Information
Technology (eIT), pages 1–6. IEEE, 2022.
[34]
Mohammad Amaz Uddin and Iqbal H Sarker. An explainable transformer-based model for phishing
email detection: A large language model approach. arXiv preprint arXiv:2402.13871, 2024.
[35]
S Kavya and D Sumathi. Staying ahead of phishers: a review of recent advances and emerging
methodologies in phishing detection. Artificial Intelligence Review, 58(2):50, 2024.
[36]
Appsilon. Machine learning evaluation metrics for classification.
https://www.appsilon.com/post/
machine-learning-evaluation-metrics-classification, 2023. Accessed: 2025-05-06.
[37]
Jason Brownlee. Failure of accuracy for imbalanced class distributions, 2019. Accessed: 2025-05-05.
[38]
Glassbox Medicine. Measuring performance: Auprc (area under the precision-recall curve), 2019.
Accessed: 2025-02-28.
[39]
Christoph Molnar. Interpretable Machine Learning, chapter Interpretability. Self-published, 2022.
Accessed: 2025-05-04.
[40]
Christoph Molnar. Interpretable Machine Learning, chapter Methods Overview. Self-published, 2022.
Accessed: 2025-05-04.
78