EconPapers    
Economics at your fingertips  
 

Application of word embedding and machine learning in detecting phishing websites

Routhu Srinivasa Rao (), Amey Umarekar () and Alwyn Roshan Pais ()
Additional contact information
Routhu Srinivasa Rao: GMR Institute of Technology
Amey Umarekar: National Institute of Technology
Alwyn Roshan Pais: National Institute of Technology

Telecommunication Systems: Modelling, Analysis, Design and Management, 2022, vol. 79, issue 1, No 4, 33-45

Abstract: Abstract Phishing is an attack whose aim is to gain personal information such as passwords, credit card details etc. from online users by deceiving them through fake websites, emails or any legitimate internet service. There exists many techniques to detect phishing sites such as third-party based techniques, source code based methods and URL based methods but still users are getting trapped into revealing their sensitive information. In this paper, we propose a new technique which detects phishing sites with word embeddings using plain text and domain specific text extracted from the source code. We applied various word embedding for the evaluation of our model using ensemble and multimodal approaches. From the experimental evaluation, we observed that multimodal with domain specific text achieved a significant accuracy of 99.34% with TPR of 99.59%, FPR of 0.93%, and MCC of 98.68%

Keywords: URL; Phishing; Anti-phishing; TF-IDF; Hostname; Random forest (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s11235-021-00850-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:telsys:v:79:y:2022:i:1:d:10.1007_s11235-021-00850-6

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/11235

DOI: 10.1007/s11235-021-00850-6

Access Statistics for this article

Telecommunication Systems: Modelling, Analysis, Design and Management is currently edited by Muhammad Khan

More articles in Telecommunication Systems: Modelling, Analysis, Design and Management from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:telsys:v:79:y:2022:i:1:d:10.1007_s11235-021-00850-6