EconPapers    
Economics at your fingertips  
 

EMAIL SPAM DETECTION: A SYMBIOTIC FEATURE SELECTION APPROACH FOSTERED BY EVOLUTIONARY COMPUTATION

Pedro Sousa (), Paulo Cortez (), Rui Vaz (), Miguel Rocha () and Miguel Rio ()
Additional contact information
Pedro Sousa: Centro Algoritmi/Department of Informatics, Universidade do Minho, Braga, Portugal
Paulo Cortez: Centro Algoritmi/Department of Information Systems, Universidade do Minho, Guimarães, Portugal
Rui Vaz: Department of Information Systems, Universidade do Minho, Guimarães, Portugal
Miguel Rocha: CCTC/Department of Informatics, Universidade do Minho, Braga, Portugal
Miguel Rio: Department of Electric and Electronic Engineering, University College London, Torrington Place, London, U.K.

International Journal of Information Technology & Decision Making (IJITDM), 2013, vol. 12, issue 04, 863-884

Abstract: The electronic mail (email) is nowadays an essential communication service being widely used by most Internet users. One of the main problems affecting this service is the proliferation of unsolicited messages (usually denoted by spam) which, despite the efforts made by the research community, still remains as an inherent problem affecting this Internet service. In this perspective, this work proposes and explores the concept of a novel symbiotic feature selection approach allowing the exchange of relevant features among distinct collaborating users, in order to improve the behavior of anti-spam filters. For such purpose, several Evolutionary Algorithms (EA) are explored as optimization engines able to enhance feature selection strategies within the anti-spam area. The proposed mechanisms are tested using a realistic incremental retraining evaluation procedure and resorting to a novel corpus based on the well-known Enron datasets mixed with recent spam data. The obtained results show that the proposed symbiotic approach is competitive also having the advantage of preserving end-users privacy.

Keywords: Spam detection; content-based filtering; evolutionary algorithms; Naïve Bayes; feature selection (search for similar items in EconPapers)
Date: 2013
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219622013500326
Access to full text is restricted to subscribers

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wsi:ijitdm:v:12:y:2013:i:04:n:s0219622013500326

Ordering information: This journal article can be ordered from

DOI: 10.1142/S0219622013500326

Access Statistics for this article

International Journal of Information Technology & Decision Making (IJITDM) is currently edited by Yong Shi

More articles in International Journal of Information Technology & Decision Making (IJITDM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().

 
Page updated 2025-03-20
Handle: RePEc:wsi:ijitdm:v:12:y:2013:i:04:n:s0219622013500326