Abstract
A new system for spam e-mail annotation by end-users is presented. It is based on the recursive application of hand-written annotation rules by means of an inferential engine based on Logic Programming. Annotation rules allow the user to express nuanced considerations that depend on deobfuscation, word (non-)occurrence and structure of the message in a straightforward, human-readable syntax. We show that a sample collection of annotation rules are effective on a relevant corpus that we have assembled by collecting e-mails that have escaped detection by the industry-standard SpamAssassin filter. The system presented here is intended as a personal tool enforcing personalized annotation rules that would not be suitable for the general e-mail traffic.
A companion Web site to this article, with software, results and the corpus described herewith is at http://informatica.unime.it/rubast/
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sergeant, M.: Internet-level spam detection and spamassassin 2.50. In: Spam Conference (2003)
Denti, E., Omicini, A., Ricci, A.: Multi-paradigm java-prolog integration in tuprolog. Sci. Comput. Program. 57(2), 217–250 (2005)
Wielemaker, J., Anjewierden, A.: An architecture for making object-oriented systems available from prolog. In: Proc. of the 12th Int’l Workshop on Logic Programming Environments, WLPE 2002 (2002)
Cormack, G.V., Lynam, T.R.: Online supervised spam filter evaluation. ACM Trans. Inf. Syst. 25(3) (2007)
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979)
Cormack, G.V., Lynam, T.R.: Spam corpus creation for trec. In: Proc. of the Second Conference on Email and Anti-Spam, CEAS 2005 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fiumara, G., Marchi, M., Pagano, R., Provetti, A. (2010). Rule-Based Spam E-mail Annotation. In: Hitzler, P., Lukasiewicz, T. (eds) Web Reasoning and Rule Systems. RR 2010. Lecture Notes in Computer Science, vol 6333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15918-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-15918-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15917-6
Online ISBN: 978-3-642-15918-3
eBook Packages: Computer ScienceComputer Science (R0)