SMARTech   Library Home
 

Georgia Tech's Institutional Repository >
College of Computing (CoC) >
Computational Science and Engineering (CSE)  >
Computational Science and Engineering Technical Reports >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1853/25135

Title: SNARE: Spatio-temporal Network-level Automatic Reputation Engine
Authors: Feamster, Nick
Gray, Alexander G.
Krasser, Sven
Syed, Nadeem Ahmed
Subjects : Blacklists
Botnet
Spammers
Issue Date: 2008
Publisher: Georgia Institute of Technology
Series/Report no.: CSE Technical Reports ; GT-CSE-08-02
Abstract: Current spam filtering techniques classify email based on content and IP reputation blacklists or whitelists. Unfortunately, spammers can alter spam content to evade content based filters, and spammers continually change the IP addresses from which they send spam. Previous work has suggested that filters based on network-level behavior might be more efficient and robust, by making decisions based on how messages are sent, as opposed to what is being sent or who is sending them. This paper presents a technique to identify spammers based on features that exploit the network-level spatio temporal behavior of email senders to differentiate the spamming IPs from legitimate senders. Our behavioral classifier has two benefits: (1) it is early (i.e., it can automatically detect spam without seeing a large amount of email from a sending IP address-sometimes even upon seeing only a single packet); (2) it is evasion-resistant (i.e., it is based on spatial and temporal features that are difficult for a sender to change). We build classifiers based on these features using two different machine learning methods, support vector machine and decision trees, and we study the efficacy of these classifiers using labeled data from a deployed commercial spam-filtering system. Surprisingly, using only features from a single IP packet header (i.e., without looking at packet contents), our classifier can identify spammers with about 93% accuracy and a reasonably low false-positive rate (about 7%). After looking at a single message spammer identification accuracy improves to more than 94% with a false rate of just over 5%. These suggest an effective sender reputation mechanism.
Type: Technical Report
URI: http://hdl.handle.net/1853/25135
Appears in Collections:Computational Science and Engineering Technical Reports

Files in This Item:

File Description SizeFormat
GT-CSE-08-02.pdf193.65 kBAdobe PDFView/Open

Items in SMARTech are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2007 MIT and Hewlett-Packard - Feedback