B@bel:Leveraging Email Delivery for Spam Mitigation

Usenix Security 2012

Gianluca Stringhini, Manuel Egele, Apostolis Zarras, Thorsten Holz,

Christopher Kruegel, and Giovanni Vigna

University of California, Santa Barbara Ruhr-University Bochum

李佳恆 leegoder@gmail.com

Outline

Introducion

Background

Approach

Evaluation

Conclusion

IntroducionKASPERSKY LAB. Spam Report: April 2012.

Email spam Accounting for more than 77% of all email traffic

https://www.securelist.com/en/analysis/204792230/Spam_Report_April_2012

SYMANTEC CORP. State of spam & phishing report

http://www.symantec.com/business/theme.jsp?themeid=state_of_spam

About 85% of world-wide spam traffic is sent by botnets

Traditional spam dection systems

1.Content analysis

2.Origin base

Ex.Blacklists

============new way==========

Focus on the email deliivery mechanism

(How messages are sent by spammers)

Background

(mail user agent )eg: Outlook

(mail transfer agent )

eg: msa.hinet

From wiki

eg: Hotmail

SMTP Conversaction

Reply:220 msr5.hinet.net ESMTP Sendmail 8.14.2/8.14.2; Sun, 29 Jul 2012 17:38:35 +0800 (CST)

EHLO adl.com

Reply:250-msr5.hinet.net Hello 114-34-35-96.HINET-IP.hinet.net [114.34.35.96], pleased to meet you

MAIL FrOm:<dada@msa.hinet.net>

Reply:250 2.1.0 <dada@msa.hinet.net>... Sender ok

rCpt tO: <leegoder@gmail.com>

Reply:250 2.1.5 <leegoder@gmail.com>... Recipient ok

Reply:354 Enter mail, end with "." on a line by itself

SubJECT : HI i am dada

YOYOYO

test !!!`~~~~

Reply:250 2.0.0 q6T9cZtc012399 Message accepted for delivery

SMTP RFC defines 14 commands.

Each command consists of four case-insensitive,alphabetic-character command codes

One or more space characters separate command codes

All command are terminated by line terminator(<CR><LF>)

Smtp replies :three-digit status code+space+description

(one line ,e.g., 250 OK)

RFC 821

Approach

SMTP Dialects

Different clients might implement the SMTP protocol in slightly different ways.

1.RFCs Do not always provide a single Format (e.g.,EHLO vs HELO)

2.Using different extension,client might add different parameters

3.Server accept commands that do not comply with the strict SMTP definitions

Learning Dialects

Passively observe ( )

A set of SMTP conversations

Each conversation is a sequence of <reply,command> pairs

E.g.,<220 hinet.net, EHLO adl.com>

Active probing

Send specifically-crafted replies to a client

And observe its responses

Active probing

Standard SMTP replies (e.g., send error)

Addiional SMTP replies (e.g., send twice)

Out-of-order Smtp replies

Missing replies (nerver sends a reply to a command)

Compliant replies (e.g., hOsT)

Incorrect replies (e.g., 9999)

incorrectly-terminated replis (e.g.,<CR><CR>)

Regular expressions

MAIL FROM:<dada@msa.hinet.net>MAIL FROM:gaga@msa.hinet.net

MAIL FROM:<email-addr>

Mail From :gaga@msa.hinet.net

Mail From :<email-addr>

E.g.,<220 hinet.net, EHLO adl.com> <220 hostname,EHLO domain>

State machine

E.g.,<220 hostname,EHLO domain>

Decision state Machine

WOLF, W. An Algorithm for Nearly-Minimal Collapsing of Finite-State Machine Networks.

(ICCAD) (1990).

Making a descison

E.g.,<220 hostname,EHLO domain> ...

E.g.,<220 hostname,HELO domain>

<250 OK,MAIL FROM:<email-addr>> ...

< Reply,Command>

E.g.,<220 hostname,HELO domain>

<250 OK,RSET> ...

C3 unknow

unknowunknow

The Botnet Feedback Mechanism

Some spammers take server feedback into account

e.g., recopient address does not exist

Cutwail : 35% email address were not exist [38]

Providing False Responses to Spam Emails.

[38]http://www.iseclab.org/papers/cutwail-LEET11.pdf

Evaluation

Enviroment

1.Virtual machine zoo

2.gateway

3.learner => decision fsm =>

4.decision maker

Evaluating dialects for Classification Run BabelTraining set (13 legitimate , 91malware)

Legitimate MUAs and MTAs are distinct from Bots Legitimate MUAs and MTAs are all speak distinct dialects (except for Outlook Express and Windows Live Mail)

91malware: 48 dialects Same dialects belong to the same family

Evaluating Dialects for Spam Detection

Run Babel

SMTP converastions for 621919 email messages(40days)

7114 bot samples[4] >> bad dialects

MUA+MTA+webmail >> good dialects

Passive spam detection

Decision machine do not recognize the conversaction >> mark as spam

Evaluating Dialects for Spam Detection

621919 email (ALL)

260074 spam , 218675 ham ,143170 ??

Verify

true positive

IP blacklist (30) + resolve domain

99.32% true positive

False negative

21% False negative

(misused web mail account,dedicated MTA)

(half is legitimate MTAs)

Limitations and Evasion

Evading dialects detection:

Use an existing open source smtp engine (CDO)

But spambots are built for performance

Bagle(a spam bot) : 20ms / a letter

CDO(windows) : 200ms / a letter

collaboration data objects library

Conclusion

Introduced a novel way to detect and mitigate spam emails

We study how the feedback mechanism used by botnets can be poisoned

Empirical result confirm that our approach can be used to detect and mitigate spam emails.

THANKS

B@bel:Leveraging Email Delivery for Spam Mitigation

Documents

NEW ABUSE REPORT - AFNIC · Google Safe Browsing Domain name Abuse spam phishing spam phishing spam Unwanted s spam spam malware GURID Registrar name Potential abuses Creation date

Spam and Anti Spam Techniques

EK Ch 17: Power laws and rich-get-richer phenomena (with an application of Web Spam detection Spam, Damn Spam and Statistics ) Spam, Damn Spam and Statistics

CASL vs CAN-SPAM - Canada’s Anti‐Spam Law

Spam Spam Spam Spam

PREVENTION, RISK MITIGATION AND RESPONSE SEXUAL AND … · SGBV prevention, risk mitigation and multi-sectoral response in all our operational responses. We recognise that the delivery

Getting frustrated with Spam; tips to Spam it

Delivery: risk assessment and mitigation · Risk Assessment and Mitigation November 2014 An independent commission appointed by Government . ... construction phases as appropriate

Secure Pipes with Network Security · Malware scanning: Address and port scan detection and mitigation Outbound email spam: Mitigation of email spam using the CSP’s mail servers

Spam Hammer 3 - WP Plugin To End Spam

ΑΜ.: 1130 µΒίκινγκς τραγουδούσε “Spam, spam, spam, …”, παρενοχλώντας οποιαδήποτε συνοµιλία µεταξύ των υπολοίπων

Spam, Spam, Spam, Spam…

SocialFilter: Introducing Social Trust to Collaborative Spam Mitigation Michael Sirivianos Telefonica Research Telefonica Research Joint work with Kyungbaek

B@BEL: Leveraging Email Delivery for Spam Mitigation

Spam spam spam spam. Lovely spam! Wonderful spam! Spam spa ... Verdel.pdf · Als er iets is dat ik geleerd heb in de afgelopen 23 jaar waarin ik onderwijs heb mogen genieten, dan

Exploiting Network Structure for Proactive Spam Mitigation Shobha Venkataraman * Joint work with Subhabrata Sen §, Oliver Spatscheck §, Patrick Haffner

Spam Mitigation using Spatio temporal Reputations from ...A.G. West, A.J. Aviv, J. Chang, and I. Lee ACSAC `10 ‐December 9, 2010 Spam Mitigation using Spatio‐temporal Reputations

1 DMPT: Controlling Spam Through Message Delivery Differentiation Zhenhai Duan, Kartik Gopalan Florida State University Yingfei Dong University of Hawaii

Spam and Botnets: Characterization and Mitigation

Canada’s New Anti-Spam Legislation: Compliance Challenges and Risk Mitigation Strategies IT.CAN 18 th Annual Conference October 20, 2014 Craig T. McDougall