Advertisement

Business

Regulatory Resource   Threat Intelligence      Resilient IT      Boardroom Strategies      
Threat Intelligence / Spam and Viruses

How Filtering Screens Out Spam

By Tom Schmidt

Apart from making money, a spammer's primary interest is to evade antispam filters. Never has that been truer than now. Gone are the days when organizations could block unsolicited email using homegrown approaches and static keyword filters. Today spammers are continually raising the stakes by devising ways to escape filtering. This article will look at the current state of spam activity, some of the principal filter evasion tactics used by spammers, and the technologies that are available today to help filter out spam.

The state of spam today

How big of a problem is spam? While there is little doubt that spam is an annoyance to users and administrators, it is also increasingly a serious security concern, as it can be used to deliver Trojan horses, viruses, and phishing attacks. In addition, high volumes of spam can create DoS (Denial-of-Service) conditions where email systems become so overloaded that legitimate email and network traffic are unable to get through.

Getting around the filters

Not surprisingly, large-scale spammers have become more adaptable and sophisticated over the years. This is partly a matter of simple economics. According to some estimates, for a spammer to bring in $1 million a month, all that is required is a $20 purchase from one out of every 2,000 "spammees" -- a mere 0.05% response rate.

Given such favorable economics, spammers will cycle through fake domain names and alter email subject lines so precisely and efficiently that by the time older antispam technologies can discern a pattern, the damage is done and a new attack with different characteristics has already been launched. Mass mail software even allows spammers to run mail through preprogrammed checklists, evaluating whether it is likely to be blocked by spam filters.

Increasingly, content modification using HTML has become the spammer's most powerful antifiltering technique. Spammers choose HTML because it attracts attention, enables tracking (spammers can verify whether a targeted email address is valid), and allows them to insert bogus tags in order to circumvent filtering.

Spammers have also proven adept at disguising the external appearance of URLs so that recipients are fooled into believing that the URLs belong to a legitimate organization. The success of email "brand spoofing" attacks is testimony to the power of this tactic. In such attacks, spammers create fraudulent emails and disguise URLs, purporting to originate from legitimate organizations in order to entice recipients to provide private and financial information.

While sending out bulk email is a fairly simple matter, spammers need a mechanism to conceal their identity, thus avoiding source blocking by IP address. One method involves the misuse of open proxy servers. Open proxy servers are misconfigured or virus-infected computers that allow traffic for virtually any network service to be channeled through a host computer.

Spammers routinely identify and hijack insecure proxy servers, whose owners may have no idea that their systems have been misappropriated. By some accounts, two-thirds of all spam emanates from these hijacked servers.

Spammers are also coming up with new ways to hijack computers, as shown by such mass-mailing computer worms as Sasser, Netsky, and SoBig. While these viruses didn't have especially malicious payloads, they did install mail programs on victims' computers, setting the stage for an immense network through which spam could be relayed.

Today's filtering technologies

Although many antispam solutions claim to work right out of the box, they actually offload much of the spam-fighting burden onto administrators and end users. The question for many IT departments then becomes: How much time can we afford to spend fighting and managing spam? After all, ongoing filtering and training costs can quickly spiral out of control.

Spam analysis begins with the Probe Network, an array of more than 2 million decoy email addresses and domains, also known as spamtraps or honeypots. This global network of email accounts attracts and collects large quantities of spam -- tens of millions of spam messages pass through the Probe Network every month. As messages come in, automated processes and expert technicians go to work, analyzing incoming spam and developing countermeasures.

While no one has so far developed a silver bullet against spam, a variety of filtering techniques have been created to keep spammers at bay -- all of which must constantly be evaluated and updated if enterprises are to keep ahead of spam. Among the techniques that may be used:

  • Reputation filtering Reputation-based blocking is a filtering technique that examines the quality or reputation of the sending source or mail server. It involves monitoring hundreds of thousands of email sources to determine how much mail sent from these addresses is legitimate and how much is spam. It also tracks data such as mailing patterns, the presence of open proxy or unsecured mail servers, volume of messages sent, and complaints.
  • Heuristics Heuristic filters analyze the header, body, and envelope information for incoming messages, checking for the presence of distinct spam characteristics. Each message is assigned an overall score, which is then compared to a threshold that determines whether the message is spam or not. Heuristic filters, once they are trained to determine what spam and legitimate mail looks like, can be very effective at identifying new spam. The drawback of many heuristic filters is that they can create a substantial administrative burden. If not properly trained and weighted for accuracy, they can also produce significant numbers of false positives (legitimate email messages that are incorrectly identified as spam).
  • Header filters To proactively identify first-time spam, header filters consist of regular "expression-based" filtering rules that exploit commonalities or trends present in spam messages. Examples of telltale spam characteristics that a header filter would address include traces of information left in messages by spammer tools and modified time zones.
  • Attachment filters Message attachments have long been a favorite tool of spammers. By attaching a deceptively named file or image to an email, spammers tempt recipients to click through and open the file. Filters based on a particular MIME attachment (for example, a specific pornographic image used in a real-time spam attack) can stop that attachment from reaching users. Attachment signatures therefore make it unnecessary to block entire categories of attachments.
  • URL filters Continually evolving URL-based filtering technologies aim to reverse spammers' new methods of URL masking and obfuscation techniques. For example, URL filters can stymie spammers' attempts to encode URLs with extraneous characters.
  • Language identification It's estimated that between 10% and 20% of all spam is written in languages other than English. As multilingual spam becomes a larger problem, antispam solutions must take into account the language in which messages are written. Solutions need to contain language identification abilities and heuristics that apply only to particular languages.
  • Custom filters An effective antispam solution should provide customization tools that allow administrators to be more aggressive in targeting unwanted mail. That way, administrators can create filters to proactively block or handle mail that does not necessarily meet the criteria of spam. For example, administrators could filter email from marketing lists that generate user complaints or use excessive bandwidth.

What to look for when choosing an antispam solution Accuracy is the single largest differentiator when it comes to antispam products. Accuracy in this regard refers to the false positive rate. For example, if a company received 100,000 messages a day, even a 1% false positive rate means 1,000 messages are mistakenly blocked every day. Obviously, such a rate is too high. In addition to evaluating antispam solutions based on the filtering technologies discussed above, enterprises are encouraged to look for solutions that:

  • Produce low or no false positives
  • Have a track record, through product reviews or customer validation, of having extremely low false positive rates
  • Employ a balanced mix of technologies to guard against overaggressive filtering
  • Have safeguards for preventing, detecting, and resolving suspected false positives
  • Provide quarantine options to let users ensure that legitimate messages are not lost

CIO Strategy Center is a daily editorial resource offering innovative insights and strategies for building an integrated, secure and resilient IT infrastructure.

Articles by Topic
Spam and Viruses
Preparedness
Strategies
Related Content
Fast Fact

"According to some estimates, for a spammer to bring in $1 million a month, all that is required is a $20 purchase from one out of every 2,000 "spammees" -- a mere 0.05% response rate."

Sponsor Tools
Podcast Audio Content

CIO Strategy Center is now available in audio format.

This week's feature topic is:


Cyberthieves Turning to More Invasive Approaches
Playtime: 8 min 53 sec



Download | Subscribe