Protect Your Interwebs!
Sucuri CTO, OSSEC Founder, open source developer and information security professional - dcid.me
Receive new posts in your inbox.
Copyright © 2013 Sucuri Inc. · Terms of Service · Privacy Policy
Sucuri® is a registered trademark of Sucuri Inc. in the United States and/or other countries.
Blog Comments – Analysing 100,000 Comments and Spammers
“Nice blog, thanks for the info”
“Awesome site. Great job”
“You should take part in a contest for one of the best blogs on the web. I will recommend this site!”
I know you like flattering comments on your website. And I know you love to see many comments on each one of your posts (say you community participation). Who doesn’t, right? We love them too.
So we decided to take a closer look at the last 100,000 (well, 98,238 to be more exact) comments that were sent to the network of sites that we are monitoring. How much of them are spam? Who are the most annoying spammers? And things like that.
Comment Analysis
We filtered the latest 98,238 comments received (that’s less than a week worth of comments), and ran them through our analysis engine. Guess how many of them were spam? How many were good?
Wow! So according to our analysis, more than 80% of the comments were classified as spam. We even took a conservative approach and classified unsure comments as good comments. So out of every 5 comments received, only 1 was valid.
*Unsure comments were ones we only saw hitting one web site, but the content was suspicious. Those in this list were almost 10,000 (9% of the overall total). If we had classified those as spam, the number would have grown to 90+% spam.
Spam Analysis – Messages
This really amused us. What type of message do you think a spammer was sending? Most of the time, we noticed that they sent a flattering note to increase the odds of the webmaster accepting the comment. Here are the top 10 messages sent by spammers:
The last one in the list is the funniest (“You should take part in a contest for one of the best blogs on the web. I will recommend this site”). Taking out the Viagra and the Louis Vuitton spam, why do they do it?
They do it because in the URL field, they add a link to their own web site (which can increase their page rankings, visitors, etc). Example:
Spam Analysis – Emails
This email analysis was not as useful as we would have hoped. The emails are very random and mostly from gmail and hotmail accounts. These were the top spammer emails:
These were the top domains used by spammers:
Spam Analysis – URLs
Now it is getting useful, let’s see the domains that are using comment spam to increase their ratings and visitors. Top 30 on this one (out of 24,976 different URLs):
Spam Analysis – IP Addresses
To finish, some actionable information for hosting providers and website owners. This is the list of IP addresses sending the most spam so you can block them out:
The total list is very big (12,190 unique IP addresses), but blocking the top ones is a good start.
Spam Analysis – Countries
Out of curiosity we decided to check the top Countries sending spam (based on the IP address):
In the olympics of SPAM, the USA is #1, followed by China (Silver), Russia (Bronze) and Brazil.
Conclusion
Yes, there is a lot of spam out there. I would say that 9 out of 10 comments are spammy in some way (even if not automated – we only classified automated messages as spam). In any event, let us know if you want any more information from this list. We have raw data, so we can run numbers and different analysis as requested.
Sharing is Caring: