Comment Spam Stats

Since January 12th:

  • Valid comments accepted by Akismet: 36
  • Spam comments accepted by Akismet: 17
  • Spam comments rejected by Akismet: 814

I don’t have a number for false positives, but given that I’ve received zero email complaints I’ll assume the number is low if not zero. This gives Akismet about a 98% success rate on catching spam, which is pretty good. It makes my life better. Having more spam comments than real comments get through the gates can be really depressing for a blog owner.

At some point I’ll post my Django newforms/Akismet integration code. It’s very simple, and clearly worth the effort.

Update: James Bennett reminds me that his comment_utils wrap up anti-spam and comment moderation measures in one tidy package.

Alex Ezell commented :

I like to say that comment spam is the reason I always quit blogging, but I suspect it’s because no one wants to read what I write about :)

I guess one of these days, I’ll get around to redoing my blog with Django.

Does anyone else think that Akismet should be doing better? For a distributed spam system, I would expect 99% or better catching spam. I say that based on my, perhaps incorrect, belief that comment spammers usually post the same junk on tons of sites all in a short period of time.

After the first 2 or 3 bloggers mark this stuff, shouldn’t Akismet catch the rest? I probably just don’t know enough about how it happens.


Paul commented :

I suspect that there’s a significant random component to the spam, so that it’s not exactly the same junk everywhere. But given that good Bayesian email filters seem to be able to do better, I do agree that there seems to be room for improvement.

Nonetheless, I can’t begrudge the free service that Akismet is giving me. As we’ve both experienced, comment spam is demoralizing, and it’s nice to be (mostly) free of it.