type in your query to search makeyougohmm
Things that ... make you go hmmtechnology music video art news reviews and muse on the web

November 14, 2007

Does adding CAPTCHA reduce the number of comments?

blogs and podcasting — by TDavid @ 6:52 am PST
New! F = please no more posts like thisD = not among your best stuffC = average postB = good post, I liked itA = great post, please create more like this (Hmm, no ratings yet)
Loading ... Loading ...

On May 24 of this year, CAPTCHA was added to the commenting area as a means of protecting against the onslaught of comment spam and helping the good reCAPTCHA cause. At the time this blog was receiving over 1,000 comment spams a day, no doubt increased because we weren’t and still aren’t using rel=nofollow on links left in comments.

After reCAPTCHA immediately was cut down to less than 10 comment spams per day. From 1,000+ to less than 10 per day on average? From a blog publisher standpoint that’s a win. It’s a timesaver, but what about you, dear readers?

Shortly after activating the reCAPTCHA on May 24, and on a reader’s suggestion (see the comments on that post), I made it so that the CAPTCHA would disappear after commenters left at least five approved comments. It was never my goal for the CAPTCHA to be a nuisance for commenters, which is one major gripe with blogs that use CAPTCHA.

The other claim is that those blogs who use CAPTCHA will have fewer comments. I’m a strong believer in numbers not lying, so I decided to see how many approved comments have been left in the 174 days since the CAPTCHA have been activated.

Before getting to the numbers it’s important to note that the amount of traffic this blog receives isn’t the same. While the number of RSS subscriptions has increased, web traffic to this blog has gone down a little bit over the time range since 2006 which would further impact the actual results.

Wordpress MySQL database queries to count number of comments since date
Below in blockquote you’ll find queries you can use on your blog. If you use a table prefix, be sure to change the wp_comments to TABLEPREFIX_comments instead. For example, let’s say you used myblog as a table prefix then replace wp_comments with myblog_comments in the SQL statements below.

SELECT count(comment_ID) from wp_comments WHERE comment_approved=’1′ and comment_date > ‘2007-05-24′;

Result: 1,739

Now to check the comments in the days before the CAPTCHA was implemented, first the query to use in shell (my favorite) or your favorite MySQL editor (like PHPmyAdmin):

SELECT count(comment_ID) from wp_comments WHERE comment_approved=’1′ and comment_date > ‘2006-12-01′ and comment_date < ‘2007-05-24′;

Result: 1,932. That’s a 10% decrease in comment activity. Now before saying aha! let’s examine a query of the same 174 day period from the previous year.

SELECT count(comment_ID) from wp_comments WHERE comment_approved=’1′ and comment_date > ‘2006-05-24′ and comment_date < ‘2006-11-15′;

Result: 2,198. Ouch, now say aha! That’s a decrease of 21% (rounded) in comment activity over the same period of time a year earlier.

Just for fun, let’s go back one year.

SELECT count(comment_ID) from wp_comments WHERE comment_approved=’1′ and comment_date > ‘2005-05-24′ and comment_date < ‘2005-11-15′;

Result: 1,227. This query result doesn’t mean much because comment activity actually increased significantly in 2006 over previous years at this blog. In 2004 over the same time period the result was 352 comments and this blog was created on July 4, 2003 so the same period can’t be reviewed in 2003.

Bottom line findings: although I replied to reader A. Brinck this morning that comment activity hadn’t decreased and even Twittered about it, I was basing this off a flawed initial query. Further focus revealed I wasn’t using the correct number of days (163 instead of 174) for my past query, which is my bad. Detailed review reveals that indeed, comment activity has decreased by 10% at this blog since adding CAPTCHA. Again, it’s important to note that there has been a decrease in traffic, so the actual amount of impact on blog commenting is less than 10% doing real numbers.

Does CAPTCHA reduce the number of unique commenters?
While the above queries answer the title question of CAPTCHA impact on the overall number of comments, it doesn’t speak about the unique number of commenters. We need to refine our MySQL query to get these results for the research.

With the current setup it is new commenters being inconvenienced the most. I decided to run a query to check the percentage of commenters since adding the CAPTCHA versus the 174 days before. First let’s look at the most recent period and let’s remove trackbacks from the results (comment_type != ‘pingback’):

SELECT count(comment_ID), comment_author from wp_comments WHERE comment_approved=’1′ and comment_type != ‘pingback’ and comment_date > ‘2007-05-24′ group by comment_author;

Result: 431 unique commenters over the last 174 days. Compared to the preceding 174 days?

SELECT count(comment_ID), comment_author from wp_comments WHERE comment_approved=’1′ and comment_type != ‘pingback’ and comment_date > ‘2006-12-01′ and comment_date < ‘2007-05-24′ group by comment_author;

Result: 528 unique commenters, a reducton of 18.4% unique commenters. Ouch. Now let’s check the same 174 days time frame in 2006.

SELECT count(comment_ID), comment_author from wp_comments WHERE comment_approved=’1′ and comment_type != ‘pingback’ and comment_date > ‘2006-05-24′ and comment_date < ‘2006-11-15′ group by comment_author;

Result: 599 unique commenters, 28.1% less unique commenters since adding CAPTCHA over the same time period in 2006.

Get rid of the CAPTCHA or not?
This is another chance for readers to weigh in on the use of CAPTCHA in the comments. Would you like it to go away completely or should the number be reduced to one or two approved comments not to show the CAPTCHA instead of five?

I’m leaning toward reducing the number of comments left as opposed to eliminating CAPTCHA altogether this time but it’s not off the table. Frankly losing 10% of the comments over the same period of time versus having 990+ comment spams a day could be a worthwhile tradeoff. Less time sifting through BS to be able to work on posts, promotion and conversing with those who are leaving comments.

What do you think?

Related Posts

RSS Feed comments for this post 9 Comments »

  1. TDavid, one intangible is as time goes by, the novelty of Blogging has decreased considerably. Also Blog overload due to the proliferation of blogs in general has an impact. Way more competition for commentors eyes and time now versus then. As with most things, the numbers are just numbers and causality cannot be directly inferred.

    Comment by FranciscoIV — November 14, 2007 @ 8:08 am PST

  2. Yes, a lot of factors to consider, but I thought it was interesting looking over the numbers. You shouldn’t be seeing the CAPTCHA any more, did it bother you at all when you did see it?

    Comment by TDavid — November 14, 2007 @ 8:13 am PST

  3. Hi there,

    I’m one of the developers on reCAPTCHA.

    This is a really interesting analysis of the impact of CAPTCHAs on commenting. I like your method of hiding the CAPTCHA for a commenter who has had many approved comments. In fact, one thing we’ve been considering is having a more global version of this. Since reCAPTCHA is used on many blogs, it’d be possible to have a global karma system that allows people to freely comment on blogs without having to enter a CAPTCHA every time.

    A global system would obviously have a few twists that make it more complicated:

    - We need to balance the privacy of the user with the ability to track abuse.
    - It’s possible that, in order to circumvent this, a hacker might create a virus that gathered the cookie which granted the free-pass and sent it to a central source.

    I think that a more global system is something for people to think about, but it’s good to see that it works for you on a small scale!

    A few questions for you:

    1) What does this do to the quality of the comments? Did the CAPTCHA deter high quality comments or just things like “that’s cool”
    2) What other changes can be made to increase the number of comments? Removing the email address field seems like it might be a big one.

    Comment by Ben Maurer — November 14, 2007 @ 11:18 am PST

  4. Hi Ben - thanks for stopping by and suffering through the CAPTCHA ;)

    1) What does this do to the quality of the comments? Did the CAPTCHA deter high quality comments or just things like “that’s cool”

    The comment quality at this blog is one thing I’ve been pretty happy with both pre- and post-CAPTCHA. The implementation of the CAPTCHA didn’t negatively impact the quality of comments left. Neither the comments or the people leaving them.

    Over the last 174 days I found myself able to spend a little more time following the links from commenters (and replying in more depth and frequency) than before where I might have been duped by a spammer who made what appeared to be a legitimate comment. The quality of the comment is very important to us because we see comments as part of the overall content on the page. In some cases the comments left by others I value more than the original blog post.

    2) What other changes can be made to increase the number of comments? Removing the email address field seems like it might be a big one.
    I’ve covered the subject of how to get more comments on a blog in greater depth before from a writer/publisher standpoint. And I also did a retrospective when this blog reached the 10,000 comments mark.

    As far as the reCAPTCHA technology, which is what I believe you meant, you might notice I’m not using the newer version with two different forms. I didn’t like that as it makes it kind of confusing leaving a comment. Not sure if you’ve had complaints about that or not. The interface needs to be very simplistic like what Disqus is doing or people say forget it and leave.

    As a publisher, I like having the email field because it gives us another way to determine if the comment is valid or spam. Most spammers don’t use valid email addresses in our experience. Spammers are getting more creative these days. They employ people to leave real looking comments on blogs not using rel=nofollow with links to spam sites.

    Comment by TDavid — November 14, 2007 @ 12:10 pm PST

  5. TDavid, the CAPTCHA didn’t bother me, but then I’m not a drive by commenter and have invested in your BLOG (figuratively not economically). I did find when I switched PC’s and didn’t have your cookie it did make the CAPTCHA re-appear once (with a blank comment form). After that is was gone. WP must have recognized my User ID and email after the first comment on the different PC….

    Comment by FranciscoIV — November 14, 2007 @ 1:21 pm PST

  6. If you’re running Wordpress (it’s not obvious if you are or aren’t) then use Dr Dave’s Spam Karma 2. No need for registration; people can comment automatically. Kills spam very effectively.

    Comment by Charles — November 16, 2007 @ 6:25 am PST

  7. I have had a terrible experience with re-capture. The words they use are not always readable and sometimes matching them still gives me negative result. I removed re-capture from by blog after playing with it for a couple of weeks.

    Comment by Yan — November 16, 2007 @ 1:36 pm PST

  8. I don’t mind CAPTCHA as long as it’s easy to read and reCAPTCHA words are usually pretty easy to read so far. Some other CAPTCHA systems take me 3-4 tries to get right, which is extremely irritating.

    Comment by Gear — December 6, 2007 @ 12:30 pm PST

  9. I’m always having a hard time reading captcha..It would usually take me 2-3 times before I can get it right..^^..I hope they can come up to of better ways to prevent spams other than captcha.

    Comment by George Lindemann Jr — May 24, 2008 @ 11:28 pm PST


TrackBack URI: http://www.makeyougohmm.com/20071114/4942/trackback/

Leave a comment


By leaving a comment you consent to the Official Hmm Comment Policy

Return Home


Copyright 2003-2008 KMR Enterprises All Rights Reserved