<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" 
      xmlns:thr="http://purl.org/syndication/thread/1.0">
  <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php" />
  <link rel="self" type="application/atom+xml" href="http://www.readwriteweb.com/atom.xml" />
  <id>tag:www.readwriteweb.com,2011:/1/tag:www.readwriteweb.com,2009://1.16420-</id>
  <updated>2011-08-16T16:37:50Z</updated>
  <title>Comments for Google Acquires reCAPTCHA to Fight Spam and Improve Google Books OCR</title>
  
  <generator uri="http://www.sixapart.com/movabletype/">Movable Type 4.35-en</generator>
  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420</id>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php" />
    <link rel="service.edit" type="application/atom+xml" href="http://www.readwriteweb.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=1/entry_id=16420" title="Google Acquires reCAPTCHA to Fight Spam and Improve Google Books OCR" />
    <published>2009-09-16T16:58:19Z</published>
    <updated>2009-09-16T17:25:29Z</updated>
    <title>Google Acquires reCAPTCHA to Fight Spam and Improve Google Books OCR</title>
    <summary>Google just announced that it has acquired reCAPTCHA, one of the leading providers of CAPTCHAs, the hard-to-read puzzles you often have to solve before you can sign up for a new web service. Google, of course, isn&apos;t so much interested in owning software that can generate CAPTCHAs - that&apos;s an easy problem to solve -...</summary>
    <author>
      <name>Frederic Lardinois</name>
      
    </author>
    
    <category term="Google" />
    
    <category term="NYT" />
    
    <category term="News" />
    
    <content type="html" xml:lang="en" xml:base="http://www.readwriteweb.com/">
      <![CDATA[<p><img alt="recaptcha_logo_dec08.png" src="http://www.readwriteweb.com/images/recaptcha_logo_dec08.png"  />Google just announced that <a href="http://googleblog.blogspot.com/2009/09/teaching-computers-to-read-google.html">it has acquired reCAPTCHA</a>, one of the leading providers of CAPTCHAs, the hard-to-read <a href="http://en.wikipedia.org/wiki/CAPTCHA">puzzles</a> you often have to solve before you can sign up for a new web service. Google, of course, isn't so much interested in owning software that can generate CAPTCHAs - that's an easy problem to solve - but is looking at <a href="http://recaptcha.net/">reCAPTCHA</a> as a way to improve the optical character recognition (OCR) software it uses for large-scale text-scanning projects like <a href="http://books.google.com/">Google Books</a> and the <a href="http://news.google.com/archivesearch">Google News Archive Search</a>.</p>]]>
      <![CDATA[<p>According to Google, reCAPTCHA is currently in use on over 100,000 websites to prevent spam and fraud. The reCAPTCHA team, which is currently based at Carnegie Mellon University, will join Google.</p>

<h2>Solving CAPTCHAs to Transcribe Books</h2>

<p><img alt="recaptcha_book.png" align="right" src="http://www.readwriteweb.com/images/recaptcha_book.png"  />We took <a href="http://www.readwriteweb.com/archives/recaptcha_stopping_spam.php">detailed looks</a> at reCAPTCHA and how it works last September and in <a href="http://www.readwriteweb.com/archives/recaptcha.php">early 2007</a>. In short, reCAPTCHA has found a nifty way to crowdsource <a href="http://recaptcha.net/digitizing.html">book transcriptions</a>. When users solve a CAPTCHA through reCAPTCHA, the software shows them two words: one with a known answer (the control word) and one that the OCR software could not read with confidence. Once a certain number of users have given the same answer for the uncertain word, it becomes a control word itself and the OCR software can learn it.</p>

<p>Now, Google will be able to use this same technology to improve its own OCR efforts. Google currently makes over <a href="http://www.readwriteweb.com/archives/google_opens_up_its_epub_archive_download_1_million_books_for_free.php">1 million out-of-copyright books available</a> for download through Google Books, and one of the main criticisms of these books has been that the texts are unedited and contain many OCR errors. With reCAPTCHA, Google could potentially bring the error rate down dramatically and make Google Books even more useful.</p>

]]>
    </content>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:185918</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c185918" />
    <title>Comment from sunya on 2010-02-03</title>
    <author>
        <name>sunya</name>
        <uri>http://www.kondigg.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.kondigg.com">
        <![CDATA[<p>Thank you so much for everything</p>]]>
    </content>
    <published>2010-02-04T07:09:51Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:171257</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c171257" />
    <title>Comment from flyman159 on 2009-11-29</title>
    <author>
        <name>flyman159</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Well, it seems like just another blog, but having gone through it, I found it full of interesting topics. It seems very informative, and I'd love to recommend it to everyone, since it offers something of interest to all. I do love this blog and hope to keep in touch with it as it becomes my favorite. lol... Thanks!<br />
</p>]]>
    </content>
    <published>2009-11-29T23:24:54Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158131</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158131" />
    <title>Comment from 吴鹏 on 2009-09-17</title>
    <author>
        <name>吴鹏</name>
        <uri>http://wupeng.cn</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://wupeng.cn">
        <![CDATA[<p>It's a great idea.</p>]]>
    </content>
    <published>2009-09-17T12:36:48Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158123</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158123" />
    <title>Comment from freisprecheinrichtung bluetooth on 2009-09-17</title>
    <author>
        <name>freisprecheinrichtung bluetooth</name>
        <uri>http://www.zoombits.de/bluetooth/</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.zoombits.de/bluetooth/">
        <![CDATA[<p>Well, this is great news for Google fans because it will help restrict spammers and provide clean, good services...</p>]]>
    </content>
    <published>2009-09-17T10:39:33Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158122</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158122" />
    <title>Comment from kwyjibo on 2009-09-17</title>
    <author>
        <name>kwyjibo</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>Excellent, instead of producing OCR for public domain works available freely for distribution on the Internet Archive, they'll now be working on providing OCR for books only accessible online through Google Books.</p>

<p>God bless capitalism.</p>]]>
    </content>
    <published>2009-09-17T10:36:46Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158097</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158097" />
    <title>Comment from رسائل العيد - رسايل للعيد on 2009-09-16</title>
    <author>
        <name>رسائل العيد - رسايل للعيد</name>
        <uri>http://vb.qlbe.com/t255686/</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://vb.qlbe.com/t255686/">
        <![CDATA[<p>Gooooood. Thanks! Google is really amazing at acquiring the right companies.</p>]]>
    </content>
    <published>2009-09-17T06:10:34Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158071</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158071" />
    <title>Comment from دلع on 2009-09-16</title>
    <author>
        <name>دلع</name>
        <uri>http://www.dll3.cc/</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.dll3.cc/">
        <![CDATA[<p>Thanks</p>

<p>Good move once again by Google</p>]]>
    </content>
    <published>2009-09-17T04:26:44Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158067</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158067" />
    <title>Comment from cb on bonanzle on 2009-09-16</title>
    <author>
        <name>cb on bonanzle</name>
        <uri>http://www.bonanzle.com/booths/chicagobelow</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.bonanzle.com/booths/chicagobelow">
        <![CDATA[<p>Google really and truly needs to focus on working on the products and services they already have before spending money frivolously in trying to buy up everything. Another Microsoft-type corporation, if the truth is to be told.</p>

<p>I am NOT a fan of reCAPTCHA and I encourage everyone to go read Mashable's article titled "Facebook Captcha: What You DON’T Need to Type" and start purposely screwing up the CAPTCHAs. Google wanting to digitize and own all books of the world is just NOT a good thing.</p>

<p>And also, CAPTCHAs do NOT stop spammers in any way, shape or form, as spammers use program scripts to get around them and spam thousands of sites.</p>]]>
    </content>
    <published>2009-09-17T03:58:18Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158045</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158045" />
    <title>Comment from marketing palm beach  on 2009-09-16</title>
    <author>
        <name>marketing palm beach </name>
        <uri>http://www.atlanticoptimize.com/the-need-for-business-marketing-online</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.atlanticoptimize.com/the-need-for-business-marketing-online">
        <![CDATA[<p>Brilliant. Brilliant. Brilliant. Google is really amazing at acquiring the right companies for their improvement.</p>]]>
    </content>
    <published>2009-09-17T00:29:29Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158012</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158012" />
    <title>Comment from Alex Hawkinson on 2009-09-16</title>
    <author>
        <name>Alex Hawkinson</name>
        <uri>http://hawkinson.cloudprofile.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://hawkinson.cloudprofile.com">
        <![CDATA[<p>Cool from a machine learning standpoint but does this freak you out since Google could potentially have registration and use data for a huge number of sites around the web that leverage reCAPTCHA?  More thoughts here <a href="http://bit.ly/ggEWT." rel="nofollow">http://bit.ly/ggEWT.</a> </p>]]>
    </content>
    <published>2009-09-16T18:48:16Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158007</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158007" />
    <title>Comment from Sean on 2009-09-16</title>
    <author>
        <name>Sean</name>
        <uri>http://www.vividwebgraphics.com</uri>
    </author>
    <content type="html" xml:lang="en" xml:base="http://www.vividwebgraphics.com">
        <![CDATA[<p>Good move once again by google</p>]]>
    </content>
    <published>2009-09-16T18:00:10Z</published>
  </entry>

  <entry>
    <id>tag:www.readwriteweb.com,2009://1.16420-comment:158005</id>
    <thr:in-reply-to ref="tag:www.readwriteweb.com,2009://1.16420" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php"/>
    <link rel="alternate" type="text/html" href="http://www.readwriteweb.com/archives/google_acquires_recaptcha.php#c158005" />
    <title>Comment from Mary on 2009-09-16</title>
    <author>
        <name>Mary</name>
        <uri></uri>
    </author>
    <content type="html" xml:lang="en" xml:base="">
        <![CDATA[<p>I'm curious about the data these guys have been collecting. Has Google acquired that, too? It wasn't from Google alone, but the New York Times, the Internet Archive, and maybe others.</p>]]>
    </content>
    <published>2009-09-16T17:56:37Z</published>
  </entry>

</feed>
