How Will ReCAPTCHA Help Google?
By Channel Web
Those aggravating strings of nonsensical words you have to key into your computer when trying to send someone an e-mail or buy tickets online -- they're called CAPTCHAs -- are actually doing a job that Google (NSDQ:GOOG) finds pretty valuable. But in its zeal to teach computers to, in essence, "read," Google may be chipping away at the ability of CAPTCHAs to provide security.
Google will buy ReCAPTCHA, a company that provides CAPTCHAs to help protect more than 100,000 Web sites from spam and fraud. Financial terms were not disclosed.
CAPTCHAs (Completely Automated Public Turing Test To Tell Computers and Humans Apart) are often created from old pieces of text, including books and newspapers.
Computers, through programs called "spiders" or "robots," find it hard to recognize those words because the ink and paper have degraded over time. So far, CAPTCHA programs have been successful in deflecting robotic attacks because the spiders can't recognize the text.
But in addition, the technology can help Google in its large-scale -- and controversial -- text scanning projects such as Google Books and Google News Archive Search.
Google wants the technology because as users decipher the jumbled characters with CAPTCHAs, the software "learns" to interpret those words during organic seo searches.
The technology that Google now uses to scan documents, Optical Character Reader Recognition, stumbles over the translation of print that's faded and worn. ReCAPTCHA's Web site illustrates that accuracy problem.
Because ReCAPTCHA uses old text from old print publications and users then type them in as a CAPTCHA, users teach computers to read the scanned text. Having the text version of documents is beneficial because it facilitates searching and renders it easily on mobile devices. ReCAPTCHA's slogan, "Stop Spam. Read Books," seems to be a good fit with Google's plan.
Therein may lie the rub, however. As we teach computers to read, CAPTCHAs may lose their appeal as a security mechanism. Of course, as hackers' malicious software also becomes more sophisticated, the distorted character strategy would become threatened anyway. Google's next step may need to address that security concern.