CAPTCHA stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart. It is a program that distinguishes human users from computers: it must be able to generate and grade tests that humans pass easily but computers cannot.
http://baike.baidu.com/view/538168.htm
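Put simply, the verification codes we see on websites are CAPTCHAs.
Carnegie Mellon University (CMU) designed a powerful system called reCAPTCHA that lets its computers ask humans for help. It works like this: scanned images of text that OCR software could not recognize are sent to major websites around the world, where they replace the sites' original CAPTCHA images; once the users of those sites correctly identify the text, their answers are sent back to CMU.
Google acquired the reCAPTCHA system in 2009. Its interface resembles an ordinary CAPTCHA, but it is more secure. A challenge usually consists of two words: one that the machine can already read (used to test whether the visitor is human) and one that it cannot (used to help digitize the text of books).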
http://baike.baidu.com/view/1944705.htm
http://jandan.net/2013/02/17/recaptcha.html
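The two-word scheme described above can be sketched roughly as follows. This is only an illustration, not the actual implementation used by CMU or Google: the names `Challenge`, `TwoWordCaptcha`, `submit` and `resolved`, the case-insensitive comparison, and the agreement threshold of 3 votes are all assumptions made for the example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Challenge:
    """One two-word challenge (hypothetical structure)."""
    control_answer: str  # text of the word the system already knows
    unknown_id: str      # identifier of the scanned word OCR could not read


class TwoWordCaptcha:
    def __init__(self, votes_needed=3):
        # Assumed rule: an unknown word counts as transcribed once the
        # most common answer has been given this many times.
        self.votes_needed = votes_needed
        self.votes = {}  # unknown_id -> Counter of submitted answers

    def submit(self, challenge, control_guess, unknown_guess):
        """Return True if the visitor passes the human test."""
        # Only the control word decides whether the visitor passes.
        if control_guess.strip().lower() != challenge.control_answer.lower():
            return False
        # A visitor who got the control word right is trusted enough
        # to cast one vote on the unread word.
        answers = self.votes.setdefault(challenge.unknown_id, Counter())
        answers[unknown_guess.strip().lower()] += 1
        return True

    def resolved(self, unknown_id):
        """Return the agreed transcription, or None if there is no consensus yet."""
        answers = self.votes.get(unknown_id)
        if not answers:
            return None
        word, count = answers.most_common(1)[0]
        return word if count >= self.votes_needed else None
```

The point of the design is that the visitor cannot tell which of the two words is the control, so an honest answer to both is the easiest way to pass, and the answer to the unread word becomes one vote toward its transcription.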
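When its Guatemalan inventor created this CAPTCHA program, the goal was to keep computers from reading the codes and abusing accounts, so people were asked to identify images that machines find hard to read. The clever part is that Google Books uses it to have people identify text that machine scanning could not recognize: whenever we type in a CAPTCHA, we are in fact doing text recognition work for Google.
A demo of this CAPTCHA is available at: http://www.google.com/recaptcha/mailhide/
Some screenshots:
Here is how it works:
Today's OCR is still not very accurate, as the image below shows.
The scenario is this: one day, a machine scans a book in order to convert it into an electronic edition:
The processed output looks like this: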
are [xxxxxxxxxx] energetically for the campaign: Several prominent Democrats who at first
favoredDonoLea, are coming out. for the other aide, apparently under the [xxxxxxxx] of Federal
[xxxxxxxxx]. An address to the National Democracy of ,1ifornia, urging the party to
supportHaeeslipslDas, has recently been published, which manifestlybss strengthened that aide of
the [xxxxxxxxx]: It is signed by 65 Democrats, many of whom occupy respectab e and prominent
positions in the party, 22 of them are Federal office-holders,[xxxxx] more are recipients of
Federal patronage, and the others represent a mass of politicians giving the document [xxxx][xxxxxx]
mTheDcu8las Democrats are also active The Irish and German vote will mostly go with ths# branch of
the party, but it is[xxxxxxxxx] to [xxxxxxxx] [xxxxx] [xxxx] [xx] the stronger. Thus far 17 IT
newspapers have declared for DonGres, 13 for Base$- IaaIDGS and 9 remain non-committal, with even
chances of going either way. Under these circumstances the Republicans entertain not unjustifiable
hopes that the Democratic divisions may be so equal,- ly balanced as to give the State [xx]
LIaCOLV.Same very [xxxxxxx] Bell and Everett meetings have been held in different parts of the State,
bat thus far that party does not exhibit much rank sad ale air en.
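Assuming the bracketed runs such as `[xxxxxxxxxx]` in the output above mark words the OCR engine could not read (this is an interpretation; the source does not say so explicitly), a small sketch can list them as candidates to hand to human readers. The function name `find_unread_words` is hypothetical.

```python
import re

# Assumption: a run of x's inside square brackets stands for one unread
# word, and the number of x's approximates that word's length.
PLACEHOLDER = re.compile(r"\[(x+)\]")


def find_unread_words(ocr_text):
    """Return (character offset, estimated length) for each placeholder."""
    return [(m.start(), len(m.group(1))) for m in PLACEHOLDER.finditer(ocr_text)]
```

Each offset could then be mapped back to the matching region of the scanned page and shown to visitors as the unknown word in a challenge. Note that this only finds the words the OCR engine gave up on; misread words such as `favoredDonoLea` in the sample are not flagged.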