Replies

So if I understand this correctly, people solve the little tests, jumbled letters, and now those solved tests are actually going to be whole digital books down the road?

2 posted on 05/25/2007 5:49:02 PM PDT by padre35 (we are surrounded that simplifies things-Chesty Puller)

As I understand it, they are scanning in old books, and then using the words that the scanner can’t read as test words. When lots of responses are the same for the word, then they know what the word was that the OCR scanner couldn’t read.

3 posted on 05/25/2007 5:57:40 PM PDT by Gondring (I'll give up my right to die when hell freezes over my dead body!)

So if I understand this correctly, people solve the little tests, jumbled letters, and now those solved tests are actually going to be whole digital books down the road?

You're close. The little tests (CAPTCHAs) are how Web logins are handled now. This proposal would substitute images of scannable book text for the puzzles. Each scannable passage would be repeated to several different users, so their answers could be compared. Instead of having to have a computer scan the text, the logon users' answers would "vote" on the correct text corresponding to the image.

This technique would be used to input Gutenberg bibles and other texts that would be difficult to scan with conventional means.

9 posted on 05/25/2007 6:49:14 PM PDT by BlazingArizona

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794