Free Republic
Browse · Search
News/Activism
Topics · Post Article

To: buzzer

Your assumption about how OCR software works is wrong. The image is always maintained. It does not change because of the OCR, error or no error. The OCR generates a text layer that may have an error, but that error would only show up when you copied and pasted the text elsewhere. Viewing the PDF with a generic viewer would show the original image. So the TXE is in the original image.

Not saying this makes it fake or otherwise but it certainly does raise flags.


98 posted on 04/27/2011 3:58:18 PM PDT by Trityn (FUBO and the Soros you rode in on.)
[ Post Reply | Private Reply | To 69 | View Replies ]


To: Trityn

Question - if this is a fraudulant doc (I can’t make a case either way yet), wouldn’t the Registrar, Alvin Onaka (a Ph.D.) be at risk having his stamped signature at the bottom ?

Also, there are so many opinions on PDF layers (both pro and con) that it is difficult to make a judgment. What do you think ? Thx !


99 posted on 04/27/2011 4:06:32 PM PDT by rocco55
[ Post Reply | Private Reply | To 98 | View Replies ]

To: Trityn

“Your assumption about how OCR software works is wrong.”
No it’s not. Take a look on the ocropus soucrcode[1] for example. It does seperate the document into “blocks” and the remaining “background”. Each of the blocks is then OCRed.

“The image is always maintained.”
It’s not. Usually the ocr engine decides which elements it will store as “image” and which it will store as “text”. Usually it tries to store as much as “text” as possible to make the document indexable/searchable.

“It does not change because of the OCR, error or no error. The OCR generates a text layer that may have an error, but that error would only show up when you copied and pasted the text elsewhere.”

No it doesn’t. The “text layer” as you described it is shown as you can easily see. The PDF file format doesn’t allow text to be “hidden” It can only get covered by another layer, get replaced or get removed. That’s why OCR composed PDFs do not contain everything as a image. See the PDF spec on [2][3]. You need a basic understanding of what the PDF file format describes and how it get’s rendered to understand.

“Viewing the PDF with a generic viewer would show the original image. So the TXE is in the original image.”

No. a ordinary PDF viewer would do what it’s assumed to do. Put the “background layer” into the background and the “text layer” on top of it.

“Not saying this makes it fake or otherwise but it certainly does raise flags.”

No it doesn’t. Not that i think obozo is a good president, i think he’s even worse then carter, but all these birthers and their paranoid conspiracy plots make all of us conservatives look like total loons. Even if they would have been witness to his birth and and seen it with their own eyes they would still claim that they didn’t knew where they have been at this moment.

[1]http://code.google.com/p/ocropus/
[2]http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/adobe_supplement_iso32000.pdf
[3]http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/adobe_supplement_iso32000_1.pdf


107 posted on 04/27/2011 10:18:56 PM PDT by buzzer
[ Post Reply | Private Reply | To 98 | View Replies ]

Free Republic
Browse · Search
News/Activism
Topics · Post Article


FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson