Skip to main content
Known Participant
April 25, 2016
Answered

Clearscan printed to pdf gives rectangles as recognized text.

  • April 25, 2016
  • 2 replies
  • 669 views

Hello,

I scanned a file and imported it with clearscan OCR. Everything is perfect: the text when copied is exactly is displayed.

If I now print this pdf to adobe pdf (I have to scale the page) everything looks perfect but when I copy the text I only get scrambled text mostly rectangles.

Acrobat X, Windows 7.

This topic has been closed for replies.
Correct answer Test Screen Name

I think you may be stuck. You cannot expect to redistill clearscan OCR without bad things happening to the text: in general, never redistill (it's been made impossible on the Mac, but Windows they have not closed this gap).

2 replies

Test Screen NameCorrect answer
Legend
April 26, 2016

I think you may be stuck. You cannot expect to redistill clearscan OCR without bad things happening to the text: in general, never redistill (it's been made impossible on the Mac, but Windows they have not closed this gap).

Legend
April 26, 2016

Printing PDF to PDF is considered a Very Bad Thing, and this sort of problem can be expected. Just don't do it!

Perhaps you can scan to TIFF, scale the TIFF, and then import to Acrobat the right size.

Known Participant
April 26, 2016

Thank you for your suggestion but:

The problem is that the pages to scan are title pages with a very big font. Acrobat X doesn't recognize these titles as text unless I change the print size of the bmp before importing in Acrobat. Just because of changing the print size Acrobat does recognize the text. But now I have to scale the pages in the pdf to get the original page size so that the title pages conform to the rest of the (normal print) pages.

Thank you.