Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Incorrect highlighting in "font is italic" preflight report in searchable pdf

New Here ,
Aug 05, 2024 Aug 05, 2024

Dear Colleagues, 

 

I have a problem. There is annoying extra space highlighted after each instance of italic text in preflight reports, which includes 1-2 symbols that are not in italic in the original document. This makes the process of comparing with a source document during its manual conversion to markdown-formatted plaing text less convenient. Attached is an example. 

 

I understand that the root of the problem that it is a searchable pdf.  But is there any possibility on how to make reports more accurate? 

 

Link to a file: https://sport-science.pro/pdf/DEC212023_01B2203.pdf 

 

incorrect_highlighting.png

 

TOPICS
Print and prepress , Scan documents and OCR
287
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Aug 05, 2024 Aug 05, 2024

I don't have this issue:

 

try67_0-1722854983463.png

 

What version of Acrobat are you using, exactly?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Aug 06, 2024 Aug 06, 2024

The latest release, Adobe Acrobat Pro 24.002.20933. I have also checked at other workplace with older version of Adobe Acrobat, result was the same.

 

Is your screenshot also a result of preflight report? You have changed the default color of errors marking, right?

 

Your report doesn't highlight Matter of Chawathe and wrongly highlights "26" which is not written in italic. Well, it does good with highlighting See, so I an interested in finding the reason of the difference between our reports.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Aug 06, 2024 Aug 06, 2024

I didn't notice this was the result of a Preflight report... I just used the Highlighting commenting tool manually.

Does that work correctly for you?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Aug 06, 2024 Aug 06, 2024
LATEST

If you examine the document you'll see that the space characters after each italic text are also in italic, so the output of the report is actually correct. I'm guessing the OCR uses the same font settings until it encounters a new one, so any spaces are automatically assigned the same font settings as that of the text before them. I don't see a way to solve this issue in Acrobat, I'm afraid,. unless you manually change it in each instance.

The best way to solve it would be to export the file to another format, like Word, make the changes there (it can be automated in Word, but not in Acrobat), and then create a new PDF file.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Aug 06, 2024 Aug 06, 2024

I understood the root of the problem. Fonts were not embedded. After fix-up (embed) reports look fine. 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines