Dear Colleagues,
I have a problem. In Preflight reports, an annoying extra space is highlighted after each instance of italic text, covering 1-2 characters that are not italic in the original document. This makes it less convenient to compare the report with the source document while manually converting it to Markdown-formatted plain text. An example is attached.
I understand that the root of the problem is that this is a searchable PDF. But is there any way to make the reports more accurate?
Link to a file: https://sport-science.pro/pdf/DEC212023_01B2203.pdf
I don't have this issue:
What version of Acrobat are you using, exactly?
The latest release, Adobe Acrobat Pro 24.002.20933. I also checked at another workplace with an older version of Adobe Acrobat, and the result was the same.
Is your screenshot also the result of a Preflight report? You changed the default color of the error markings, right?
Your report doesn't highlight Matter of Chawathe and wrongly highlights "26", which is not in italic. It does highlight See correctly, though, so I am interested in finding the reason for the difference between our reports.
I didn't notice this was the result of a Preflight report... I just used the Highlighting commenting tool manually.
Does that work correctly for you?
If you examine the document, you'll see that the space characters after each run of italic text are also in italic, so the output of the report is actually correct. I'm guessing the OCR keeps using the same font settings until it encounters a new one, so any spaces are automatically assigned the font settings of the text before them. I don't see a way to solve this issue in Acrobat, I'm afraid, unless you manually change it in each instance.
The best way to solve it would be to export the file to another format, like Word, make the changes there (it can be automated in Word, but not in Acrobat), and then create a new PDF file.
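If the end goal is Markdown rather than a corrected PDF, the stray italic spaces can also be handled at conversion time. The following is only a minimal sketch of that idea, not anything Acrobat or Word provides: it assumes the text has already been extracted into a list of (text, is_italic) pairs by some other tool, and the function name `spans_to_markdown` and the sample spans are hypothetical.

```python
from typing import List, Tuple

# Hypothetical input shape: (text, is_italic) pairs in reading order.
Span = Tuple[str, bool]


def spans_to_markdown(spans: List[Span]) -> str:
    """Wrap italic runs in *...*, keeping surrounding whitespace outside the markers."""
    parts = []
    for text, is_italic in spans:
        core = text.strip()
        if not is_italic or not core:
            # Plain text, or an italic run made only of whitespace: emit unchanged.
            parts.append(text)
            continue
        # Split off the leading and trailing whitespace that inherited the italic font.
        lead = text[: len(text) - len(text.lstrip())]
        trail = text[len(text.rstrip()):]
        parts.append(f"{lead}*{core}*{trail}")
    return "".join(parts)


# Made-up sample: the trailing spaces are flagged italic, as in the OCR output described above.
sample = [
    ("See ", True),
    ("also ", False),
    ("Matter of Chawathe ", True),
    ("for details.", False),
]
print(spans_to_markdown(sample))
```

Only whitespace is moved outside the markers here; punctuation that was wrongly flagged as italic would still need a manual check.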
I found the root of the problem: the fonts were not embedded. After applying the fixup that embeds fonts, the reports look fine.