• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
0

Text duplicated in PDF

Community Beginner ,
Mar 16, 2023 Mar 16, 2023

Copy link to clipboard

Copied

I'm not sure how to fully explain this issue. I have a PDF that I need to be able to highlight, but when I try, it highlights multiple "invisible" boxes.

caileec5637304_0-1678992437726.png

 

When I optimize the text it looks like this, and you can see it has the article multiplied over top of itself:

caileec5637304_1-1678992501134.png

There are no layers, but when I go into edit, you can see all of the bounding boxes for text:

caileec5637304_2-1678992543346.png

Is there an easy way to clean up this pdf? I've never seen anything like this before.

TOPICS
Edit and convert PDFs , PDF forms , Scan documents and OCR

Views

2.0K

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Community Expert , Mar 16, 2023 Mar 16, 2023

Export the pages as TIFF files, create a new PDF file from the TIFF files, and use OCR on the new file.

Votes

Translate

Translate
Community Expert ,
Mar 16, 2023 Mar 16, 2023

Copy link to clipboard

Copied

Export the pages as TIFF files, create a new PDF file from the TIFF files, and use OCR on the new file.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Mar 16, 2023 Mar 16, 2023

Copy link to clipboard

Copied

This worked great! Thank you. 🙂

 

Would you by chance know how to clean up the text? It seemed to make the blue text an image and its a lower quality. I ran it through to save it as a Press-Ready PDF and it seemed to make it worse.

caileec5637304_0-1678996497773.png

 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Mar 17, 2023 Mar 17, 2023

Copy link to clipboard

Copied

What OCR options does you use?

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Mar 17, 2023 Mar 17, 2023

Copy link to clipboard

Copied

Here's what my settings look like under Enhance

caileec5637304_0-1679063612005.png

 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Mar 18, 2023 Mar 18, 2023

Copy link to clipboard

Copied

LATEST

Try the use of Recognize Text at Scan & OCR.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines