Participant
January 24, 2021
Answered

How to Convert PDF Photo of Newspaper Article

I subscribe to Newspapers.com, where I clip newspaper articles and download them to my computer as PDF files. I then try to convert those PDFs to Word. When I do, the Word document contains only a photo image and I cannot edit the text. How can I capture the text of the newspaper article in an editable Word document? I need instructions. See the sample file below.

Correct answer Bevi Chagnon - PubCom.com

3 replies

Abambo
Community Expert
June 14, 2024

@YT Editing Hu38036276gndq ,

Do you have a message for us?

ABAMBO | Hard- and Software Engineer | Photographer
Bevi Chagnon - PubCom.com
Legend
January 24, 2021

Your downloaded PDFs are graphics of the newspaper pages, not live editable text.

You can attempt to convert them to editable, searchable text with Acrobat's OCR tool, Scan & OCR. It will attempt to interpret the graphical text and give you something you can then export to an MS Word .docx file.

You'll need either Acrobat Pro or Acrobat Standard to do this, not the free Reader.

Once the text is recognized (OCR'ed), you can use Save As with Save As Type = Word Document (.docx) and open the recovered text in MS Word.
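For readers who want to see the same two steps (recognize the text, then write a .docx) outside Acrobat, here is a minimal Python sketch. It is not what Acrobat does internally. It assumes the third-party packages pdf2image, pytesseract, and python-docx are installed, along with the Poppler and Tesseract programs they rely on. The function names `ocr_pages` and `pdf_to_docx` are made up for this illustration.

```python
from typing import Callable, Iterable, List, Optional


def ocr_pages(images: Iterable, recognize: Optional[Callable] = None) -> List[str]:
    """Run OCR on each page image and return one text string per page."""
    if recognize is None:
        # Imported lazily so the function can be tried with a stand-in
        # recognizer on a machine without Tesseract (assumption: pytesseract).
        import pytesseract
        recognize = pytesseract.image_to_string
    return [recognize(image).strip() for image in images]


def pdf_to_docx(pdf_path: str, docx_path: str, dpi: int = 300) -> None:
    """Render each PDF page to an image, OCR it, and save the text as .docx."""
    from pdf2image import convert_from_path  # requires Poppler
    from docx import Document                # provided by python-docx

    pages = convert_from_path(pdf_path, dpi=dpi)
    document = Document()
    for page_text in ocr_pages(pages):
        # Treat blank lines in the OCR output as paragraph boundaries.
        for paragraph in page_text.split("\n\n"):
            cleaned = " ".join(paragraph.split())
            if cleaned:
                document.add_paragraph(cleaned)
    document.save(docx_path)
```

Calling `pdf_to_docx("clipping.pdf", "clipping.docx")` would write plain paragraphs with no newspaper layout. As with Acrobat, old or low-resolution scans will contain recognition errors that need proofreading.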

 

| Bevi Chagnon | Designer, Trainer, & Technologist for Accessible Documents | PubCom | Classes & Books for Accessible InDesign, PDFs & MS Office |
Participant
July 2, 2023

Hi, I did this, and now I have the same image but as a Word file. Is there a way to get just the text, so that it looks like a standard Word document and not a newspaper?

Abambo
Community Expert
July 2, 2023

Did you run OCR on the file? The problem is probably that the news clippings are images. If you convert a PDF image to Word, you get a Word image. I would expect that.

To get the text as text (and real images as images), you need to run an OCR program on the image. Such a program tries to find text and convert it back into computer-readable text.

When you export to Word, you get the result of this OCR as a Word file, but Acrobat tries to keep the formatting as it was, so you will probably need to modify it to get a nice Word file. It helps if you work with layouts that are not too complex.
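If you only want the words and not the newspaper layout, one option is to copy the recognized text out as plain text and reflow it. The sketch below is a rough illustration in Python using only the standard library. The function name `reflow` is made up here, and the rule for joining hyphenated words is an assumption that will also merge genuine compounds split at a line end (for example "well-" followed by "known").

```python
import re


def reflow(ocr_text: str) -> str:
    """Turn narrow newspaper-column lines into ordinary paragraphs."""
    paragraphs = []
    # Blank lines in the OCR output are treated as paragraph boundaries.
    for block in re.split(r"\n\s*\n", ocr_text.strip()):
        # Rejoin words hyphenated across a line break: "coun-\ncil" -> "council".
        block = re.sub(r"(\w)-[ \t]*\n[ \t]*(\w)", r"\1\2", block)
        # Collapse the remaining line breaks and repeated spaces.
        block = " ".join(block.split())
        if block:
            paragraphs.append(block)
    return "\n\n".join(paragraphs)


sample = "The city coun-\ncil met on\nTuesday night.\n\nA second para-\ngraph follows."
print(reflow(sample))
```

The result can be pasted into a new Word document and formatted from scratch, which is often quicker than untangling the columns and text boxes that a layout-preserving export produces.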

ABAMBO | Hard- and Software Engineer | Photographer
gary_sc
Community Expert
January 24, 2021

I just took a look at that website and noticed that it's carrying newspaper articles from way way way back. That's cool!

 

However, they have simply posted images saved in the PDF format, like the one linked in your question. The pages were never made searchable; they are just images inside a PDF. I was easily able to make it searchable, though, because I have Acrobat Pro DC.
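A quick way to confirm this is to try selecting or searching for text in the PDF: if nothing can be selected, the page is an image. The Python sketch below shows a rough programmatic version of the same check. It is only a heuristic, and the function name `looks_image_only` is made up for this example. It relies on the fact that live text in a PDF needs a font resource, so a file that mentions images but no fonts is probably image-only. PDFs that keep their dictionaries in compressed object streams can hide the font marker, so treat a True result as a hint, not proof.

```python
import re


def looks_image_only(pdf_bytes: bytes) -> bool:
    """Heuristic: True if the raw PDF mentions images but no fonts."""
    # Text drawn on a page needs a font resource, so a PDF with no /Font
    # entry at all most likely contains no live text.
    has_font = b"/Font" in pdf_bytes
    # Embedded pictures are stored as XObjects with /Subtype /Image.
    has_image = re.search(rb"/Subtype\s*/Image", pdf_bytes) is not None
    return has_image and not has_font
```

To try it on a download, read the file in binary mode, for example `looks_image_only(open("clipping.pdf", "rb").read())`, where `clipping.pdf` is a placeholder file name.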

 

If you are using Adobe Reader, you cannot do this and will need to upgrade your Acrobat application. If you are on Windows, you can upgrade to Acrobat Standard or Acrobat Pro. If you are on a Mac, the only option is Acrobat Pro.

 

Please note that I do not work for Adobe. Like most people in these forums, I am just someone with Acrobat experience who is willing to take the time to help others out. I do not receive a commission for any sales; I'm just answering your question to the best of my ability.