Skip to main content
Participant
June 23, 2014
Question

How do I convert pdf to a clean txt file?

  • June 23, 2014
  • 1 reply
  • 14799 views

I have been trying to convert a pdf file (I can send the file) to a txt file (I can send the file) that look similar.  I have tried multiple ways and seem to end up with a text file that doesn't completely match the pdf.

This topic has been closed for replies.

1 reply

Community Manager
June 23, 2014

Hi gill_ec,

I am sorry that you're having trouble converting the file. Are you Saving As .txt from Acrobat, or using ExportPDF to convert to a .RTF file, perhaps? (ExportPDF doesn't convert PDF to .txt).

If you are truly converting to .txt, I wouldn't expect the two files to look similar, as all formatting and images would be removed in the .txt file. Now, the .RTF file should more closely match the original PDF. Can you please clarify how you're doing the conversion, what format you are converting to, and how the converted file differs from the original?

Looking forward to hearing back from you, so we can sort this out.


Best,

Sara

gill_ecAuthor
Participant
June 23, 2014

Sara,

Here are my steps:

1. I save the file as a .pdf (refer to attached)

2. I open the .pdf from cloud.adobe.com and have converted the file to a .doc and .rtf (refer to attached)

3. When I open the .doc and .rtf files and Word and save as plain text, the result is less than desirable (refer to attached)

wtys,

Ed Cotter

Senior Millennium Consultant

ecotter@hpgresources.com<mailto:ecotter@hpgresources.com>

Cell: (207) 356-5680 | Office (207) 884-6205

www.HPGresources.com<http://www.hpgresources.com/>

<http://www.linkedin.com/pub/edward-cotter/14/894/670>

gill_ecAuthor
Participant
June 24, 2014

Hi Ed,

I've had a chance to look at your files, and the conversion from PDF to Word format looks pretty darn good. Because of the nature of plain text format, the results you are getting seem appropriate (there's no formatting information, so plain text usually isn't very nice to look at).

I guess the question I have is why you are taking that extra step to go from Word to plain text? How are you using that file, that dictates that it needs to be in .txt format? The fewer the conversions the better, I say.

Best,

Sara


Sara,

That conversion, .pdf to .doc, it is. Unfortunately, I need a .txt file per vendor request. My current location is using one software system which I can generate a report but can only generate a .pdf file. The .pdf file is a report of current system information. The new system that the information is going to can only utilize the report as a .txt file. I know it seems archaic. All the advances and system information input is reduced to utilizing .txt files. That’s where I’m stuck. I’m trying to save hundreds of hours of manual data entry, but the only way to accomplish that is utilizing .txt file entry of the data.

wtys,

Ed Cotter

Senior Millennium Consultant

ecotter@hpgresources.com<mailto:ecotter@hpgresources.com>

Cell: (207) 356-5680 | Office (207) 884-6205

www.HPGresources.com<http://www.hpgresources.com/>

<http://www.linkedin.com/pub/edward-cotter/14/894/670>