Skip to main content
Participant
June 23, 2014
Question

How do I convert pdf to a clean txt file?

  • June 23, 2014
  • 1 reply
  • 14799 views

I have been trying to convert a pdf file (I can send the file) to a txt file (I can send the file) that look similar.  I have tried multiple ways and seem to end up with a text file that doesn't completely match the pdf.

This topic has been closed for replies.

1 reply

Community Manager
June 23, 2014

Hi gill_ec,

I am sorry that you're having trouble converting the file. Are you Saving As .txt from Acrobat, or using ExportPDF to convert to a .RTF file, perhaps? (ExportPDF doesn't convert PDF to .txt).

If you are truly converting to .txt, I wouldn't expect the two files to look similar, as all formatting and images would be removed in the .txt file. Now, the .RTF file should more closely match the original PDF. Can you please clarify how you're doing the conversion, what format you are converting to, and how the converted file differs from the original?

Looking forward to hearing back from you, so we can sort this out.


Best,

Sara

gill_ecAuthor
Participant
June 23, 2014

Sara,

Here are my steps:

1. I save the file as a .pdf (refer to attached)

2. I open the .pdf from cloud.adobe.com and have converted the file to a .doc and .rtf (refer to attached)

3. When I open the .doc and .rtf files and Word and save as plain text, the result is less than desirable (refer to attached)

wtys,

Ed Cotter

Senior Millennium Consultant

ecotter@hpgresources.com<mailto:ecotter@hpgresources.com>

Cell: (207) 356-5680 | Office (207) 884-6205

www.HPGresources.com<http://www.hpgresources.com/>

<http://www.linkedin.com/pub/edward-cotter/14/894/670>

Community Manager
June 23, 2014

Sara,

Here are the links.

s_pref_card_epic_extract_efc_ to text.pdf - https://cloud.acrobat.com/file/8bdf44ed-bc98-4ade-8c90-b7b528095a53

s_pref_card_epic_extract_efc_ to text.doc - https://cloud.acrobat.com/file/889511f2-7a0a-4616-b0de-71f22ec00984

s_pref_card_epic_extract_efc_ to text.txt - https://cloud.acrobat.com/file/1ff704b1-341f-46a1-9b53-fcc969b3b355

wtys,

Ed Cotter

Senior Millennium Consultant

ecotter@hpgresources.com<mailto:ecotter@hpgresources.com>

Cell: (207) 356-5680 | Office (207) 884-6205

www.HPGresources.com<http://www.hpgresources.com/>

<http://www.linkedin.com/pub/edward-cotter/14/894/670>


Hi Ed,

I've had a chance to look at your files, and the conversion from PDF to Word format looks pretty darn good. Because of the nature of plain text format, the results you are getting seem appropriate (there's no formatting information, so plain text usually isn't very nice to look at).

I guess the question I have is why you are taking that extra step to go from Word to plain text? How are you using that file, that dictates that it needs to be in .txt format? The fewer the conversions the better, I say.

Best,

Sara