Copy link to clipboard
Copied
Trying to convert a .PDF to an Excel file and when completed, only thing that shows in the resulting Excel worksheet are the lines of the table I am exporting. It was working perfectly up until about 4-6 weeks ago.
Please help. Desperate!
Copy link to clipboard
Copied
Most likely because of an issue with the font used in the PDF. What happens
if you copy the text in the PDF, copy it and paste it into something like
Notepad?
On 8 January 2018 at 17:59, stephanief91464426 <forums_noreply@adobe.com>
Copy link to clipboard
Copied
I tried converting and got a load of illegible symbols. Dont know why !
Copy link to clipboard
Copied
Most likely because of an issue with the font used in the PDF. What happens
if you copy the text in the PDF, copy it and paste it into something like
Notepad?
On 8 January 2018 at 17:59, stephanief91464426 <forums_noreply@adobe.com>
Copy link to clipboard
Copied
I'll try that , thanks
Copy link to clipboard
Copied
No, it wont have any of it. Will print out pdf's and type into excel by hand. An afternoon's work and 1000 entries pffft.
Copy link to clipboard
Copied
Á¤£@Õ¤ z@ôôðð@ööõ÷@øóøò@ôø÷ñ Ä @ñò@`@Ѥ¨@ññk@òðñ÷@
Á¤£@É£z
|
|
¦¦¦K K
Ô@@¤ ¢@£z Â@@Á
Ô@¨ £¢@£z Â@@Á
䢣 @â ¥ z ñKøððKôòñKòññð
㢣¢
㢣 ×¢£ Ù Á¤£
Ä£ Ä£ Ä ¢£ Õ¤ Õ¤ Á¤£ ã£
ר £¢@@Ö£ @à £¢
ðñaðõ ðñaðõ Ö @¨ £@@ÃÈÒ@÷ öôõó ôø÷ñ 6ù÷óKöõ
Copy link to clipboard
Copied
Then the font(s) used in these files are badly encoded.
Before you do that, there is another option: Re-creating the PDF files using Acrobat.
To do that you would need to first export all the pages as images (using a lossless format, like PNG), via File - Save As Other - Image, and then create a new PDF from those images and run Text Recognition on it. See if that helps...
Copy link to clipboard
Copied
Really think I'll be quicker extracting the lines I need. Thanks anyhow.
Copy link to clipboard
Copied
You just wrote you have a 1000 entries to extract... But whatever. Good luck!
Copy link to clipboard
Copied
Exactly...that's why I tried to export it all. Emphasis being on tried. The font is crap so I can understand that not everything will work. i'll have it done by tomorrow lunchtime and a tree worth of paper to shred. I hate it when bank statements get downloaded as pdf not csv. There's another 10 different accounts to do after that, so I shall try your approach with them. Many thanks i know a bit more now
Copy link to clipboard
Copied
Whenever Acrobat cannot extract information from a file (or correctly convert to to a different document format), one way to get a potentially better result is to throw away all the font information that is stored in the PDF file, and the best way to do that is to export your document as a series of high quality TIFF images, and then re-import these images into Acrobat to create a new PDF file. You can then run OCR to convert to text that can be extracted again (this last step can also be done as part of the export function if you have a newer version of Acrobat).
Copy link to clipboard
Copied
Many thanks Karl, I will try that
[personal data removed by forum moderator]
Find more inspiration, events, and resources on the new Adobe Community
Explore Now