Copy link to clipboard
Copied
I've created a PDF using Adobe InDesign and was attempting to extract the text from the PDF using the API. However, I'm getting the following error:
Then further down:
When I use the Export PDF API to turn the PDF file into a MSWord .docx file, then use Word to print to PDF, and try the Extract PDF API on the modified pdf file, I dont encounter the same problem.
Does anyone know of a way to make the Extract API more forgiving? IE Allowing me to get the desired result without jumping through additional hoops? Or why the original PDF has been generated in a way that the Extract API doesnt like the font metadata.
An example PDF is attached.
Copy link to clipboard
Copied
Can you share the PDF in question?
Copy link to clipboard
Copied
Hi Joel, I've added a copy of the knitting pattern called "flax sweater" as a separater reply to my original post.
Copy link to clipboard
Copied
Copy link to clipboard
Copied
Copy link to clipboard
Copied
Thanks for taking a look Joel. There's a couple of things that don't seem right to me about this:
Copy link to clipboard
Copied
I wish I had answers for you. Fonts are really complicated.