Extract PDF API returns BAD_PDF_UNSUPPORTED_FONT
I've created a PDF using Adobe InDesign and was attempting to extract the text from the PDF using the API. However, I'm getting the following error:
- Known exception encountered while executing operation ServiceApiError: BAD_PDF - Unable to extract content.: The input file contains font data that is corrupted or not supported
Then further down:
- BAD_PDF_UNSUPPORTED_FONT
When I use the Export PDF API to turn the PDF file into a MSWord .docx file, then use Word to print to PDF, and try the Extract PDF API on the modified pdf file, I dont encounter the same problem.
Does anyone know of a way to make the Extract API more forgiving? IE Allowing me to get the desired result without jumping through additional hoops? Or why the original PDF has been generated in a way that the Extract API doesnt like the font metadata.
