I have an application which I use to convert PDF files in various languages, Sometimes some of the languages doesn't get converted properly, like Hebrew and few other languages, I'd like to know if there's a spicified list of supported languages so that I can know for sure beforehand sending the pdf.
Are you asking what the application supports? That's not ours (Adobe) - so I don't know if we can help here. Did I misunderstand your question?
Sorry, Ya I don't think I have phrased the question correctly,
What I would like to know is, Is there any specific languages that the PDF Extract API doesn't support?
For instance, when I send a PDF in Hebrew or Bengali language to the PDF Extract API the response I get have incorrect characters.
Attached bellow is one of the responses I got from the PDF Extract API which is from a PDF in Armenien language.
Thank you for the prompt response.
Thanks for clarifying. According to our docs:
Language: The API is currently optimized for English language content. Files containing content in other Latin languages should return good results, but may have issues with non-English punctuation.
So I think it's a bit up in the air in terms of what to expect. That being said, would you be able to share any PDFs we can use to help test?
Thanks for the reply,
Yes I'd be happy to provide the PDF file for your development, but since that's not a public document is there a way that I can send it to you without posting it publicly?
I have mailed you the files, Please do keep me posted if there's in development in this regard.
Thank you for the quick response Ben. Thats really helpful, just to make it more clear, is python sdk available for export API ? I couldn't find the same.
Yes, and currently, it's _only_ available for Extract, but you could use the REST API in Python to call other parts of our stuff. More info here: https://opensource.adobe.com/pdftools-sdk-docs/extract/latest/quickstarts.html#python
Sorry - misread Export as Extract. Please disregard. Although what I said is still right, the Python SDK only supports Extract, but you could hit the REST APIs.