Skip to main content
Participant
March 1, 2024
Question

Convert Pdf to Docx via API lost sentence, paragraph

  • March 1, 2024
  • 2 replies
  • 505 views

Hi team,

I am trying to convert Pdf to Docx via Adobe services API, I followed the sample here https://developer.adobe.com/document-services/docs/overview/pdf-services-api/howtos/export-pdf/. After getting output, it look fine as first, but when I check it in XML format, almost everyword become its own element, so it doesn't maintain the original sentence, paragraph anymore. Is there any option to keep the elements together as it should be?

 

Regards,

James Nguyen

 

This topic has been closed for replies.

2 replies

Bernd Alheit
Community Expert
Community Expert
March 8, 2024

Try the forum for this API.

BarlaeDC
Community Expert
Community Expert
March 8, 2024

Hi,

 

Is the PDF that you are trying to convert tagged?  if not this may be what is causing all the words to be separate.

Participant
March 8, 2024

Hi Barlae,

 

Thanks for responding, I'm not sure whether it is tagged, how can I check that.
I think it is not, as when I convert via Adobe Acrobat DC, it work as expected, only issue when I use the API
Regards,

James Nguyen