Skip to main content
Participating Frequently
May 24, 2024
Question

Status Code : 400 "code": "CORRUPT_DOCUMENT", "message": "The input file appears to be corrupted

  • May 24, 2024
  • 2 replies
  • 4838 views

Hello,

  I am trying to use Adobe API for PDF processing, but I receive an error: 


{
"error": {
"code": "CORRUPT_DOCUMENT",
"message": "The input file appears to be corrupted and cannot be processed.; requestId=09b7def5-1615-4584-9551-d925e7e12b42"
}
}


 My request body is:
  
{
"$content-type": "multipart/form-data",
"$multipart": [
{
"headers": {
"Content-Disposition": "form-data; name=\"inputFileName\""
},
"body": "OoPdfFormExample.pdf"
},
{
"headers": {
"Content-Disposition": "form-data; name=\"intent\""
},
"body": "PDF_TO_EXCEL"
},
{
"headers": {
"Content-Disposition": "form-data; name=\"targetFormat\""
},
"body": "XLSX"
},
{
"headers": {
"Content-Disposition": "form-data; name=\"InputFile0\""
},
"body": "base64Content"
}
]
}
  
  If I decode the base64Content, I can regenerate my PDF file back, so I don't think there is an issue on the file side.

  I also tried with other sample files, but everytime I have same error code in response.

 

Can I know the root cause of this issue ?

This topic has been closed for replies.

2 replies

Participating Frequently
May 27, 2024

Request and response files are attached

Raymond Camden
Community Manager
Community Manager
May 24, 2024

This is not at all how the REST API works. Did you use the docs here? https://developer.adobe.com/document-services/docs/apis/

Participating Frequently
May 25, 2024

Hello,

  No, I didn't use the docs you mentioned here, as I am acesssing from Azure Logic App through my clientID and token, to access path: /operation/v1/exportPDFToExcel.

  Do you also need the raw request (header & body) if the request ID isn't enough to diagnose the issue ?

  

  

Participating Frequently
May 29, 2024

Eh, it's a bit hard to read with the German I believe. First off, where are you using base64? The connector expects the binary bits, not base64.


its little bit correct what you suggested, actually base64toBinary() has to be done

It would be applicable for any pdf to supported format conversion.

base64toBinary(@{items('For_each')?['ContentBytes']})

 

In case of prod technical issues, what is the support response time (SLA)  if enterprise plan is opted ?