Skip to main content
Known Participant
January 13, 2021
Answered

How to get PDF buffer when loading the pdf with a URL

  • January 13, 2021
  • 1 reply
  • 3168 views

as title

 

thanks in advance. 

This topic has been closed for replies.
Correct answer Joel Geraci
i need to parse the actual content for the real title... lots of time the metadata gives wrong one.

Get Outlook for iOS

I think I understand but let me check. The title in the metadata isn't the text that would be recognized by a human as the title of the document if they were reading it on screen. If that's the case, take a look at our PDF Tools / Extract API. 

 

You can read about it here https://medium.com/adobetech/extract-content-structure-from-pdfs-using-ai-powered-adobe-pdf-extract-api-1593ad6b79b5

 

1 reply

Joel Geraci
Community Expert
Community Expert
January 13, 2021

Can you elaborate on your question?

tangyunAuthor
Known Participant
January 14, 2021

Hi thanks for your reply.

 

basically i used the previewPDF function in window.AdobeDC.View class to render a pdf from a URL. 

 

and after it loads the PDF automatically, i want to get a raw arrayBuffer content of the loaded PDF so that i could do some processing on it. 

 

So i wonder if there is an API for requesting the raw pdf buffer.

 

I know that annotationManager.removeAnnotationsFromPDF returns the PDF buffer, but to use this function, i have to set the "IncludePDFAnnotations" to true, which I don't want, cos the additional save button will appear...

tangyunAuthor
Known Participant
January 20, 2021

I'm curious to hear what preprocessing you are doing. Is it something that makes sense to add to Embed API? 


basically i wish to parse the pdf raw data to extract some useful information such as the title for instance.

Get Outlook for iOS