How to get PDF buffer when loading the pdf with a URL

Forum|Forum|5 years ago
January 13, 2021
1 reply
3168 views

as title

thanks in advance.

This topic has been closed for replies.

Correct answer Joel Geraci

i need to parse the actual content for the real title... lots of time the metadata gives wrong one.

Get Outlook for iOS

I think I understand but let me check. The title in the metadata isn't the text that would be recognized by a human as the title of the document if they were reading it on screen. If that's the case, take a look at our PDF Tools / Extract API.

You can read about it here https://medium.com/adobetech/extract-content-structure-from-pdfs-using-ai-powered-adobe-pdf-extract-api-1593ad6b79b5

Joel Geraci

Community Expert

Can you elaborate on your question?

T

tangyunAuthor

Known Participant

Hi thanks for your reply.

basically i used the previewPDF function in window.AdobeDC.View class to render a pdf from a URL.

and after it loads the PDF automatically, i want to get a raw arrayBuffer content of the loaded PDF so that i could do some processing on it.

So i wonder if there is an API for requesting the raw pdf buffer.

I know that annotationManager.removeAnnotationsFromPDF returns the PDF buffer, but to use this function, i have to set the "IncludePDFAnnotations" to true, which I don't want, cos the additional save button will appear...

T

tangyunAuthor

Known Participant

I'm curious to hear what preprocessing you are doing. Is it something that makes sense to add to Embed API?

basically i wish to parse the pdf raw data to extract some useful information such as the title for instance.

Get Outlook for iOS

Sign up

To post, reply, or follow discussions, please sign in with your Adobe ID.

Sign in to Adobe Community

To post, reply, or follow discussions, please sign in with your Adobe ID.

Scanning file for viruses.

This file cannot be downloaded