Why I am getting extra attributes Even don't have into original pdf?

Question

Hi Community members,I am exploring the pdf OCR and EXTRACT APIs ( OCR, EXTRACT  )I have a Scanned pdf so to make it editable i applied the OCR and then for the pdf style and content information i am using the Extract API ( Extract Text and Tables and Character Bounding Boxes (w/ Renditions) ) I have used this api into node like below    const params = new ExtractPDFParams({
                            elementsToExtract: [ExtractElementType.TEXT, ExtractElementType.TABLES],
                            addCharInfo: true                
                        });But the JSON which is extracted contains some extra info like added some attributes (boxes into the elements) but if you look into the original pdf then there are no boxes then why those added?

Joel Geraci · Answer

I don't understand your question. Your code indicates that you are using the Extract API but your images show PDF. Extract API does not return PDF, just JSON, tables, and images.

Sign up

To post, reply, or follow discussions, please sign in with your Adobe ID.

Sign in to Adobe Community

To post, reply, or follow discussions, please sign in with your Adobe ID.