PDF TO JSON CONVERSION USING NODE.JS

Question

Below is the source code provided by Adobe, but I'm not getting the expected output. Is there anything I missed?

ACTUAL INPUT

============

I'M NOT GETTING THE EXPECTED OUTPUT

OUTPUT:

==========

{"pages":[{"page_number":0,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":1,"only_images":false,"has_text":true,"has_images":true,"is_empty":false},"height":841.9199829101562},{"page_number":1,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":0,"only_images":false,"has_text":true,"has_images":false,"is_empty":false},"height":841.9199829101562}],"document":{"is_linearized":false,"pdfe_compliance_level":"","is_tagged":false,"is_portfolio":false,"is_certified":false,"is_encrypted":false,"info_dict":{"CreationDate":"D:20211124101405Z","Producer":"PDFium","Creator":"PDFium"},"is_FTPDF":false,"pdf_version":"1.7","has_acroform":false,"file_size":"112.94 KB","is_signed":false,"incremental_save_count":0,"has_embedded_files":false,"is_XFA":false,"fonts":[{"name":"BCDEEE+Calibri","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDFEE+Calibri-Bold","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDGEE+Calibri-Bold","font_type":"CIDFontType2","family_name":"Calibri"}],"pdfa_compliance_level":"","pdfvt_compliance_level":"","pdfx_compliance_level":"","pdfua_compliance_level":"","XMP":"","page_count":2}}

Joel Geraci · Answer

That looks like output from the Properties API, and it is exactly what should be expected from that API. Based on your title, though, I think you might be interested in the Extract API, which extracts the text from a PDF as JSON.
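
To make the difference concrete, here is a minimal Node.js sketch that summarizes a Properties-style result like the one posted above. It does not call any Adobe SDK. The helper name `summarizeProperties` and the trimmed `sample` object are made up for illustration, and the check for a top-level `elements` array is only an assumption about what Extract-style output looks like, so verify it against the Extract API documentation.

```javascript
// Hypothetical helper (not part of any Adobe SDK): summarizes a
// Properties-style JSON result such as the one shown in the question.
function summarizeProperties(result) {
  const pages = Array.isArray(result.pages) ? result.pages : [];
  const doc = result.document || {};
  return {
    // Prefer the document-level count, fall back to the pages array length.
    pageCount: doc.page_count ?? pages.length,
    // Pages flagged as containing text (a flag only, not the text itself).
    pagesWithText: pages.filter((p) => p.content && p.content.has_text).length,
    // Total number of images reported across all pages.
    imageCount: pages.reduce(
      (n, p) => n + ((p.content && p.content.number_of_images) || 0),
      0
    ),
    // Unique font family names listed in the document metadata.
    fontFamilies: [...new Set((doc.fonts || []).map((f) => f.family_name))],
    // ASSUMPTION: Extract-style output carries a top-level "elements" array.
    containsExtractedText: Array.isArray(result.elements),
  };
}

// Trimmed copy of the output posted in the question.
const sample = {
  pages: [
    { page_number: 0, content: { number_of_images: 1, has_text: true } },
    { page_number: 1, content: { number_of_images: 0, has_text: true } },
  ],
  document: {
    page_count: 2,
    fonts: [
      { name: "BCDEEE+Calibri", font_type: "TrueType", family_name: "Calibri" },
      { name: "BCDFEE+Calibri-Bold", font_type: "TrueType", family_name: "Calibri" },
      { name: "BCDGEE+Calibri-Bold", font_type: "CIDFontType2", family_name: "Calibri" },
    ],
  },
};

const summary = summarizeProperties(sample);
console.log(summary);
```

The summary only reports metadata (page count, text and image flags, fonts). None of the PDF's actual text is present in this kind of result, which is why a PDF-to-JSON text conversion needs the Extract API rather than the Properties API.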
