PDF TO JSON CONVERSION USING NODE JS
Copy link to clipboard
Copied
Below is the source code which is given by adobe ,but im notgetting expext output any thing i missed
ACTUAL INPUT
============
IM NOT GETTING EXPECTED OUTPUT
OUTPUT:
==========
{"pages":[{"page_number":0,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":1,"only_images":false,"has_text":true,"has_images":true,"is_empty":false},"height":841.9199829101562},{"page_number":1,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":0,"only_images":false,"has_text":true,"has_images":false,"is_empty":false},"height":841.9199829101562}],"document":{"is_linearized":false,"pdfe_compliance_level":"","is_tagged":false,"is_portfolio":false,"is_certified":false,"is_encrypted":false,"info_dict":{"CreationDate":"D:20211124101405Z","Producer":"PDFium","Creator":"PDFium"},"is_FTPDF":false,"pdf_version":"1.7","has_acroform":false,"file_size":"112.94 KB","is_signed":false,"incremental_save_count":0,"has_embedded_files":false,"is_XFA":false,"fonts":[{"name":"BCDEEE+Calibri","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDFEE+Calibri-Bold","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDGEE+Calibri-Bold","font_type":"CIDFontType2","family_name":"Calibri"}],"pdfa_compliance_level":"","pdfvt_compliance_level":"","pdfx_compliance_level":"","pdfua_compliance_level":"","XMP":"","page_count":2}}
Copy link to clipboard
Copied
That looks like output from the Properties API and is exactly what should be expected. Based on your title though, I think you might be interested in the Extract API which extracts the text from a PDF as JSON.

