Skip to main content
Participant
November 29, 2021
Question

PDF TO JSON CONVERSION USING NODE JS

  • November 29, 2021
  • 1 reply
  • 418 views

Below is the source code which is given by adobe ,but im notgetting expext output any thing i missed

ACTUAL INPUT

============

IM NOT GETTING EXPECTED OUTPUT

 

OUTPUT:

==========

{"pages":[{"page_number":0,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":1,"only_images":false,"has_text":true,"has_images":true,"is_empty":false},"height":841.9199829101562},{"page_number":1,"is_scanned":false,"width":595.3200073242188,"has_structure":true,"content":{"number_of_images":0,"only_images":false,"has_text":true,"has_images":false,"is_empty":false},"height":841.9199829101562}],"document":{"is_linearized":false,"pdfe_compliance_level":"","is_tagged":false,"is_portfolio":false,"is_certified":false,"is_encrypted":false,"info_dict":{"CreationDate":"D:20211124101405Z","Producer":"PDFium","Creator":"PDFium"},"is_FTPDF":false,"pdf_version":"1.7","has_acroform":false,"file_size":"112.94 KB","is_signed":false,"incremental_save_count":0,"has_embedded_files":false,"is_XFA":false,"fonts":[{"name":"BCDEEE+Calibri","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDFEE+Calibri-Bold","font_type":"TrueType","family_name":"Calibri"},{"name":"BCDGEE+Calibri-Bold","font_type":"CIDFontType2","family_name":"Calibri"}],"pdfa_compliance_level":"","pdfvt_compliance_level":"","pdfx_compliance_level":"","pdfua_compliance_level":"","XMP":"","page_count":2}}

    This topic has been closed for replies.

    1 reply

    Joel Geraci
    Community Expert
    Community Expert
    November 30, 2021

    That looks like output from the Properties API and is exactly what should be expected. Based on your title though, I think you might be interested in the Extract API which extracts the text from a PDF as JSON.