Copy link to clipboard
Copied
We are wanting to parse a PDF email attachment, extracting specific text and putting it into a SQL Server database.
I'm using Adobe Services "Extract PDF Structure in JSON File" in a Microsoft Power Automate Flow. The output is outputs('Extract_PDF_Structure_in_a_JSON_File')?['body/jsonFileContent'], which is unreadable. How do I convert that to something I can work with?
Also, I was able to output that to a JSON file. I don't know much about JSON. When I open the file in Notepad++ I see there are four sections, "pages", "extended_metadata", "elements", and "version". I only need the data in the "elements" section, and that section is a JSON array.
In the "elements" array the fields I need are all called "Text":
Any suggestions how to get this data so I can put it into a database? Thanks!
Here's my flow:
And the run:
1 Correct answer
I have figured this out.
I have a Compose action to convert the Extract output - base64ToString(output).
A second Compose to convert that to JSON.
Then a select to get the 'elements' values I need.
Copy link to clipboard
Copied
I have figured this out.
I have a Compose action to convert the Extract output - base64ToString(output).
A second Compose to convert that to JSON.
Then a select to get the 'elements' values I need.
Copy link to clipboard
Copied
Long shot here...But would you be able to show your whole flow? I'm stuck where you were but not following your compose actions. I get expression not valid.
Copy link to clipboard
Copied
Glad here you found the solution. The other option is you can use Extract PDF Structure in a JSON Object, which will return in the body the JSON that you can then use a Parse JSON action to use in the rest of your flow.
Copy link to clipboard
Copied
I have a similar flow - using the Extract to JSON (both object and tried with file) but when I inspected the contents of the JSON text, I do not see any values that were supplied to the fillable PDF. Searching for things known in the PDF (an amount field, name field, signature printed field) and nothing seems to be getting extracted. Any thoughts on why this is?
Copy link to clipboard
Copied
Same as of today. Elements returns null after parsing JSON using the Adobe extract schema.

