Participating Frequently
January 25, 2023
Question

Can I extract a PDF bullet as a whole paragraph?


Good day,

Is there a property I can set so that the whole bulleted item is returned as a single paragraph in the JSON? Currently the returned JSON splits each list item into a label and a label body.
PDF Extract JSON result:

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbl",
"text": "2) ",

 

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbody",
"text": "cancellation during extended term.

 

Expectation:

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbody",
"text": "2) cancellation during extended term.

 

Thanks.


1 reply

Joel Geraci
Community Expert
January 26, 2023

Is there a property I can set so that the whole bulleted item is returned as a single paragraph in the JSON?

 

Not at this time, but the JSON is pretty basic; it'd be easy enough to post-process into a paragraph.
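
A minimal sketch of what that post-processing could look like, assuming the extracted elements are available as a list of dicts with the "path" and "text" keys shown in the question. The function name merge_list_items is hypothetical and not part of the PDF Extract API, and the sketch only pairs a label with the body that immediately follows it under the same list item; bodies that hold their text in child elements are left untouched.

```python
def merge_list_items(elements):
    """Prepend each list label (.../lbl) to the body (.../lbody) that follows it.

    Assumption: `elements` is a list of dicts with "path" and "text" keys,
    in document order, as in the JSON excerpt from the question.
    """
    merged = []
    pending_label = None  # most recent label not yet paired with a body

    for el in elements:
        path = el.get("path", "").lower()

        if path.endswith("/lbl"):
            if pending_label is not None:
                merged.append(pending_label)  # earlier label had no body; keep it
            pending_label = el
            continue

        if pending_label is not None:
            # The body belonging to ".../li[2]/lbl" is ".../li[2]/lbody".
            expected = pending_label["path"].lower()[: -len("lbl")] + "lbody"
            if path == expected:
                combined = dict(el)
                combined["text"] = pending_label.get("text", "") + el.get("text", "")
                merged.append(combined)
                pending_label = None
                continue
            merged.append(pending_label)  # no matching body; keep the label as-is
            pending_label = None

        merged.append(el)

    if pending_label is not None:
        merged.append(pending_label)
    return merged


# Usage with the two elements from the question.
sample = [
    {"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbl",
     "text": "2) "},
    {"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbody",
     "text": "cancellation during extended term."},
]
for item in merge_list_items(sample):
    print(item["path"], "->", item["text"])
```

The merged element keeps the lbody path, which matches the expectation in the question. Elements that are neither labels nor matching bodies pass through unchanged.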

Participating Frequently
January 30, 2023

Thank you. I'll just post-process the returned JSON. Thanks again.