Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Can I extract pdf by bullet as whole paragraph?

Community Beginner ,
Jan 24, 2023 Jan 24, 2023

Good day,

Is there's a property to set so that I can get the whole bulleted content as paragraph in json? Currently json returned are parsed by label and label body.
Pdf Extract json result:

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbl",
"text": "2) ",

 

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbody",
"text": "cancellation during extended term.

 

Expectation:

"path": "//document/l[6]/li[3]/lbody/l/li[2]/lbody/l/li[2]/lbody",
"text": "2) cancellation during extended term.

 

Thanks.

TOPICS
PDF Extract API , PDF Services API
408
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 26, 2023 Jan 26, 2023

Is there's a property to set so that I can get the whole bulleted content as paragraph in json?

 

Not at this time but the JSON is pretty basic, it'd be easy enough to post-process into a paragraph.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jan 29, 2023 Jan 29, 2023
LATEST

Thank you. I'll just parse it using the returned json. Thanks again.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Resources