Question
Issue with parsing Large PDFs, and conveting JSON to consistently applied html
I'm having issue converting a large PDF, 100+ pages, with images and complex tables (attached PDF and converted html)
Some paragraphs lead with numbers and they are being assigned in separate divs, and are overlapping the paragraphbtext they are assigned to.
There are also large white spaces where headers/footer/page breaks are
Any help appreciated
