Using the extract, not getting some text that is in the margin of the page.

Report · Jul 23, 2021

Just for reference, on page 31 at the bottom is some text "Reference ID: 3610837" It is not in the json output from the extraction API. I have attached the original PDF as well as the json output of your tool.

Report · Jul 23, 2021

I attached a Greenshot image capture of the text I'm referring to.

Report · Jul 23, 2021

I think Extract API is interpreting that area as a footer and ignoring it. Unfortunately, there is no setting to force it to not do that.

Report · Jul 24, 2021

Not that it's a showstopper for us, but if that is the confirmed reason that the text is not being extracted from the footer, is there a chance in a future sprint/improvement cycle of the tool that a config option can be added to broaden the text search to the entire page?

Report · Jul 26, 2021

Great minds... I've already submitted that as a feature request. It'll be important for documents that have been bates numbered too.

Report · Mar 23, 2022

I face the same problem, I have headers and footers with usefull information in it, but I am not able to use this information..

I hope they will integrate this feature

Report · Jun 27, 2022

Hello Joel, Any update on this feature update? We want to use this feature to extract account numbers on the bank statements. Whithout this feature these API's are not going to useful for us. If you can recommend some solution or resource it is going to be very helpful.

Report · Apr 17, 2023

Same issue here, a lot of the good stuff is in the headers and footers. We're in 2023 now, any idea when getting them would be possible?