Cannot detect headers and footers
I have a document with text and page numbers that should be in the headers and footers. However, when I open the PDF in Adobe Acrobat Pro (version 2022.003.20263) and try to edit it, the text is not recognized as a header or footer. An example of the expected footer text is below.
Here is the message I get when I attempt to update:
The message is clear. But is there any way I can get the headers and footers to be detected? Unfortunately, I cannot share the document because it contains sensitive information. Any help is much appreciated.
I can provide a little more information. I am working on an NLP project. Currently the NLP model is picking up a lot of false positives in the headers and footers, so I need a way to exclude the header and footer text. But since that text in the PDF is not being detected as a header or footer, I cannot find a simple way to eliminate these false positives. Thanks!!
"But is there any way I can get the headers and footers to be detected?"
Yes, but only when the header and footer were added with Adobe Acrobat.
document that already has text which should be detected as headers and
footers. I cannot go page by page and take the text and manually enter it.
--
Jonathan Nolan
Director, Product Management
1.267.615.8214
565 E. Swedesford Road • Suite 205 • Wayne, PA 1908
rlsciences.com
With the redaction tool you can remove text at top and bottom of the pages.
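If the goal is only to keep this text away from an NLP pipeline, a similar effect can be approximated after text extraction, without changing the PDF at all. The sketch below is not an Acrobat feature. It assumes the page text has already been extracted into one list of lines per page (by whatever tool you use), and it treats a line as a header or footer when it recurs near the top or bottom of most pages once digits are masked. The function names and the `depth` and `min_share` parameters are hypothetical and would need tuning for a real document.

```python
import re
from collections import Counter


def _normalize(line):
    # Mask digit runs so "Page 3 of 10" and "Page 4 of 10" compare as equal.
    return re.sub(r"\d+", "#", line.strip()).lower()


def find_repeated_edges(pages, depth=2, min_share=0.6):
    """Return normalized lines that recur in the first/last `depth`
    non-empty lines of at least `min_share` of the pages."""
    counts = Counter()
    for lines in pages:
        non_empty = [l for l in lines if l.strip()]
        edge = non_empty[:depth] + non_empty[-depth:]
        # Use a set so a line is counted at most once per page.
        counts.update({_normalize(l) for l in edge})
    threshold = max(2, min_share * len(pages))
    return {text for text, n in counts.items() if n >= threshold}


def strip_headers_footers(pages, depth=2, min_share=0.6):
    """Drop edge lines that look like repeated headers or footers."""
    repeated = find_repeated_edges(pages, depth, min_share)
    cleaned = []
    for lines in pages:
        idx = [i for i, l in enumerate(lines) if l.strip()]
        edge_idx = set(idx[:depth] + idx[-depth:])
        cleaned.append([
            l for i, l in enumerate(lines)
            if not (i in edge_idx and _normalize(l) in repeated)
        ])
    return cleaned


# Illustrative input: each page is a list of already-extracted text lines.
pages = [
    ["Confidential Report", "Body text about page one.", "More body.", "Page 1 of 3"],
    ["Confidential Report", "Body text about page two.", "Even more.", "Page 2 of 3"],
    ["Confidential Report", "Final body text.", "The end.", "Page 3 of 3"],
]
for page in strip_headers_footers(pages):
    print(page)
```

Because only lines at the page edges are candidates, a phrase that repeats inside the body text is left alone. The detected set from `find_repeated_edges` could also be kept as labels if you would rather train the model to ignore headers and footers than delete them.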
just don't want NLP to view this information which tends to lead to false
positives. In an ideal world, I could detect the information as headers and
footers, then train my model to ignore headers and footers. Thanks again!

