Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
0

Cannot detect headers and footers

New Here ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

I have a document that has what should be some text and page numbers in the headers and footers. However, when I open the PDF in Adobe Acorbat Pro Version 2022.003.20263 and attempt to edit, I see the text is not recognized as a header or footer. Example of the expected footer text below.

Jonathan26806110k8yc_0-1666796541321.pngexpand image

 

Here is the message I get when I attempt to update:

Jonathan26806110k8yc_1-1666796754055.pngexpand image

The message is clear. But is there anyway I can get the headers and footers to be detected? Unfortunatley I cannot share the document because it contains sensitive information. Any help is much appreciated.  

 

I can provide a little more information. I am working on an NLP project. Currently the NLP is picking up a lot of false poisitives in headers and footers. Therefore, I need a way to discount the header and footer information. But since the text in the PDF is not being detected as a header and footer I cannot find a simple solution to eliminate this false positives. Thanks!!

 

TOPICS
Edit and convert PDFs , How to , Scan documents and OCR

Views

3.0K
Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

"But is there anyway I can get the headers and footers to be detected?"

 

Yes, when you add header and footer with Adobe Acrobat.

Votes

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

Thanks, but unfortunately this does not help. This is a 2,000+ page
document that already has text which should be detected as headers and
footers. I cannot go page by page and take the text and manually enter it.

--
[image: @real_life_Sciences_logo] <>
Jonathan Nolan
Director, Product Management
1.267.615.8214
565 E. Swedesford Road • Suite 205 • Wayne, PA 1908
rlsciences.com <>
[image: @linkedin] <>

Votes

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

With the redaction tool you can remove text at top and bottom of the pages.

Votes

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

LATEST
That's a good suggestion, but I still need the content to be visible. I
just don't want NLP to view this information which tends to lead to false
positives. In an ideal world, I could detect the information as headers and
footers, then train my model to ignore headers and footers. Thanks again!

--
[image: @real_life_Sciences_logo] <>
Jonathan Nolan
Director, Product Management
1.267.615.8214
565 E. Swedesford Road • Suite 205 • Wayne, PA 1908
rlsciences.com <>
[image: @linkedin] <>

Votes

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines