• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
14

Bug in search for subscripted text with a small font size

Community Beginner ,
Oct 30, 2023 Oct 30, 2023

Copy link to clipboard

Copied

Hi,

 

using Acrobat Reader (23.006.20360) on Windows 10 but the same issue was encountered in the pro version.

 

The attached PDF has the word Foobar with the bar being in subscript and hence also smaller in fontsize. The second table is scaled to 70% which brings down the fontsize of "bar" to <4pt. A search for "foobar" without any quotes etc. will only bring up the one in the first table. It's broken for the one in the second table. Selecting and copy-pasting either instance of foobar works just fine.

TOPICS
View PDF

Views

169

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 30, 2023 Oct 30, 2023

Copy link to clipboard

Copied

The issue lies with how the file was created. In the second instance you'll see that the two parts are separate.

If you double-click the word "Foo" it will only select the first two letters of it. If you do the same on the first word it will select all of it. That means the former is composed out of two parts, and the latter out of one.

The solution needs to come from the application that generated the PDF, something called "Antenna House PDF Output Library".

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Oct 31, 2023 Oct 31, 2023

Copy link to clipboard

Copied

Thanks for your response! I agree that looks odd. When looking at the textbox in Adobe Pro it does show up as a single Object though, see attached. Also interesting is, that despite the fact it shows "Fo" "o" "bar" as three different things in the reader when double clicking parts of it. It still finds "foo" when searching for it.

Extracting the objects with pdfminer shows a single text object for both instances, not sure how else to check what's a different object and what not. Testing this on the internal pdf readers from Firefox and Edge as well as PDF X change is able to find both instances without issues. Are you sure the double click is not just another bug in how Acrobat reader handles this document?

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 31, 2023 Oct 31, 2023

Copy link to clipboard

Copied

The double-click method is actually a better indication than the Edit Text & Images tool, as that uses all kinds of internal algorithms that try to section the file's contents into editable sections, and doesn't necessarily represent the internal structure of the file.

Analyzing this kind of issue can be quite complicated, but if you believe it's a bug then report it to Adobe, by all means: http://www.adobe.com/products/wishform.html

 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Oct 31, 2023 Oct 31, 2023

Copy link to clipboard

Copied

Ok, have created it over there. Looking at existing tickets I doubt they will care 😞

https://acrobat.uservoice.com/forums/926812-acrobat-reader-for-windows-and-mac/suggestions/46804948-...

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 31, 2023 Oct 31, 2023

Copy link to clipboard

Copied

LATEST

Unfortunately, I have to agree. It's a very bad platform and there's almost no feedback on any reports.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines