
Adobe Embed API - Search across multiple words / lines not working properly for many PDFs

Nov 07, 2022

I am experiencing the same issues with the Adobe Embed API search as reported here -> https://community.adobe.com/t5/document-services-apis-discussions/can-t-search-phrase-in-adobe-embed...

 

To make sure this issue is not caused by my implementation of the search API, I used the sample Adobe Embed API for Angular from GitHub ( https://github.com/adobe/pdf-embed-api-samples/tree/master/More%20Samples/Angular%20Samples ), simply opened the document, and used the basic search built into the viewer.

 

What I have found is that a number of the PDFs we try to search return no matches when the search term contains multiple words. If the text to match is in the middle of a line, you WILL get a match if you remove the spaces between the words. If the words to match wrap onto a second line, you will only get a match if you keep the space.

 

For example, in this sample the text in the PDF 'our mission' is only matched if the search term is 'ourmission'.

[Image: Brian26152863ljie_0-1667846754179.png]

[Image: Brian26152863ljie_1-1667846833572.png]

And when searching across text that wraps across multiple lines, the space is required:

[Image: Brian26152863ljie_2-1667846917154.png]
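
One possible stopgap, given the behaviour described above, is to try every variant of the phrase with each space either kept or removed, since the line wrapping inside a given PDF is not known in advance. The following is only a minimal TypeScript sketch of that idea. `searchTermVariants` and its `maxWords` cap are hypothetical names, not part of the PDF Embed API, and the approach is an assumption based on the symptoms in this post, not a confirmed fix.

```typescript
// Hypothetical helper (not part of the Adobe PDF Embed API).
// Builds every variant of a multi-word search term in which each
// space between words is either kept or removed.
// Assumption: the viewer matches a phrase inside one line only without
// spaces, and a phrase that wraps across lines only with the space.
function searchTermVariants(term: string, maxWords: number = 8): string[] {
  const words = term.trim().split(/\s+/).filter((w) => w.length > 0);
  if (words.length === 0) {
    return [];
  }
  // The number of variants doubles with every extra word, so long
  // phrases fall back to the term as typed (with single spaces).
  if (words.length > maxWords) {
    return [words.join(" ")];
  }
  const gaps = words.length - 1;
  const variants: string[] = [];
  // Each bit of `mask` decides whether the space in one gap is kept.
  // Counting down puts the term as typed first and the fully
  // space-free variant last.
  for (let mask = (1 << gaps) - 1; mask >= 0; mask--) {
    let variant = words[0];
    for (let i = 0; i < gaps; i++) {
      const keepSpace = (mask & (1 << i)) !== 0;
      variant += (keepSpace ? " " : "") + words[i + 1];
    }
    variants.push(variant);
  }
  return variants;
}
```

For the example above, `searchTermVariants("our mission")` yields `"our mission"` and `"ourmission"`. Each variant could then be passed to the viewer search in turn until one returns results.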

 

This affects a number of the PDFs I am testing with, though not all. Here is a second PDF where the search term will only match if the spaces are removed.

[Image: Brian26152863ljie_3-1667847012707.png]

 

And here is a sample of a PDF where the search works as expected:

[Image: Brian26152863ljie_4-1667847074241.png]

 

I have included copies of the PDFs that are failing as well as one that is working.

 

The same files can be opened and searched with other PDF viewers, such as pdf.js, with no issues.

 

Thank you in advance for any help and comments.

 

 

 
