Skip to main content
Participant
December 17, 2020
Question

Text copied from some PDFs has all spaces stripped

  • December 17, 2020
  • 2 replies
  • 1319 views

In many of the PDFs my users are working with, copying text out of the PDF loses all whitespace, with the words all concatenated together. This makes the PDF Embed API unusable, and they're having to fall back to the browser's native PDF viewer.

 

In the paper available here (open access, PDF also attached), copying the caption for Table 1 in the Chrome PDF viewer gives "The acylcarnitine profiles in dried blood spots detected by tandem mass spectrometry", but the PDF Embed API gives "Theacylcarnitineprofilesindriedbloodspotsdetectedbytandemmassspectrometry" instead, which is not useful for my purposes. This is particularly common with table and figure captions.

    This topic has been closed for replies.

    2 replies

    Participant
    May 19, 2024

    This issue still has not been fixed. When I try to search for the word using ctrl + F, the words have spaces. But when I copy the text, all the spaces are gone. This can be quite problematic for the user. 

    This happens in only some pdfs and not all. 

     

    Please provide an update. 

    Raymond Camden
    Community Manager
    Community Manager
    May 20, 2024

    I can confirm the issue still exists. I will search to see if this was properly logged, and if not, will log it. 

    Participant
    May 21, 2024

    Hi Raymond, 

    Thank you for replying. 

    Thanks to you and Joel, I understand why this issue occurs now.

    But is there a timeline for fixing issues such as this?

    If not, do you know of any workarounds that can be implemented on the developers' end, such as pre-processing the PDF and merging boxes together? 

     

    Thanks again and I look forward to your response,

    Sina

    Adobe Employee
    June 11, 2021

    Sorry for the inconvenience. We have noted this down and will let you know when it gets resolved. Stay tuned!