Skip to main content
Participant
July 18, 2023
Question

OCR , help !!!!

  • July 18, 2023
  • 3 replies
  • 925 views

Good afternoon, I have an urgent query, knowing that a pdf/a file is editable and the OCR feature can be added through software, I wanted to know which subtype of PDF/A already comes with OCR feature. I'm not sure, but I think PDF/A-2u and PDF/A-3u already come with OCR. can you help me?

 

Buenas tardes, tengo una consulta urgente , sabiendo que un archivo pdf/a es editable y me diante software se le puede añadir la caracteristica OCR, queria saber que subtipo de PDF/A ya viene con caracteristica OCR. No estoy seguro , pero creo que PDF/A-2u y PDF/A-3u ya vienen con OCR.
¿pueden ayudareme?

This topic has been closed for replies.

3 replies

JR Boulay
Community Expert
Community Expert
July 19, 2023

Vous devriez lire tous les articles de cette rubrique (Google Translate est votre ami) :

https://www.abracadabrapdf.net/category/format_pdf/normes_iso_et_pdf/

Acrobate du PDF, InDesigner et Photoshopographe
Participant
July 19, 2023

Thanks for the information, I have read and it is very important. I have another question, do all pdf files allow software to convert them to OCR?

Community Expert
July 20, 2023

PDFs can have security added such as only allow people to open with a password or prevent editing. If it is protected, you wouldn't eb able to use the OCR on the file.

JR Boulay
Community Expert
Community Expert
July 19, 2023

Pour savoir si un PDF contient du texte reconnu par OCR il suffit d'utiliser la fonction de recherche, si le texte recherché est trouvé c'est que le PDF contient du texte. Dans le cas contraire (document image seulement) la recherche ne trouve rien.

Que le document soit un PDF/A ou un PDF tout court n'y change rien.

 

S'il y a beaucoup de PDF à trier on peut aussi utiliser le Contrôle en amont d'Acrobat Pro, il contient un "profil" qui permet de détecter le texte ajouté par l'OCR.

 

PS : Il ne faut pas poster de message en deux langues car le traducteur automatique ne sait pas quoi traduire.

 

 

Acrobate du PDF, InDesigner et Photoshopographe
Participant
July 19, 2023

I know and understand your answer, I work with PDF/a and OCR and it's clear to me that it's a feature, but still, we think that some subtype of pdf includes character search, one of the following subtypes:
PDF/A-1b
PDF/A-1a
PDF/A-2b
PDF/A-2a
PDF/A-2u
PDF/A-3b
PDF/A-3a
PDF/A-3u
PDF/UA-1
PDF/A-4e
PDF/A-4f
PDF/E
PDF/X-1
PDF/X-3
PDF/X-4
PDF/X-5
PDF/VT-2
PDF/VT-2 - VT - 2

Bernd Alheit
Community Expert
Community Expert
July 18, 2023

OCR is a function of Adobe Acrobat. It is not a feature of PDF.

Participant
July 19, 2023

Even knowing that OCR is a feature of Adobe, I wanted to know if any new pdf/a subtype already incorporates it and how to know in a pdf/a document that it has OCR, in addition to checking it, yes, but what attribute or metadata does it indicate? that the pdf/a has the OCR feature

 

 

Bernd Alheit
Community Expert
Community Expert
July 19, 2023

PDF files doesn't have a OCR feature.

They can contain text.