Copy link to clipboard
Copied
Good afternoon, I have an urgent query, knowing that a pdf/a file is editable and the OCR feature can be added through software, I wanted to know which subtype of PDF/A already comes with OCR feature. I'm not sure, but I think PDF/A-2u and PDF/A-3u already come with OCR. can you help me?
Buenas tardes, tengo una consulta urgente , sabiendo que un archivo pdf/a es editable y me diante software se le puede añadir la caracteristica OCR, queria saber que subtipo de PDF/A ya viene con caracteristica OCR. No estoy seguro , pero creo que PDF/A-2u y PDF/A-3u ya vienen con OCR.
¿pueden ayudareme?
Copy link to clipboard
Copied
OCR is a function of Adobe Acrobat. It is not a feature of PDF.
Copy link to clipboard
Copied
Even knowing that OCR is a feature of Adobe, I wanted to know if any new pdf/a subtype already incorporates it and how to know in a pdf/a document that it has OCR, in addition to checking it, yes, but what attribute or metadata does it indicate? that the pdf/a has the OCR feature
Copy link to clipboard
Copied
PDF files doesn't have a OCR feature.
They can contain text.
Copy link to clipboard
Copied
Pour savoir si un PDF contient du texte reconnu par OCR il suffit d'utiliser la fonction de recherche, si le texte recherché est trouvé c'est que le PDF contient du texte. Dans le cas contraire (document image seulement) la recherche ne trouve rien.
Que le document soit un PDF/A ou un PDF tout court n'y change rien.
S'il y a beaucoup de PDF à trier on peut aussi utiliser le Contrôle en amont d'Acrobat Pro, il contient un "profil" qui permet de détecter le texte ajouté par l'OCR.
PS : Il ne faut pas poster de message en deux langues car le traducteur automatique ne sait pas quoi traduire.
Copy link to clipboard
Copied
I know and understand your answer, I work with PDF/a and OCR and it's clear to me that it's a feature, but still, we think that some subtype of pdf includes character search, one of the following subtypes:
PDF/A-1b
PDF/A-1a
PDF/A-2b
PDF/A-2a
PDF/A-2u
PDF/A-3b
PDF/A-3a
PDF/A-3u
PDF/UA-1
PDF/A-4e
PDF/A-4f
PDF/E
PDF/X-1
PDF/X-3
PDF/X-4
PDF/X-5
PDF/VT-2
PDF/VT-2 - VT - 2
Copy link to clipboard
Copied
Vous devriez lire tous les articles de cette rubrique (Google Translate est votre ami) :
https://www.abracadabrapdf.net/category/format_pdf/normes_iso_et_pdf/
Copy link to clipboard
Copied
Thanks for the information, I have read and it is very important. I have another question, do all pdf files allow software to convert them to OCR?
Copy link to clipboard
Copied
PDFs can have security added such as only allow people to open with a password or prevent editing. If it is protected, you wouldn't eb able to use the OCR on the file.