Adobe Acrobat DC is distorting pages

Report · Aug 14, 2019

Hello,

I need help; the company I work for uses this program mainly to edit pdfs, but it's been distorting many pages, normally, I'd just redo the work and keep moving, this time the lost was big, this is it did:

I REALLY need a solution for this and I honestly don't know where to look for, I'm already looking into the possibility of using another program because it's been happening way too often.

Grateful,

As of now,

For your attention,

Lucas Jardim Pinheiro

Report · Aug 14, 2019

Hi Lucas,

Wow, I'm not at all certain what's taking place but to help you the following things would be helpful:

What is your operating system (and what build)?

You stated you are using Acrobat DC, I'm assuming that that's the Pro version, what build are you using?

As I look at your sample, I cannot really determine what changed because I do not have a "before" view. Can you please supply that?

And lastly, EXACTLY what are the steps you are taking to get this result.

Thanks,

Report · Aug 14, 2019

Hi, Gary,

Wow indeed , here's a screenshot of what the file looked like

I've been having a lot of small problems with Acrobat not correctly detecting which fonts are been used and/or detecting the images as fonts (and vice-versa), this time it actually disappeared with every character except "o".

I open Adobe Acrobat Pro DC, click on Edit PDF, and start manually translating the file (whilst having some weird Fonts with asterisks next to it whose name have some letters switched), then I move on to the next page (I noticed the program scans per page, so it takes a few seconds) and I proceed to do the same thing while saving in between pages and breaks.

About the OS and Acrobat Version, I'm using Windows 10 Enterprise 2016 LTSB and Adobe Acrobat Pro DC (18.9.20044.251705).

And these are the System Parameters from Adobe Acrobat (which include informations about the computer), unfortunately the Adobe is installed in portuguese, so if there's any relevant information you cannot properly translate, please do question me:

"

Acrobat instalado: C:\Program Files (x86)\Adobe\Acrobat DC\Acrobat\Acrobat.exe

Versão: 18.9.20044.251705

Data da criação:2017/11/04

Hora de criação:17:57:50

Acrobat instalado: C:\Program Files (x86)\Adobe\Acrobat Reader DC\Reader\AcroRd32.exe

Versão: 19.12.20035.332343

Data da criação:2019/06/10

Hora de criação:16:57:56

Aplicativos instalados:

Versão do Office: Office 2016 64-bit

Área de arquivos de página: 4194303 KB

Correio padrão:

Detalhe da conta:

Direitos do usuário: Regular

Controle de conta do usuário: Padrão

Integridade do processo: Média

Tipo de perfil: Nenhum

Detalhe da sessão:

Tipo de inicialização: Normal

está desligando: Não

Rede: Disponível

Dentro do Citrix: Não

Dentro do VMWare: Não

Sessão remota: Não

Controle remoto: Não

Com JAWS: Não

Uso do ZoomText: Não

Com Windows-Eyes: Não

Com NVDA: Não

Detalhe do Acrobat:

Área restrita: Desativar

Leitor cativo: Não

Suporte a vários leitores no desktop: Desativar

Detalhe do Windows:

Tablet: Não

Starter Edition: Não

Media Center Edition: Não

Máquina lenta: Não

Detalhes de vídeo:

Largura da tela: 1920

Altura da tela: 1080

Número de monitores: 2

Número de botões do mouse: 5

Tem roda do mouse: Sim

Tem Pen Windows: Não

Conjunto de caracteres de byte duplo: Não

Tem editor de método de entrada: Sim

Dentro do leitor de tela: Não

Diretório do Windows: C:\Windows

Diretório temporário: C:\Users\0055616\AppData\Local\Temp\

Fabricante do SO: Microsoft Corporation

Fuso horário: Hora oficial do Brasil

Locale: Português (Brasil)

Memória física disponível: 4194303 KB

Memória física total: 4194303 KB

Memória virtual disponível: 3607508 KB

Memória virtual total: 4194176 KB

Monitor:

Nome: Intel(R) UHD Graphics 630

Resolução: 1920 x 1080 x 60

Bits por pixel: 32

Monitor:

Nome: NVIDIA GeForce GTX 1050 Ti

Resolução: 1920 x 1080 x 60

Bits por pixel: 32

Navegador padrão: C:\Program Files\Internet Explorer\iexplore.exe

Versão: 11.00.14393.0 (rs1_release.160715-1616)

Data da criação:2016/07/16

Hora de criação:08:43:06

Nome do sistema: CRLABCONT-301

Nome do SO: Microsoft Windows 10

Nome do usuário: 0055616

Placa gráfica: NVIDIA GeForce GTX 1050 Ti

Versão: 23.21.13.9125

Verificar: Sem suporte

Processador: Intel64 Family 6 Model 158 Stepping 10 GenuineIntel ~3192 Mhz

Versão do BIOS: DELL - 1072009

Versão do SO:10.0.14393

"

Grateful,

As of now,

For your attention,

Lucas Jardim Pinheiro

Report · Aug 14, 2019

Hi Lucas,

OK, the first thing I did when I saw your new notice was to verify that Acrobat Pro DC does work with Protuguês. Good news (at least for me, you probably already knew that), it does.

OK, the next question(s) I have as to where the original scanned documents come from: Who's doing the scanning and how was it done. What is the resolution the scanning is done at? Are you receiving these as images (e.g., JPG, TIF, etc.) or as PDFs?

Have you received other PDFs from this source (even you) that have worked in the past? What changes might have occurred since there were successful OCR operation (e.g., updates to OS, updates to Acrobat Pro DC, updates to your scanning software, change in scanner?

I have to leave in a few minutes and will not return until some 4 hours or so. So if you do not get an immediate response, sorry.

Good luck,

Report · Aug 14, 2019

Hi Gary,

So, first, I'd like to say that I'm already impressed with the speed you've been replying, so no worries about that, I'll actually have some other appointments now and won't be able to respond for a while aswell, so take your time.

I receive the files as PDFs through the company's branch in Brazil and as far as I know the company receives the PDFs (not sure if scanned or not) throught the branch in the United States; I can check with them about the origin of the PDF, if you claim to be important.

If you don't mind me asking, is this uncommon? Because I've been having this and similar problems with nearly every PDF they've sent. And I must admit this is the first time I work with translation through Adobe Acrobat Pro DC; I've used PDF Expert, Adobe Photoshop and Adobe Premiere (to embed some subtitles), in other translation tasks.

Lastly, could this be caused by low-quality PDFs? Could there be a recurring distorting of the words due to the program having constant difficulties in reading the text? Because I've noticed that the more I use the OCR the worst the quality of these words become like it's applying some Bold Print-Like texture on top of the words over and over;

Grateful,

As of now,

For your attention,

Lucas Jardim Pinheiro

Report · Aug 14, 2019

I Lucas,

Wait, are you doing OCR AND trying to translate at the same time? Acrobat cannot translate so if you were trying to do that, how??

Meanwhile, yes, the quality of the scan can make a tremendous difference in the quality of the result. f

A lot of the problems/issues typically revolve around both the size of the text and the resolution of the text. So, a size 18 font will have more success than a size 9 font. In addition, if the document was scanned at 300 ppi, it will have less issues than if it was scanned at 150 ppi. Ideally 600 ppi is best.

The why of this is how clearly Acrobat (or any OCR software) can distinguish edges. One common example of error is when you have a letter combination such as "ri." If the font is too small and/or if the resolution is too low, the OCR software might see that as "n."

However, I gather that the documents are coming into you only as a PDF that has not been OCRed, correct? I would be curious as to what kind of scanner was used and what kind of software converted the scan into a PDF. I'm not all that certain that that will have much of an issue here but I am curious.

Now one other area of potential issue is what setting are you using to do the OCR.

If you go into "Enhance Scans" there is the option to Recognize Text. When you select that, you will see a gear to set your settings.

Note the 2nd dropdown menu, there are three options:

Because I do not know the history of your documents, take one page and try each of these options and see what you get. (total of three results).

Let me know what you end up with.

Report · Aug 15, 2019

Hi Gary,

I've been using the Edit PDF (As illustrated by Image 1) Function to detect the text; as for the translation, I've been doing it manually, re-typing the documents.

I've asked the client about the possibility of better quality scanning and he say he'd see into that and the error ocurred now with another file (as illustrated by Image 2) , it detected most of the letters as being individual images which is taking me a longer time to translate and adjust.

I also did not have a lot of time to look into it but I (quickly looked into it and) did not find the Recongnize Text window that you showed, I'll look more into it tomorrow but if you have any more instructions on how to get to this window in the first place (and beat me to it), please do enlighten me.

Image 1

Image 2

Grateful,

As of now,

For your attention,

Lucas Jardim Pinheiro

Report · Aug 15, 2019

Hi Lucas,

OK, a number of issues here.

One of the Tools is Enhance Scans. If it's already on the right hand side it'l look like this:

If you have to go into the Tools window, it looks like this:

If you want it on the right hand side, just drag it there. Done!

This is the Tool you want to use to do the OCR-ing. It is from there you'll find the selection of ways to do things that I showed above. I do want you to test each of them to determine which gives you the best results. Since I do not know the origin of the original PDFs, I'd really suggest trying all three and you decide which gives you the best result.

The "fat" "els" (L) is a bug I haven't seen in years (seen in image 2). Let's see if working from the Enhance Tool and selecting a better process solves your other problems and this before we worry about this one for right now.

Let us know,

Report · Aug 16, 2019

Hi Gary,

I tried to enhance scans with all the options but nothing seemed to change in the second and third option, the message (illustrated by Image 1) happened and a while later the pages turned black and what it was already understanding as text turned white (it still didn't recognize any new text); the first option was really fast and had no change whatsoever.

I've asked the client about the PDFs source and am waiting for his answer.

You mentioned you haven't seen this bug in years, do you recall why it used to happen in the first place? And also, would it be helpful if you could have access to one of the PDFs I'm working on? Maybe analysing it could help us figure this out quicker

Image 1

Grateful,

As of now,

For your attention

Lucas Jardim Pinheiro

Report · Aug 19, 2019

Hey Gary,

I just had the same error again (as illustrated by Image 1), I was manually translating these PDFs through the Edit Page feature and as I was editing the text box above (in page 6) and the text box in page 7 just turned white, I tried to change the font, color, size but nothing works, I managed to save the text to a Notepad and it is still there, it just doesn't show, I'll redo it but I would like to know if this could be related to the other bugs I've been experiencing.

Image 1

Grateful,

As of now,

For your attention

Lucas Jardim Pinheiro