I'm having trouble extracting text from PDF files. These files are tests that I am using for my master's research, and I really need to extract their text. However, when I copy the text out of a PDF, the characters come out garbled, and the tool I use (PDFMiner) outputs many markers such as "(cid:12)". Can you help me, or put me in contact with someone who can? I really need this, because the problem is delaying my research.
See the tests at the link, specifically the ones from 2017. If you copy their text, strange characters appear, and PDFMiner shows "(cid:N)" markers instead of readable text. I would like to know whether the text of these 2017 tests can be copied at all. I did not create these files; I am only extracting their text.
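
To make the symptom easier to reproduce, here is a minimal sketch of how the "(cid:N)" markers could be counted in the extracted text. It assumes the pdfminer.six package is installed and uses its `extract_text` function; the helper names and the file path passed on the command line are hypothetical, chosen only for illustration.

```python
import re
import sys

# PDFMiner writes "(cid:N)" when it cannot map a glyph to a Unicode character.
CID_MARKER = re.compile(r"\(cid:\d+\)")


def count_cid_markers(text: str) -> int:
    """Return how many "(cid:N)" markers appear in the extracted text."""
    return len(CID_MARKER.findall(text))


def strip_cid_markers(text: str) -> str:
    """Return the text with every "(cid:N)" marker removed."""
    return CID_MARKER.sub("", text)


def extract_pdf_text(path: str) -> str:
    """Extract text with pdfminer.six (assumed installed: pip install pdfminer.six)."""
    from pdfminer.high_level import extract_text  # imported lazily on purpose
    return extract_text(path)


if __name__ == "__main__":
    if len(sys.argv) != 2:
        sys.exit("usage: python check_cid.py <file.pdf>")  # hypothetical script name
    text = extract_pdf_text(sys.argv[1])
    readable = strip_cid_markers(text)
    print("cid markers found:", count_cid_markers(text))
    print("readable characters left:", len(readable.strip()))
```

If most of the output consists of markers, the likely cause is that the PDF's embedded fonts lack a usable mapping from glyphs to Unicode, in which case copying will probably fail in any viewer and OCR on the rendered pages may be the only way to recover the text.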