Skip to main content
Participating Frequently
July 17, 2009
Question

OCR for numbers in a PDF

  • July 17, 2009
  • 1 reply
  • 910 views

Hi guys

I've got a situation where I need to take numbers from a PDF file which is a previsouly scanned-in copy of an invoice. The PDF will be opened alongside a data entry form for the user to complete the form with information from the invoice. However, what I need to do is offer the user suggestions for some of the form fields (such as Net Total, VAT (Tax), Grand Total etc). I know that for specific suggestions for each field I would need some kind of 'zone' OCR, so if I could only pull out all numbers in the scanned image of the invoice from the PDF, I could offer all numbers as a suggestion in a drop-down.


I am using CFMX7, so I'm looking for a way to do this or some kind of component which will allow me to do this.

All the best


Wes

This topic has been closed for replies.

1 reply

Inspiring
July 20, 2009

I am not sure it if will meet your needs, but you might look into jPedal. IIRC, it has some text extraction capabilities.