Skip to main content
New Participant
June 1, 2023
Question

OCR and Redact: SearchRedactPatterns.xml and RTI codes for Australia

  • June 1, 2023
  • 1 reply
  • 1437 views

OK - so Adobe DC Pro doesn't currently have language packs for very many countries e.g. not for Australia. Nor does it have Redaction search patterns for Australian formats such as bank BSB & account, TFN, phone numbers etc. Nor does it have a set of redaction codes applicable to Australian Privacy Laws and Right to Information requests.

 

I'm aware it can be done by editing SearchRedactPatterns.xml and I'm guessing editing the std U.S.FOIA.xlm (or maybe create a new one ???) in:

C:\Users\<NAME>\AppData\Roaming\Adobe\Acrobat\DC\Redaction\ENU

 

I'm not a guru at editing xml and REGEX standards etc, so I'm wondering if anyone been able to find (or create) such sets and is will to share?

 

Thanks in advance

This topic has been closed for replies.

1 reply

try67
Community Expert
June 1, 2023

You should edit SearchRedactPatterns.xml, not any of the other files XML.

If you tell us what's the pattern used for these codes we might be able to help with the regexp's.

If you're looking for a more complete solution it might be a better idea to hire a developer to do it for you.

try67
Community Expert
June 1, 2023

PS. This has nothing to do with OCR.

try67
Community Expert
June 1, 2023

Thanks for your constructive feedback.

However - I was advised by Adobe Support that because there is no EN-AU (Australia) language pack (which IS OCR related), there was therefore no AU specific set of Redaction Search Patterns. I think this is a bit of a fob off as they're not specifcally related. I was advised to lodge a request supporting the development of an EN-AU language pack !

Whilst yes an EN-AU language would be great - my burning issue is specifically redaction related. I was thinking that surely other Australian organisations have hit a similar issue and may be able to assist.

Cheers

Dion

Oh ... and there wasn't any 'Redact' option to select from 'Topics'


I also think it's a fob off. The two are not related, as far as I know. I'm not aware of any other locale-specific redaction patterns other than US ones, even if that language is supported for OCR. Also, language!=country. You can have support for Arabic OCR, say, but each Arabic-speaking country will have its own set of codes, so this doesn't really make sense anyway.

You can just create your own redaction patterns yourself, if no one has does so thus far.