Extract and Compare text between two PDFs
Hi,
I need a tool / JavaScript to extract and compare the text between two pdfs or in plain simple language, I want to find out the missing text.
SourceCopy.pdf – contains the original/source text that should be available in the FinalArtwork.pdf
FinalArtwork.pdf – The final PDF that should hold all the copy that is available in the SourceCopy.pdf
The source and final might contain the same text in multiple places. For example, '10 years' might be available thrice in the SourceCopy.pdf, so it should find three instances in the FinalArtwork.pdf.
So, the script should create a new text file on the desktop containing the missing text. If nothing is missing, then the text file should say, 'Nothing is missing, good work!'
On comparing the files manully, I figured out that only line of text is missing in the FinalArtwork.pdf i.e.
Missing Lines:
N/A from Data-File.ai
Comparison complete!
Can you please help me on this. Thanks in advance.
