Copy link to clipboard
Copied
I have a 120 MB file (PDF). Contents in that file especially from non English. It includes almost Indian languages. Its a PDF file so i opened from acrobat reader. And copied the contents by 'select All' then right click 'copy' or 'ctrl c' options/any. 120 MB file so it taken a couple of minutes to copy the entire contents (progress bar).
It copied to clipboard and saved in a txt file normal notepad and 'encoding unicode big endian' format. But the file size became very less in notepad (around 17 MB).
I have doubt that, why this happened this much small file size rater that big size. Does there anything miss out?. i mean any data?
PDF files contain much more information than just text. This is perfectly normal.
Copy link to clipboard
Copied
PDF files contain much more information than just text. This is perfectly normal.
Copy link to clipboard
Copied
its may be the background rectangular window, on that window only the text appears.
Copy link to clipboard
Copied
No further explanation or guesswork is needed. Extracted text is usually much, much smaller than a PDF, because of the nature of PDF and of text.