Hyphen and Soft Hyphen Handling Issue During PDF to Word Conversion
Hello Team,
I’m facing a consistent issue while converting PDF files to Word (DOCX) using Adobe Acrobat.
Issue description:
Many PDFs contain line-end hyphenated words (for example: techni- cal, inter- national).
During PDF to Word conversion, the space after each line-end hyphen is dropped, so the word fragments are joined incorrectly (for example: techni-cal, inter-national).
This affects both visible hyphens and soft hyphens.
The expected output preserves the hyphen followed by a space (e.g., techni- cal).
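To make the difference concrete, here is a minimal Python sketch of the behavior described above. It is only an illustration under my own assumptions: the names LINE_END_HYPHEN, find_line_end_hyphens and observed_conversion are hypothetical, and the regex merely mimics the output I see. It says nothing about how Acrobat's export works internally.

```python
import re

SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN, invisible in most viewers

# Letters, then a visible or soft hyphen, then whitespace, then letters,
# e.g. "techni- cal". [^\W\d_] matches a single Unicode letter.
LINE_END_HYPHEN = re.compile(rf"[^\W\d_]+[-{SOFT_HYPHEN}]\s+[^\W\d_]+")

# Same hyphen-plus-whitespace pattern, but capturing only the hyphen so the
# whitespace after it can be dropped.
HYPHEN_THEN_SPACE = re.compile(rf"(?<=[^\W\d_])([-{SOFT_HYPHEN}])\s+(?=[^\W\d_])")


def find_line_end_hyphens(text: str) -> list[str]:
    """Return every line-end hyphenated word found in the text."""
    return LINE_END_HYPHEN.findall(text)


def observed_conversion(text: str) -> str:
    """Mimic the output I get: the whitespace after the hyphen is removed."""
    return HYPHEN_THEN_SPACE.sub(r"\1", text)


source = "techni- cal and inter- national"
print(find_line_end_hyphens(source))   # words I expect to survive unchanged
print(observed_conversion(source))     # what the DOCX contains instead
```

Running find_line_end_hyphens on the text of the exported DOCX returns an empty list for these words, which is how I confirm that the hyphen-plus-space form has been lost.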
Environment details:
OS: Windows 10
Adobe Acrobat version: 25.001.20997
Conversion method: Export PDF → Microsoft Word (.docx)
What I’ve already tried:
Multiple PDF files (scanned and digitally generated)
Different export and OCR settings
Opening the PDF directly in Microsoft Word and saving as DOCX
Batch conversions
The issue occurs consistently, especially with scientific and academic PDFs, where preserving original hyphenation is critical for text accuracy.
Request for guidance:
Is there any Acrobat preference or advanced setting to preserve line-end hyphens during PDF to Word conversion?
Is this a known issue in version 25.001.20997, and if so, is a fix planned?
Are there recommended workflows, patches, or plugins to address this behavior?
This issue significantly impacts professional publishing and conversion workflows.
Thank you for your time and support.
I look forward to your guidance.
Best regards,
Shan
