Participant
March 19, 2025
Question

Page breaks in Word exclude topics from the CHM file


I have a project in RoboHelp 2022 into which I have imported about five Word documents. I use the Microsoft HTML Help output preset. When I create the CHM file, I notice that some topics are missing from it.
* The topics are referenced from the TOC, but with a '?' prefix in the link.

* When I click one of the '?' links, the reading pane shows a message that the HTML file is missing.

* The "missing" HTML files are not really missing; they are located in the correct folders.

* However, in the title tag of a "missing" HTML file, an "LTR" Unicode character (U+200E, the left-to-right mark) has been introduced.
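To see which topics are affected without opening each file, the check described above can be sketched in Python. This is only an illustration, not part of RoboHelp: the function name `titles_with_lrm` is made up, and it assumes the topic files are UTF-8 encoded `.htm`/`.html` files under one folder. It also looks for the entity forms of the same character, in case the mark was written that way.

```python
import re
from pathlib import Path

# First <title> element in a file; DOTALL lets the title span several lines.
TITLE_RE = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)
# U+200E (LEFT-TO-RIGHT MARK) as a raw character or as an HTML entity.
LRM_RE = re.compile("\u200e|&lrm;|&#8206;|&#x200e;", re.IGNORECASE)


def titles_with_lrm(root):
    """Return sorted paths of HTML files whose <title> contains U+200E."""
    hits = []
    for path in Path(root).rglob("*.htm*"):
        if not path.is_file():
            continue
        # Assumption: topics are UTF-8; undecodable bytes are replaced, not fatal.
        text = path.read_text(encoding="utf-8", errors="replace")
        match = TITLE_RE.search(text)
        if match and LRM_RE.search(match.group(1)):
            hits.append(path)
    return sorted(hits)
```

Calling `titles_with_lrm("path/to/project")` (a placeholder path) returns the list of suspect topic files, which can then be compared with the '?' entries in the TOC.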

After some debugging, it turned out that manual page breaks were the culprit. Page breaks had been inserted before some Heading 2 paragraphs to make the document look good when PDF versions are created. The HTML topic following the page break is the one that does not appear in the CHM file.
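For a large batch of documents, it may help to find out which source files contain manual page breaks at all before importing them. The sketch below is an assumption-laden illustration: it only handles `.docx` files (not the older `.doc` format), the function name `count_manual_page_breaks` is made up, and it counts only explicit breaks stored as `<w:br w:type="page"/>` in `word/document.xml`. It does not detect the "Page break before" paragraph setting.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used inside word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def count_manual_page_breaks(docx_path):
    """Count explicit page breaks (<w:br w:type="page"/>) in a .docx file."""
    with zipfile.ZipFile(docx_path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    return sum(
        1
        for br in root.iter(f"{{{W_NS}}}br")
        # A <w:br> without w:type is an ordinary line break and is skipped.
        if br.get(f"{{{W_NS}}}type") == "page"
    )
```

A non-zero count marks a document that is likely to produce the missing topics described above.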

So what to do? If I remove the page breaks, the CHM is created correctly and all topics are included as expected.

Why does this happen? Is there a workaround so that I can keep the page breaks where needed?

Thanks
/Henrik  

    2 replies

    Peter Grainge
    Community Expert
    March 26, 2025

    This started with Word import but now seems to be PDF import. However, there is no PDF import in 2022. Please clarify.

    ________________________________________________________

    My site www.grainge.org includes many free Authoring and RoboHelp resources that may be of help.

     

    Inspiring
    March 26, 2025

    No, it's a Word import. HenriK is doing two things:

     

    1. Word source to PDF

    2. Word source to RoboHelp to CHM

    Jeff_Coatsworth
    Community Expert
    March 19, 2025

    Personally, I'd be copy/pasting the content from the Word doc into a text editor like Notepad first to remove all the cruft that Word sticks in & then copy that plain text to RH and apply my styles there.

    Participant
    March 19, 2025

    I can relate to that strategy. Unfortunately, the amount of information in the production batch of documents does not allow for non-automated solutions.
    I see that there is an option to use post-import scripts when importing Word documents. Could that be an alternative?

     

    Jeff_Coatsworth
    Community Expert
    March 19, 2025

    Some sort of grand find-and-replace script? Probably.
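One possible shape for such a find-and-replace, sketched as a standalone Python script rather than a RoboHelp post-import script (the RoboHelp scripting API is not used here). It removes the U+200E character, or its entity forms, from the title tag of each topic file. The function names are made up, the files are assumed to be UTF-8, and whether cleaning the title is enough to make the topics appear in the CHM is untested, so try it on a copy of the project first.

```python
import re
from pathlib import Path

# Split the first <title> element into open tag, content, close tag.
TITLE_RE = re.compile(r"(<title[^>]*>)(.*?)(</title>)", re.IGNORECASE | re.DOTALL)
# U+200E (LEFT-TO-RIGHT MARK) as a raw character or as an HTML entity.
LRM_RE = re.compile("\u200e|&lrm;|&#8206;|&#x200e;", re.IGNORECASE)


def strip_lrm_from_title(html):
    """Remove U+200E from the first <title>; leave the rest of the page alone."""
    def clean(match):
        return match.group(1) + LRM_RE.sub("", match.group(2)) + match.group(3)
    return TITLE_RE.sub(clean, html, count=1)


def clean_tree(root):
    """Rewrite every .htm/.html file under root that needed cleaning."""
    changed = []
    for path in sorted(Path(root).rglob("*.htm*")):
        if not path.is_file():
            continue
        # Work on bytes so that the original line endings are preserved.
        original = path.read_bytes().decode("utf-8")
        cleaned = strip_lrm_from_title(original)
        if cleaned != original:
            path.write_bytes(cleaned.encode("utf-8"))
            changed.append(path)
    return changed
```

`clean_tree` returns the list of files it modified, so the result can be checked against the topics that showed up with a '?' in the TOC.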