Skip to main content
Participating Frequently
August 12, 2020
Answered

HTML5 Webhelp not discoverable on open web

  • August 12, 2020
  • 4 replies
  • 575 views

Hello,

 

A quick question to the community: When I generate our help files and we post them, they are available on the www, and catalogued by Google's search engine, etc. My understanding is there is is way to prevent (or at least request that) Google not catelogue these files so they will not appear in open Google searches.

 

Is there some way to flag a project at export to include metadata that prevents search engines from cateloging the content?

 

My thanks in advance for any thoughts!

This topic has been closed for replies.
Correct answer Peter Grainge

When you generate everything in the target folder is deleted. When you publish nothing is deleted. You only need to set things up once.

 

There's something on my site about it.

 

4 replies

Peter Grainge
Community Expert
Community Expert
August 12, 2020

Pretty sure the devs can come up with something similar to how RoboHelp works.

 

Use the menu (bottom right) to mark the Best Answer or Highlight particularly useful replies. Found the answer elsewhere? Share it here.
Peter Grainge
Community Expert
Peter GraingeCommunity ExpertCorrect answer
Community Expert
August 12, 2020

When you generate everything in the target folder is deleted. When you publish nothing is deleted. You only need to set things up once.

 

There's something on my site about it.

 

Use the menu (bottom right) to mark the Best Answer or Highlight particularly useful replies. Found the answer elsewhere? Share it here.
Participating Frequently
August 12, 2020

It's a little different how we publish... I generate to the target folder and that folder then syncs to our servers through SVN, so I'm not exactly sure what does and does not get eliminated when it comes out the other side of the process. The best solutoin for me would be to have the needed metadata be part and parcel of the generation process, but if that's not easily acheivable, then I'd need to go back to dev and see what they can do. Thanks!

Jeff_Coatsworth
Community Expert
Community Expert
August 12, 2020

The only way you can have RH add in stuff like that is to mess about with the "factory" ingredients that go into making the output - all javascript I suspect. Unless you've got developers that can help you with that, I suspect running your own post-production find-and-replace script would be easier.

Participating Frequently
August 12, 2020

All right. Thanks again. I will go talk to Dev and see what they say. Messing about in javascripting isn't anything I would want to do if it could be avoided. Cheers!

Jeff_Coatsworth
Community Expert
Community Expert
August 12, 2020
Participating Frequently
August 12, 2020

Thank you Jeff, for your reply. 🙂 I hadn't looked at that yet, but it does mention the Robots.txt file, which has come up. The concern was that each time the project regenerates and we stick it on our server, the robots.txt would be overwritten. I've been asked to see if there's a way for RoboHELP to include this information on generation so we don't lose it. The meta robots tag sounds like it could be a solution, but again, it would need to somehow be included in every html file in the project, and part of what I was trying to determine is if there's a way for RH to add that information at generation.