Skip to main content
Participant
February 4, 2023
Question

Newbie to XML publishing. What role does XML, DTD, Stylesheets

  • February 4, 2023
  • 3 replies
  • 1003 views

Hi all,

I'm new to XML publishing and want to support a simple workflow for translation. Goal is to...

Format an existing InDesign file using tags and stylesheets, export XML-->translation vendor-->import new translation in file (using exact same structure and formating of master language file). I have a few questions:

Do I need to have to a DTD if my paragraph styles match the tag name?

I assume the XML structure controls the order of publishing?? Does this mean I must use one connected text box, or can there be multiple containers. It should be noted that the doc contains- headers, paragraph, bulleted, tables with inline graphics throughout.

 

Following some other guidance video I found online, I have no cloned styles/tags. Each section is defined for both. When I export the XML, it appears to be a well-formed XML with table tags and href calls. It fact, the XML can be imported into a field based content tool (ultimately the goal).

However, when I make a simple content change in the XML to reimport, it blows up the doc. I used Merge & Append both with no success. I also used a checked XLST and non-checked. I'm using INDD 2022 on OS 13.0.1.

 

I'd be super grateful if someone can state the process I should use to achieve the automation I'm looking for by exporting and importing XML.

Is it because I don't have a DTD??

Is there a way to create one by manually tagging everything to have a tag that matches a paragraph style?

If I do need a DTD, can someone reference a good source to learn the basics of creating one?

 

Thanks in advance! I've attached a screen capture of one small section that illustrates an example of the tables which contain the more complex formatting. The rest are different section header & paragraph formats- with bulleted and non-bulleted text.

This topic has been closed for replies.

3 replies

Legend
February 5, 2023

One more thought. If you are working with an external translation vendor, there isn't much point guessing what to do. They are likely to have very rigorous requirents for what they receive from you, and if it's XML I'd expect that THEY have a mandatory DTD and predefined workflow. It's nice to practice with the tools, but don't put too much into the detail at this stage.

Joel Cherney
Community Expert
Community Expert
February 5, 2023

I'm new to XML publishing and want to support a simple workflow for translation. Goal is to...

Format an existing InDesign file using tags and stylesheets, export XML-->translation vendor-->import new translation in file (using exact same structure and formating of master language file). I have a few questions:

 

I do, too. Why do you want to do it this way? 

 

No, but seriously. I will totally take a stab at answering your questions, but first, I want to know why you've started with such a complex translation workflow. InDesign can export a file of the type "IDML" which is a markup language rendering of an InDesign document. It successfully captures everything but placed images, and I think pretty much every CAT tool and TMS in existence can consume and generate IDML. There must be exceptions, but I can't think of them at the moment. So, you're starting your translation process in a place where you seem to be doing way more heavy lifting than is necessary. Why have you skipped IDML and gone straight to XML export/reimport? 

 

Do I need to have to a DTD if my paragraph styles match the tag name?

 

Nope. 

 

Does this mean I must use one connected text box, or can there be multiple containers.

 

There can be as many text frames as you like, so long as you tag 'em all in the Structure panel. In fact, one set of threaded text frames - one "story" - winds up getting a single tag. Stuff inside those threaded text frames (e.g. paragraph styles that you've mapped to tags) appear under the tagged frame in the Structure panel. 

 

Following some other guidance video I found online, I have no cloned styles/tags.

 

That's good advice for an XML translation workflow in InDesign... can you post a link to the video? 

 

It fact, the XML can be imported into a field based content tool (ultimately the goal).

 

So, what's the goal again? You have English-language content in an InDesign file, and at the end you don't want a non-EN InDesign file? It's just a step on the way to your goal of content management? I'm not completely sure what your field-based content tool is, in this context, but if your main goal is to get all of your content into it, I can offer you some suggestions. If you're going to be using a language service provider of some kind (that's the "tranlsation vendor" in your post), then it should be Extremely Easy to get them to deliver your translated content in a way that makes it very easy to hoover up into your content management tool, which is what I am guessing your field-based content tool is. 

 

Said another way: if you have content in English, and you want it in a lot of languages, and your English content isn't already 100% in InDesign documents that are already well tagged and ready for XML export, then there is approximately a 99.44% chance that there is a better way to get your content management tool full of translated content. 

 

I could be wrong! Maybe, after you explained your workflow to me, I would respond, "... aw, jeez! You're right, the fastest & least error-prone way for you to do this is to house your English content in InDesign, and tag it up there, and export XML for your LSP." But I really doubt it. If it needs to be in XML, there are better places to work than InDesign, and if it needs to be in InDesign, there are better ways to translate it than exporting XML. Usually. 

Participant
February 6, 2023

Thanks for the response... 

I'll back up a little. 

The docs already exist in InDesign and our teams are trained in IDD & Illustrator. 

The translation process today is handled by copying and pasting EN fields in a word table. After translation is completed per language- copy and paste back into IDD doc.  As the translations involve over 24 languages, making it field based is the only way for designers to know what goes where and ensures stylized text is also in the right place especially for Asian and Middle Eastern languages. Translation company would prefer XML and we would prefer a more automated workflow. They ensure they will keep the XML structure identical as their software just replaces live text in the XML. 

 

Having played around this weekend, I got it to mostly work. Two outstanding elements. It doesn't like two different fonts in one paragraph style (even though it's fine with stylied character styles like Bold and bold italic). One section uses a bullet graphic from dingbats and futura for the rest, even when I tag separately, it wants to make the paragraph all dingbats. 

The second issue that I'm still working on, as it worked for a very basic test I did with inline graphics, is bringing those graphics back into the doc when replacing the XML. 

 

The other reason for this workflow is to introduce a copy management tool to allow changes to be done there by the client and not by annotating a PDF.  If we split the content with that in mind, we will be able to populate the database from the XMLs. (crawl, walk, run... )

 

BTW, the style sheets are built with a core reference style to "base" all other styles on. As language in some cases requires a different font, we are able to easily edit the english style sheets to another font set for that language. So there are a lot of things that were done well in the IDD files already. 

 

Hopefully, that makes a bit more sense. I'm getting closer.. I can probably work around the two fonts with an inline graphic instead, but hoping there's a better way. 

 

thanks 

Joel Cherney
Community Expert
Community Expert
February 8, 2023

I'll try tomorrow... as it's a client's content, I need to pick one that is a little less obvious in whose it is... I'll probably post a Zip of the InDesign, graphics. On top of it, my computer is up to date on software- but the fonts for this job aren't... bunch of Type 1 fonts which I can no longer load!! So for the test, I ended up changing all of them to Arial. I'm open to any ideas on a better workflow to accomplish (except using Framemaker). I thought of using InCopy in between, but that doesn't seem to add value to the workflow. 

 

In answer as to why Translation won't take an IDML, if they have XML they can just run it through their Translation memory and spit out the translations automatically. Having the IDD or IDML to open in an IDD file would require human intervention to enter the translation in the doc. 


Well... If you can post a sample, I can see if I can figure out what's not working. My own TM systems can apply translation memory to IDML without human intervention, but unless your supplier wants me to teach 'em how, I suppose they're not going to be able to do so. With that limitation, I can see why you're trying to use an XML workflow. So the success or failure of your XML export is going to be down to how you are tagging your documents for export, and how well the document is built to account for text expanison upon re-import. 

James Gifford—NitroPress
Legend
February 4, 2023

As InDesign does very little with XML, I think you must have meant to place this in the FrameMaker forum, which is very much an XML/DTD-driven app. In any case, you will get much more useful answers there, I think.

 

(Flagged for moderator attention.)