Copy link to clipboard
Copied
Dear all,
In FM 10, when exporting XML, minus signs (U+002D) appear in text content at the location where (it seems) frames are anchored. What might be causing this? Is there a way to prevent this happening?
- Alex.
Copy link to clipboard
Copied
Alex,
What structured application are you using (what DTD/Schema are your documents using) and are there some read/write rules configured?
- Michael
Copy link to clipboard
Copied
It's a custom application based around the NLM (Journal Publishing) tag set. There *are* read/write rules but they only apply to table handling.
The export to XML is under control of an ExtendScript script, so I could potentially do something programatically as part of the export process, but would rather not have to ...
- Alex.
Copy link to clipboard
Copied
On closer inspection I was mistaken - these are not anchors being exported as dashes but discretionary hyphens. Is there a way to prevent these being exported in the XML? I really don't need them ...
- Alex.
Copy link to clipboard
Copied
Am 26.08.2011 um 10:13 schrieb alexbrn:
On closer inspection I was mistaken - these are not anchors being exported as dashes but discretionary hyphens. Is there a way to prevent these being exported in the XML? I really don't need them ...
Alex,
How are you looking at your XML file? I suspect that discretionary hyphens are exported not as dashes (\U002D) but as \U00AD. Some advanced editors can show you the hex code of a character. Whether it is necessary to remove them depends on what you plan to do with the XML file.
To remove a certain character from the XML you have to make sure it is exported as an entity and then define this entity as an empty string. It could work like this:
Add this line to the Read/Write rules file:
entity "softhyphen" is fm char "\x04";
Add this to the external DTD:
<!ENTITY softhyphen "">
For testing you could write <!ENTITY softhyphen "removed \x04">
- Michael
Copy link to clipboard
Copied
Michael hi
Thanks for the helpful reply - yes I would have expected these to be something other than U+002D, but that is what Frame is exporting (I double checked with a hex editor to confirm). This is the case both for discretionary hyphens which have been rendered as soft hyphens, and the ones Frame records within moved content and that no longer falls at a line-end.
- Alex.
Get ready! An upgraded Adobe Community experience is coming in January.
Learn more