Skip to main content
deepspace9
Participant
May 10, 2011
Question

How to convert unstructured document into structured document

  • May 10, 2011
  • 4 replies
  • 3622 views

I have many unstructured documents. Is it possible to convert these documents into strcutured documents? I know that structured FM uses DITA which is different from unstrcutured document. This is client's requirement so I want to get community views on this.

    This topic has been closed for replies.

    4 replies

    May 18, 2011
    Mark Southee
    Inspiring
    May 12, 2011

    Yanesh,

    I would approach this in 2 stages.

    1. Understand how DITA works independently of FrameMaker first. I say this because DITA is not a straight forward 'out of the box' experience. This could be a good starting point -
    2. Understand how FrameMaker works with DITA. FrameMaker has it's own way of how you author DITA based content. You might want to look at tools such as DITA FMx - http://www.leximation.com/dita-fmx/, which claims to make DITA authoring easier with FrameMaker.

    If you are initially just producing PDF from your DITA content, then FrameMaker is probably the best way of getting the results you want, as FrameMaker templates work the same with structured content, as they do with unstructured.

    Good luck

    Mark

    Legend
    May 11, 2011

    Hi YaneshTyaqi,

    Jeff is correct... DITA is a specific, standardized structure definition. You can structure according to DITA, or you can structure according to any other definition that you like. With respect to methods, there are many:

    - The FrameMaker structure generator/conversion table process - This is built into FrameMaker and I think is generally the preferred method. You can structure without ever having to leave the FrameMaker environment. The process is described in the Structure Application Developers Guide:

    http://help.adobe.com/en_US/FrameMaker/8.0/structapp_dev.pdf

    - A third-party app like Mif2Go - I believe Mif2Go can convert straight from unstructured to DITA XML. This would give you XML files, not necessarily structured FM documents, but if you want DITA specifically and/or already have Mif2Go, this might be a good option.

    - A MIF converter such as MIFML from Leximation - This app can convert a MIF file to XML. Another possible option to go from unstructured FM -> XML -> structured FM.

    http://www.leximation.com/tools/mifml/

    There are many additional techniques that may be employed, especially in addition to those mentioned above. It is never a turnkey process; rather, one that demands time and attention to detail. The length of time is directly proportional to your expertise in structured markup and related tools. It sounds like you are a beginner at this... I might suggest that you start early and begin some experimentations. Read up in that Structure Dev Guide... lots of good stuff there. You could also pay somebody, but then you won't learn very much. Given the importance of these technologies, I'd suggest any route that includes the maximum amount of learning.

    Russ

    [Edit] Mif2Go link:

    http://www.omsys.com/dcl/mif2gopg.htm

    Jeff_Coatsworth
    Community Expert
    Community Expert
    May 11, 2011

    In fact I just got an announcement of a series of Webinars about doing this very thing from Adobe (converting unstructured to structured):

    June 14th - Unstructured To XML Workflow Series Part 1: Analyzing Your

    Unstructured Document - Preprocess Steps

    June 21st - Unstructured To XML Workflow Series Part 2: Creating A

    FrameMaker Conversion Table - Part 1

    June 28th - Unstructured To XML Workflow Series Part 3: Creating A

    FrameMaker Conversion Table - Part 2

    July 7th - Unstructured To XML Workflow Series Part 4: Element

    Definition Document (EDD) - Home Grown or a Standard

    July 14th - Unstructured To XML Workflow Series Part 5: Creating your

    Structured Template from an Unstructured One

    July 28th - Unstructured To XML Workflow Series Part 6: Structure

    Applications and What Is Necessary and How To Create

    August 2nd - Unstructured To XML Workflow Series Part 7: Save As XML,

    Now What?

    They will be found here:

    http://www.adobe.com/cfusion/event/index.cfm?event=list&loc=en_us&type=eseminar&product=TechnicalCommunicationSuite&interest=&audience=&monthyear=

    Jeff_Coatsworth
    Community Expert
    Community Expert
    November 14, 2013

    https://blogs.adobe.com/techcomm/2013/06/classic-7-part-aldous-series-on-unstructured-to-xml-webinar-series-republished.html


    Thanks, Error – I was scrambling to figure out where Adobe had put all of that ;>)

    Jeff_Coatsworth
    Community Expert
    Community Expert
    May 10, 2011

    Structured FM can use DITA, but doesn't have to - it's just one of many structured schemas