Structure/Schema - Custom or off the shelf?
Alan Houser
arh at groupwellesley.com
Fri Feb 3 08:30:34 PST 2006
Organizations are "successful" when they meet their business
requirements as efficiently (time and $$$) as possible. I talk lots of
people _out_ of migrating to XML for this reason. I even occasionally
say "you're doing just fine with MS Word."
I think the answer to the "custom or off-the-shelf" question differs
with each customer's business requirements and even tool choice. For
FrameMaker XML implementations, I'm much more likely to recommend
designing your own DTD. FrameMaker handles the "hard" part of publishing
XML (printing it), and provides a reasonable environment for building
your information model (the structured EDD). It's also relatively
difficult to get FrameMaker to play with most off-the-shelf DTDs,
mainly because of the constraints of the FrameMaker import/export model.
Rolling your own DTD becomes an attractive option with FrameMaker.
If you're migrating to XML, *not* using FrameMaker, and have print/PDF
as one of your output requirements, take a good hard look at using or
leveraging a standard information model, like DocBook or DITA. Printing
XML using XSL-FO is one of the most difficult tasks you will face, and
leveraging the XSLT transforms for an off-the-shelf DTD is likely to
save a very substantial amount of development time.
By the way, I never recommend starting out with out-of-the-box DocBook
-- if you use DocBook, I always favor constraining it. My observation
comes from a couple of implementations in which the client said "no
thanks, we're not going to bother constraining DocBook." Those clients
were creating fairly homogeneous information deliverables, and it turned
out that the writers were far more successful at marking up these
deliverables than I had expected, because they did so by example. Just
an observation.
It's also important to remember that a DTD alone isn't sufficient to
structure your data. A DTD cannot specify all possible constraints (the
recent "nesting level" discussion is just one example), nor can it
prevent the use of valid but semantically-incorrect tags. There always
needs to be a human element for markup review and quality control.
-Alan
eric.dunn at ca.transport.bombardier.com wrote:
>
> Alan Houser wrote:
> > Regarding DocBook -- I acknowledge that it's big, and has a "designed by
> > committee" feel. However, I've seen too many companies use it
> > successfully to dismiss it.
>
> Many companies use MSWord and other less than ideal tools. Also
> strongly depends on your definition of "successfully".
>
> > Having a set of extensible XSLT
> > transformations is absolutely invaluable -- not for the easy stuff, like
> > transforming "<title>" to "<h1>", but for the hard stuff, like building
> > a back-of-the-book index. Try writing that XSLT code from scratch.
>
> True. But, any self respecting implementer should see how they can
> create their own work environment that can perhaps integrate with and
> leverage the existing tools.
>
> > DocBook's size isn't necessarily the problem it may appear to be.
> > Authors tend to learn markup languages by example. Their approach is
> > typically "here's how we mark up a help topic in DocBook" (bottom-up),
> > as opposed to "which of DocBook's 400 elements do I need to mark up a
> > help topic?" (top-down).
>
> Another Yuck. Many people use MSWord styles in much the same manner.
> What's the point of adopting a structured environment if your not
> going to adopt a design that actually structures and controls your data?
>
> "here's how we mark up a help topic in DocBook" Should be a very easy
> exercise to take and create a working subset of Docbook. Then there's
> little authors can do wrong.
>
> Eric L. Dunn
> Senior Technical Writer
>
> PS: If this makes it to the list could someone give me a heads up?
>
>
--
---
Alan Houser, President
Group Wellesley, Inc.
412-363-3481
www.groupwellesley.com
More information about the framers
mailing list