RE: Corpora: Corpora and XML

Andrew Bredenkamp (andrewb@dfki.de)
Thu, 30 Sep 1999 09:36:39 +0200

> -----Original Message-----
> From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no]On
> Behalf Of Burnard Towers
> Sent: 30 September 1999 00:41
> To: Andrew Bredenkamp
> Cc: corpora@hd.uib.no
> Subject: RE: Corpora: Corpora and XML
>
>
> At the risk of prolonging a discussion which may be of only peripheral
> interest to most corpora readers, I think Andrew may be confusing
> the issue
> of how easily an SGML dtd can be converted to an XML one with the issue of
> how easily a (valid) SGML document can be converted to a valid XML one. In
> my posting I intended to make clear that the latter was simple, not the
> former.

Sorry, now I am confused!

How can you convert a valid SGML into a valid (not just well-formed) XML
document without validating it against an XML DTD? And how do you get this
XML DTD? if as you say...

<SNIP>
> Whether it is also a *valid* XML document cannot
> be determined without creating an XML dtd to validate it against, and this
> is certainly not a readily automatable process. (Unless you are
> using a TEI dtd, of course)

The points made by Ted are slightly orthogonal what I was saying, but
further illustrate that this whole process should not be taken lightly.
Users should take several deep breaths before embarking on this on any large
scale....

Cheers,
Andrew
-------------------------
Dr. Andrew Bredenkamp
Senior Researcher
Andrew.Bredenkamp@dfki.de
http://www.dfki.de/~andrewb
-------------------------