Re: Corpora: Corpus markup checking programs

Patrice Bonhomme (Patrice.Bonhomme@loria.fr)
Thu, 06 Nov 1997 10:29:32 +0100

eiaamme@msmail.lancs.ac.uk said:
] but rather programs which check, say, whether DTDs have been adhered
] to, or check that SGML has been properly applied to a document.

It is what we call an SGML parser. The more famous one is nsgmls coming with
the James Clark package SP (available at http://www.jclark.com/).

But what you mentioned will not check the semantic integrity of corpus
encoding. For example, yuo can put every thing you want within a <P> (let say
a paragraph) and even if your data is not a paragraph while the SGML syntax is
correct ! As i know, there is no tool or software to check that level of
integrity.

Pat.

-- 
  ==============================================================
  bonhomme@loria.fr               |      Office : B.228
  http://www.loria.fr/~bonhomme   |      Phone  : 03 83 59 20 37
  --------------------------------------------------------------
   * Projet Aquarelle : http://aqua.inria.fr
   * Serveur Silfide  : http://www.loria.fr/Projet/Silfide
  ==============================================================