Re: [Corpora-List] Translator_HTML_to_XML

From: d'Armond Speers (speersdl@msn.com)
Date: Sat May 03 2003 - 01:36:38 MET DST

  • Next message: Scott James Cederberg: "Re: [Corpora-List] Translator_HTML_to_XML"

    >Dear all,
    >
    >I'm working on an Internet Query System,
    >Can somebody point me to : any system for translating
    >HTML to XML (In Java)?

    Hmm, HTML is a form of XML, isn't it?

    For converting between different XML specs (as defined by a DTD or XML
    Schema), you should take a look at XSLT (XML transforms). This is an
    XML-based programming language. There are quite a few XSLT processors out
    there that include Java libraries, such as Saxon and Xalan. You write the
    XSLT, and apply the XSLT to the input XML to generate the output XML. Check
    out XML, XSL and XML Schemas at the W3C (www.w3.org).

    >Thanks a lot,
    >wassim

    --
    d'Armond Speers, Ph.D.
    speersd@georgetown.edu
    

    _________________________________________________________________ Tired of spam? Get advanced junk mail protection with MSN 8. http://join.msn.com/?page=features/junkmail



    This archive was generated by hypermail 2b29 : Sat May 03 2003 - 01:36:56 MET DST