Class HTMLDocumentBuilder


  • public class HTMLDocumentBuilder
    extends java.lang.Object
    Builds a DOM document from HTML content, converting 'plain' HTML into 'XHTML' along the way with the help of a TolerantSaxDocumentBuilder and the Swing HTML parser classes. This allows XML assertions to be made against badly formed HTML.
    See Also:
    TolerantSaxDocumentBuilder
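
The conversion described above can be illustrated with a minimal sketch that uses only JDK classes. It is not this class's implementation: SwingHtmlToDomSketch and DomCallback are hypothetical names, and the DOM is built directly from the Swing parser callbacks rather than through SAX events and a TolerantSaxDocumentBuilder. It only shows how tolerant HTML parsing can yield a well-formed DOM tree.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Enumeration;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

// Hypothetical illustration class; not part of the documented library.
public class SwingHtmlToDomSketch {

    /** Receives Swing parser callbacks and mirrors them into a DOM tree. */
    static class DomCallback extends HTMLEditorKit.ParserCallback {
        private final Document doc;
        private Node current; // node that receives the next child

        DomCallback(Document doc) {
            this.doc = doc;
            this.current = doc;
        }

        private Element newElement(HTML.Tag tag, MutableAttributeSet attrs) {
            Element e = doc.createElement(tag.toString());
            // Copy only standard HTML attributes; parser-internal keys are skipped.
            for (Enumeration<?> names = attrs.getAttributeNames(); names.hasMoreElements();) {
                Object name = names.nextElement();
                if (name instanceof HTML.Attribute) {
                    e.setAttribute(name.toString(), String.valueOf(attrs.getAttribute(name)));
                }
            }
            return e;
        }

        private void append(Node child) {
            // A DOM Document accepts one root element, so any later
            // top-level element is nested under the existing root.
            Node parent = (current == doc && doc.getDocumentElement() != null)
                    ? doc.getDocumentElement() : current;
            parent.appendChild(child);
        }

        @Override
        public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
            Element e = newElement(tag, attrs);
            append(e);
            current = e;
        }

        @Override
        public void handleEndTag(HTML.Tag tag, int pos) {
            if (current.getParentNode() != null) {
                current = current.getParentNode();
            }
        }

        @Override
        public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
            // Unknown closing tags are reported as simple tags flagged with ENDTAG.
            if (attrs.isDefined(HTML.Attribute.ENDTAG)) {
                return;
            }
            append(newElement(tag, attrs));
        }

        @Override
        public void handleText(char[] data, int pos) {
            // Text is only legal inside an element, never directly under the Document.
            if (current instanceof Element) {
                current.appendChild(doc.createTextNode(new String(data)));
            }
        }
    }

    public static Document parse(Reader html) throws IOException, ParserConfigurationException {
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
        // ignoreCharSet = true avoids an exception on <meta> charset declarations.
        new ParserDelegator().parse(html, new DomCallback(doc), true);
        return doc;
    }

    public static Document parse(String html) throws IOException, ParserConfigurationException {
        return parse(new StringReader(html));
    }

    public static void main(String[] args) throws Exception {
        // The <p> is never closed and <br> has no end tag, yet a DOM tree results.
        Document doc = parse("<html><body><p>Hello<br>world</body></html>");
        System.out.println(doc.getDocumentElement().getNodeName());
        System.out.println(doc.getElementsByTagName("br").getLength());
    }
}
```

Each call in this sketch creates a fresh callback and Document, so no state is shared between parses. The documented class keeps its state on the instance, which is consistent with its parse methods being marked as not thread-safe.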
    • Constructor Detail

      • HTMLDocumentBuilder

        public HTMLDocumentBuilder(TolerantSaxDocumentBuilder tolerantSaxDocumentBuilder)
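        Creates a builder that forwards the SAX calls generated during HTML parsing to the given TolerantSaxDocumentBuilder.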
        Parameters:
        tolerantSaxDocumentBuilder - the instance that will receive SAX calls generated as the HTML is parsed and build up a DOM Document
    • Method Detail

      • parse

        public org.w3c.dom.Document parse(java.io.Reader reader)
                                   throws org.xml.sax.SAXException,
                                          java.io.IOException
        Returns:
        a DOM document parsed from the Reader via a SwingEvent2SaxAdapter and a TolerantSaxDocumentBuilder. This method is not thread-safe.
        Throws:
        org.xml.sax.SAXException
        java.io.IOException
        See Also:
        TolerantSaxDocumentBuilder
      • parse

        public org.w3c.dom.Document parse(java.lang.String htmlString)
                                   throws org.xml.sax.SAXException,
                                          java.io.IOException
        Returns:
        a DOM document parsed from the String via a SwingEvent2SaxAdapter and a TolerantSaxDocumentBuilder. This method is not thread-safe.
        Throws:
        org.xml.sax.SAXException
        java.io.IOException
        See Also:
        TolerantSaxDocumentBuilder
      • getTrace

        public java.lang.String getTrace()
        Returns:
        the trace of events and / or warnings encountered during parsing
      • trace

        private void trace(java.lang.String msg)
        Appends a message to the log built up during parsing.
        Parameters:
        msg - what to append
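
The pairing of a private trace method with a public getTrace accessor can be sketched with JDK classes alone. ParseTraceSketch, traceOf and the "start:", "end:", "simple:", "text:" and "warning:" prefixes are hypothetical choices made for illustration. The actual format of the trace returned by getTrace is not specified here.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical illustration class; not part of the documented library.
public class ParseTraceSketch extends HTMLEditorKit.ParserCallback {
    // Log accumulated while the parser reports events.
    private final StringBuilder traceBuilder = new StringBuilder();

    /** Appends one line to the log built up during parsing. */
    private void trace(String msg) {
        traceBuilder.append(msg).append('\n');
    }

    /** Returns everything recorded so far. */
    public String getTrace() {
        return traceBuilder.toString();
    }

    @Override
    public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
        trace("start:" + tag);
    }

    @Override
    public void handleEndTag(HTML.Tag tag, int pos) {
        trace("end:" + tag);
    }

    @Override
    public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
        trace("simple:" + tag);
    }

    @Override
    public void handleText(char[] data, int pos) {
        trace("text:" + new String(data));
    }

    @Override
    public void handleError(String errorMsg, int pos) {
        // The Swing parser reports recoverable problems here instead of throwing.
        trace("warning:" + errorMsg);
    }

    /** Parses the given HTML and returns the recorded trace. */
    public static String traceOf(String html) throws IOException {
        ParseTraceSketch callback = new ParseTraceSketch();
        new ParserDelegator().parse(new StringReader(html), callback, true);
        return callback.getTrace();
    }

    public static void main(String[] args) throws IOException {
        System.out.print(traceOf("<html><body><p>Hi</p></body></html>"));
    }
}
```

Because the log lives in an instance field, two threads sharing one callback would interleave their entries. Using a separate instance per parse, as traceOf does, avoids that.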