jtidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, jtidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, jtidy provides a DOM interface to the document that is being processed, which effectively makes you able to use jtidy as a DOM parser for real-world HTML.

WWW: http://jtidy.sourceforge.net/
