http://www.commandprompt.com/products_DocParse.lxp - converts HTML into DocBook
http://www.arbortext.com/html/products.html - comprehensive suite of both editing and processing tools
http://www.renderx.com/products.html - a FO to PDF rendering engine