HTMLZ? I regret to inform you that you have stumbled upon one of the rare Topaz format experiments, and discovered exactly why it is so horrible.
Topaz used, IIRC, some form of embedded image, backed by OCR. The DeDRM plugin and calibre as well can only extract the OCR layer.
Basically it is the PDF of ebook formats.