Html5lib Python

Standards-compliant library for parsing and serializing HTML documents and fragments in Python
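
A minimal sketch of what parsing and serializing might look like in practice, assuming the `html5lib` package is installed and its default `xml.etree.ElementTree` tree builder is used. The input strings and variable names are illustrative only and do not come from the listing.

```python
import html5lib

# Namespace that html5lib applies to HTML elements by default.
XHTML = "{http://www.w3.org/1999/xhtml}"

# Deliberately sloppy input: no <html>/<body> wrappers, unclosed tags.
source = "<p>Hello <b>world"

# parse() builds a complete document tree and returns its root <html> element.
root = html5lib.parse(source)
bold = root.find(f".//{XHTML}b")

# serialize() turns the tree back into markup; optional tags are kept here
# so the repaired structure is visible in the output.
markup = html5lib.serialize(root, omit_optional_tags=False)

# parseFragment() parses a snippet without wrapping it in a full document.
fragment = html5lib.parseFragment("<li>one<li>two")
items = [el.text for el in fragment.iter(f"{XHTML}li")]

print(markup)
print(items)
```

The parser fills in missing elements and closes open tags following the HTML parsing algorithm, so malformed input still yields a well-formed tree.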