Skip to content
Sections
>> Trisquel >> Packages >> etiona >> python >> python-html5-parser
etiona  ]
[ Source: html5-parser  ]

Package: python-html5-parser (0.4.4-1)

fast, standards compliant, C based, HTML 5 parser for python

A fast implementation of the HTML 5 parsing spec for Python. Parsing is done in C using a variant of the gumbo parser. The gumbo parse tree is then transformed into an lxml tree, also in C, yielding parse times that can be a thirtieth of the html5lib parse times. That is a speedup of 30x. This differs, for instance, from the gumbo python bindings, where the initial parsing is done in C but the transformation into the final tree is done in python.

Other Packages Related to python-html5-parser

  • depends
  • recommends
  • suggests
  • dep: libc6 (>= 2.14)
    GNU C Library: Shared libraries
    also a virtual package provided by libc6-udeb
  • dep: libxml2 (>= 2.7.4)
    GNOME XML library
  • dep: python
    interactive high-level object-oriented language (default version)
    dep: python (<< 2.8)
    dep: python (>= 2.7~)
  • dep: python-chardet
    universal character encoding detector for Python2
  • dep: python-lxml
    pythonic binding for the libxml2 and libxslt libraries

Download python-html5-parser

Download for all available architectures
Architecture Package Size Installed Size Files
amd64 123.1 kB480 kB [list of files]