pullrequests Search Results · repo:html5lib/html5lib-python language:Python

322 results

Provides a migration path for #525. - ~The python debug.py post-build step from .travis.yml has not been migrated; GitHub Actions workflows support a continue-on-error configuration flag at the ...
  • jayaddison
  • 10
  • Opened on Jan 10, 2021
  • #527

This change uses Python's builtin str.lower to perform string lowercasing (typically used during case-insensitive element/attribute name comparison), and provides a nice small parsing performance benefit ...
  • jayaddison
  • 1
  • Opened on Jan 9, 2021
  • #526
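
As a rough illustration of the lowercasing approach this entry describes, the sketch below compares a translation-table lowercasing helper with Python's built-in `str.lower`. The table and function names are hypothetical and are not taken from the pull request; the point is only that the two agree on ASCII names and differ on non-ASCII input.

```python
import string

# Hypothetical table that maps only ASCII A-Z to a-z (illustration only).
ASCII_UPPER_TO_LOWER = {ord(c): ord(c.lower()) for c in string.ascii_uppercase}


def ascii_lower(name):
    """Lowercase only ASCII letters, via a translation table."""
    return name.translate(ASCII_UPPER_TO_LOWER)


def builtin_lower(name):
    """Lowercase using the built-in str.lower, which is Unicode-aware."""
    return name.lower()


if __name__ == "__main__":
    for sample in ("DIV", "onClick", "\u00c9"):
        print(repr(sample), repr(ascii_lower(sample)), repr(builtin_lower(sample)))
```

Because `str.lower` also lowercases non-ASCII characters, the two helpers are interchangeable only where the input is known to be ASCII or where Unicode-aware lowercasing is acceptable.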

Use Cython to make the parser quicker; see #445. This builds on top of #272. This is a long way from ready to land, but shows potential. We probably also want to split this up so many of the earlier changes ...
  • gsnedders
  • 4
  • Opened on Jan 5, 2021
  • #524

The InBodyPhase parser phase creates and processes implied close-tag tokens for a small number of stopNames elements. This changeset updates the phase's logic so that processing of these implied tokens ...
  • jayaddison
  • 1
  • Opened on Dec 31, 2020
  • #523

This changeset should functionally be a no-op, and may be best reviewed with whitespace changes ignored since the indentation level for some of the loop's logic has been reduced. The changes appear to ...
  • jayaddison
  • 2
  • Opened on Dec 31, 2020
  • #522

This change refactors the tokenizer module to use Python object instances where previously plain dictionaries were used to hold token state. This builds upon #519, #520 and attempts to resolve #24.
  • jayaddison
  • 3
  • Opened on Dec 30, 2020
  • #521
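
To make the dictionary-versus-object distinction in this entry concrete, here is a minimal sketch of the general idea. The class name, field names, and dictionary keys are assumptions chosen for illustration and do not reflect the actual code in the pull request.

```python
class StartTagToken:
    """Hypothetical token object; __slots__ keeps instances small and fixed-shape."""

    __slots__ = ("name", "attributes", "self_closing")

    def __init__(self, name, attributes=None, self_closing=False):
        self.name = name
        # Use a fresh dict per token to avoid a shared mutable default.
        self.attributes = attributes if attributes is not None else {}
        self.self_closing = self_closing


# Dictionary-style token state (keys are illustrative).
dict_token = {"type": "StartTag", "name": "div", "data": {}, "selfClosing": False}

# Object-style token state carrying the same information.
obj_token = StartTagToken("div")

if __name__ == "__main__":
    print(dict_token["name"], obj_token.name)
```

With objects, a misspelled field raises `AttributeError` on assignment instead of silently adding a new key, and the token type is carried by the class rather than by a string value.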

During tokenization, some element names, attributes, and temporary buffered strings are compared in a case-insensitive mode. To avoid repeat string transformation operations, this change performs lowercasing ...
  • jayaddison
  • 14
  • Opened on Dec 29, 2020
  • #520

This is a small consistency fixup relating to the way that attribute names are retrieved; it also makes some follow-up refactoring work a little cleaner. Parsing continues fine if we consume a single ...
  • jayaddison
  • 3
  • Opened on Dec 29, 2020
  • #519

This PR adds CI support for the IBM Power Little Endian (ppc64le) architecture. The idea is to ensure that the builds on this architecture are continuously tested along with the Intel builds (amd64) as ...
  • nageshlop
  • 2
  • Opened on Dec 13, 2020
  • #518