Showing content from https://mail.python.org/pipermail/python-dev/2001-July/015949.html below:

[Python-Dev] re with Unicode broken?

Fredrik Lundh fredrik@pythonware.com
Fri, 13 Jul 2001 16:44:22 +0200
sjoerd wrote:

> This is not for the faint of heart.
>
> My validating XML parser doesn't work anymore, even though I didn't
> change a thing (except update Python from CVS).

when did you last update without problems?

the likely cause for this is MvL's "big char set" patch, which
I checked in on July 6.

here's a workaround: tweak sre_compile.py so it doesn't generate
BIGCHARSET op codes. in _optimize_charset, change this:

    except IndexError:
        # character set contains unicode characters
        return _optimize_unicode(charset, fixup)
    # compress character map

to

    except IndexError:
        # character set contains unicode characters
        return charset # WORKAROUND: no compression
    # compress character map
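
to check whether a given build handles character sets outside the
8-bit range, something like the following sketch may help. it is not
part of the patch or of sre itself: the function name, the CASES table
and the sample patterns are all made up for illustration, and it is
written for a current Python 3 interpreter rather than the 2001 CVS
tree.

```python
import re

# Hypothetical sample data (not taken from the XML parser in question):
# each entry is (pattern, text that should match, text that should not).
CASES = [
    ("[\u0100-\u017f]+", "\u0101\u0150", "abc"),
    ("[a-z\u0400-\u04ff]+", "x\u0416y", "123"),
    ("[^\u3040-\u309f]+", "kana", "\u3042"),
]


def check_unicode_charsets(cases=CASES):
    """Return a list of (pattern, problem) tuples; an empty list means
    every character set compiled and matched as expected."""
    failures = []
    for pattern, good, bad in cases:
        try:
            compiled = re.compile(pattern)
        except Exception as exc:
            # a compiler-level problem would most likely surface here
            failures.append((pattern, "compile failed: %r" % (exc,)))
            continue
        if compiled.fullmatch(good) is None:
            failures.append((pattern, "did not match %r" % (good,)))
        if compiled.fullmatch(bad) is not None:
            failures.append((pattern, "unexpectedly matched %r" % (bad,)))
    return failures


if __name__ == "__main__":
    for pattern, problem in check_unicode_charsets():
        print("%r: %s" % (pattern, problem))
```

running it before and after the workaround should show whether the
character set handling is what changed.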

I'll look into this over the weekend.

Cheers /F




