From: "Fredrik Lundh" <effbot@telia.com> > > my unicode database compression project has resulted in its first > > patch: > > > > https://sourceforge.net/patch/index.php?func=detailpatch&patch_id=100899&gro up_id=5470 > two additional comments: > first, the patch was too large for sourceforge (>512000 bytes), so > I've uploaded it to the secret effbot site instead: > http://w1.132.telia.com/~u13208596/uninames-patch.txt > second, the uninames module provides two public Python functions: > getname(code) returns the character name corresponding to > the given character code, or None if the name is not known. > > getcode(name) returns the character code for the given character > name. again, it returns None if the code is not known. Getcode/getname need to use Py_UCS4 instead of int. Smaller data sizes are definitely cool, but as MAL mentioned you didn't include the genration code, nor an explanation about how the stuff works. Looking good so far! Bill
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4