RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from http://mail.python.org/pipermail/python-dev/2000-May/003883.html below:

[I18n-sig] Re: [Python-Dev] Unicode debate

[I18n-sig] Re: [Python-Dev] Unicode debate [I18n-sig] Re: [Python-Dev] Unicode debatePaul Prescod paul@prescod.net
Tue, 02 May 2000 11:11:13 -0500

Previous message: [I18n-sig] Re: [Python-Dev] Unicode debate
Next message: [I18n-sig] Re: [Python-Dev] Unicode debate
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Combining characters are a whole 'nother level of complexity. Charater
sets are hard. I don't accept that the argument that "Unicode itself has
complexities so that gives us license to introduce even more
complexities at the character representation level."

> FYI: Normalization is needed to make comparing Unicode
> strings robust, e.g. u"é" should compare equal to u"e\u0301".

That's a whole 'nother debate at a whole 'nother level of abstraction. I
think we need to get the bytes/characters level right and then we can
worry about display-equivalent characters (or leave that to the Python
programmer to figure out...).
-- 
 Paul Prescod  - ISOGEN Consulting Engineer speaking for himself
It's difficult to extract sense from strings, but they're the only
communication coin we can count on. 
	- http://www.cs.yale.edu/~perlis-alan/quotes.html

Previous message: [I18n-sig] Re: [Python-Dev] Unicode debate
Next message: [I18n-sig] Re: [Python-Dev] Unicode debate
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4