[doing an archeological dig through e-mail] On Tue, Aug 13, 2002, M.-A. Lemburg wrote: > > At least is good :-) NFC is NFD + canonical composition. Decomposition > isn't all that hard (using unicodedata.decomposition()). For > composition the situation is different: not all information is > available in the unicodedata database (the exclusion list) and > the database also doesn't provide the reverse mapping from > decomposed code points to composed one. See the Annexes to the > tech report to get an impression of just how hard combining is... In a message just prior to this one, you wrote: The recommended way of doing normalization is to go by Normalization Form C: Canonical Decomposition, followed by Canonical Composition. So, um, which way is it? -- Aahz (aahz@pythoncraft.com) <*> http://www.pythoncraft.com/ Project Vote Smart: http://www.vote-smart.org/
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4