Tim Lesher sagte: > Here's the first draft of the python-dev summary for the first half of April. Please send any corrections or suggestions to > the summarizers. > [...] > ---------------------------------------- > Unicode byte order mark decoding > ---------------------------------------- > > Evan Jones saw that the UTF-16 decoder discards the byte-order mark (BOM) from Unicode files, while the UTF-8 decoder > doesn't. Although the BOM isn't really required in UTF-8 files, many Unicode-generating applications, especially on Microsoft > platforms, add it. > > Walter Dörwald created a patch_ to add a UTF-8-Sig codec that generates a BOM on writing and skips it on reading, but after a > long discussion on the history of the Unicode, Microsoft's influence over its > evolution, the consensus was that BOM and signature handling belong at a higher level (for example, a stream API) than the > codec. All codecs provide a stream API, so there is no higher level. Bye, Walter Dörwald
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4