"Stephen J. Turnbull" <stephen@xemacs.org> writes: > I would think that UTF-8 can be quite reliably detected without the > "BOM". There is a difference between auto-detection and declaration. Sure, you can auto-detect UTF-8; you might have to read the entire text for that, though. This is quite different from a declaration: The text either is declared as UTF-8, or it isn't. > Microsoft software for Japanese apparently ignores Content-Type > headers and the like in favor of autodetection (probably because the > same MS software regularly relies on users to set things like > charset parameters in MIME Content-Type). Auto-detection is useful for displaying content to users. It is evil for a programming language. Regards, Martin
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4