On Mon, Nov 2, 2009 at 9:51 PM, ssteinerX at gmail.com <ssteinerx at gmail.com> wrote: > BeautifulSoup, which I use every day, is one such product. Since the crappy > old SMGL parser's gone, BeautifulSoup uses the one that's left in Python 3 > and it makes BeautifulSoup completely useless for my daily work. This sounds an area where some help might be useful. Perhaps the quickest solution would simply be to copy the old crappy "sgml" based html parser into a new version of BeautifulSoup. Though I imagine what it really needs is a "quirks mode" parser that is compatible with the HTML dialect accepted by, say, IE6. Maybe a summer of code project? -- --Guido van Rossum (python.org/~guido)
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4