On Mar 4, 2009, at 12:32 PM, James Y Knight wrote:
> I think html5lib would be a better candidate for an improved HTML
> parser in the stdlib than BeautifulSoup.

While we're talking about alternatives, Ian Bicking appears to swear by lxml:

<http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/>

Cheers,

--
Ivan Krstić <krstic at solarsail.hcs.harvard.edu> | http://radian.org