A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://mail.python.org/pipermail/python-dev/2009-March/086583.html below:

[Python-Dev] Integrate BeautifulSoup into stdlib?

[Python-Dev] Integrate BeautifulSoup into stdlib? [Python-Dev] Integrate BeautifulSoup into stdlib?Vaibhav Mallya vaibhavmallya at gmail.com
Mon Mar 2 13:23:37 CET 2009
I haven't seen a lot of discussion on this - maybe I didn't search hard 
enough - but  what are people's thoughts on including BeautifulSoup in 
stdlib? It's small, fast, and pretty widely-liked by the people who know 
about it. Someone mentioned that web scraping needs are infrequent. My 
argument is that people ask questions about them less because they feel 
they can just reinvent the wheel really easily using urllib and regexes. 
It seems like this is similar to the CSV problem from a while back 
actually, with everyone implementing their own parsers.

We do have HTMLParser, but that doesn't handle malformed pages well, and 
just isn't as nice as BeautifulSoup.

In a not-entirely-unrelated vein, has there been any discussion on just 
throwing all of Mechanize into stdlib?

BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/
mechanize: http://wwwsearch.sourceforge.net/mechanize/

Regards,
Vaibhav Mallya
More information about the Python-Dev mailing list

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4