A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://www.mail-archive.com/html5lib-discuss@googlegroups.com/msg00454.html below:

Crash when parsing three swedish news sites with beautifulsoup treebuilder

Comment #2 on issue 124 by EmilStenstrom: Crash when parsing three swedish  
news sites with beautifulsoup treebuilder
http://code.google.com/p/html5lib/issues/detail?id=124
Also happens on the latest source chekout, but with an extra  
DataLossWarning:

C:\Program Files  
(x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib
\treebuilders\soup.py:139: DataLossWarning: BeautifulSoup cannot represent  
eleme
nts in any namespace
   warnings.warn("BeautifulSoup cannot represent elements in any namespace",  
Data
LossWarning)
C:\Program Files  
(x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib
\treebuilders\soup.py:161: DataLossWarning: BeautifulSoup cannot represent  
elements
in any namespace
   warnings.warn("BeautifulSoup cannot represent elements in any namespace",  
Data
LossWarning)
Traceback (most recent call last):
   File "C:\Emils\Kod\sammanfatta\html5bug.py", line 11, in <module>
     doc = parser.parse(page)
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 211, in parse
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 111, in _parse
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 179, in  
mainLoop
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 447, in  
processStartTag
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 1041, in  
startTagA
   File "build\bdist.win32\egg\html5lib\html5parser.py", line 1437, in  
endTagFormatting
   File "build\bdist.win32\egg\html5lib\treebuilders\soup.py", line 96, in  
removeChild
TypeError: list indices must be integers

--
You received this message because you are listed in the owner
or CC fields of this issue, or because you starred this issue.
You may adjust your issue notification preferences at:
http://code.google.com/hosting/settings

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"html5lib-discuss" group.
 To post to this group, send email to html5lib-discuss@googlegroups.com
 To unsubscribe from this group, send email to 
html5lib-discuss+unsubscr...@googlegroups.com
 For more options, visit this group at 
http://groups.google.com/group/html5lib-discuss?hl=en-GB
-~----------~----~----~----~------~----~------~--~---


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4