Comment #2 on issue 124 by EmilStenstrom: Crash when parsing three swedish news sites with beautifulsoup treebuilder http://code.google.com/p/html5lib/issues/detail?id=124
Also happens on the latest source chekout, but with an extra DataLossWarning: C:\Program Files (x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib \treebuilders\soup.py:139: DataLossWarning: BeautifulSoup cannot represent eleme nts in any namespace warnings.warn("BeautifulSoup cannot represent elements in any namespace", Data LossWarning) C:\Program Files (x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib \treebuilders\soup.py:161: DataLossWarning: BeautifulSoup cannot represent elements in any namespace warnings.warn("BeautifulSoup cannot represent elements in any namespace", Data LossWarning) Traceback (most recent call last): File "C:\Emils\Kod\sammanfatta\html5bug.py", line 11, in <module> doc = parser.parse(page) File "build\bdist.win32\egg\html5lib\html5parser.py", line 211, in parse File "build\bdist.win32\egg\html5lib\html5parser.py", line 111, in _parse File "build\bdist.win32\egg\html5lib\html5parser.py", line 179, in mainLoop File "build\bdist.win32\egg\html5lib\html5parser.py", line 447, in processStartTag File "build\bdist.win32\egg\html5lib\html5parser.py", line 1041, in startTagA File "build\bdist.win32\egg\html5lib\html5parser.py", line 1437, in endTagFormatting File "build\bdist.win32\egg\html5lib\treebuilders\soup.py", line 96, in removeChild TypeError: list indices must be integers -- You received this message because you are listed in the owner or CC fields of this issue, or because you starred this issue. You may adjust your issue notification preferences at: http://code.google.com/hosting/settings --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "html5lib-discuss" group. To post to this group, send email to html5lib-discuss@googlegroups.com To unsubscribe from this group, send email to html5lib-discuss+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/html5lib-discuss?hl=en-GB -~----------~----~----~----~------~----~------~--~---
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4