A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://www.mail-archive.com/html5lib-discuss@googlegroups.com/msg00215.html below:

TypeError when serializing some pages to BeautifulSoup

Issue 80: TypeError when serializing some pages to BeautifulSoup
http://code.google.com/p/html5lib/issues/detail?id=80
New issue report by bhawkeslewis:
What steps will reproduce the problem?

Serializing 'http://uk.movies.yahoo.com/cannes/photos/' to BeautifulSoup  
(see attached
testcase).

What is the expected output? What do you see instead?

Expected result: successful parsing.

Actual result: error message:

Traceback (most recent call last):
   File "broken-html5lib.py", line 6, in <module>
     soup          = parser.parse(serialization)
   File "build/bdist.macosx-10.5-i386/egg/html5lib/html5parser.py", line  
155, in parse
   File "build/bdist.macosx-10.5-i386/egg/html5lib/html5parser.py", line  
130, in _parse
   File "build/bdist.macosx-10.5-i386/egg/html5lib/html5parser.py", line  
316, in
processStartTag
   File "build/bdist.macosx-10.5-i386/egg/html5lib/html5parser.py", line  
894, in startTagA
   File "build/bdist.macosx-10.5-i386/egg/html5lib/html5parser.py", line  
1162, in
endTagFormatting
   File "build/bdist.macosx-10.5-i386/egg/html5lib/treebuilders/soup.py",  
line 92, in
removeChild
TypeError: list indices must be integers






Attachments:
        broken-html5lib.py  1.1 KB


Issue attributes:
        Status: New
        Owner: ----
        Labels: Type-Defect Priority-Medium

-- 
You received this message because you are listed in the owner
or CC fields of this issue, or because you starred this issue.
You may adjust your issue notification preferences at:
http://code.google.com/hosting/settings

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"html5lib-discuss" group.
 To post to this group, send email to html5lib-discuss@googlegroups.com
 To unsubscribe from this group, send email to [EMAIL PROTECTED]
 For more options, visit this group at 
http://groups.google.com/group/html5lib-discuss?hl=en-GB
-~----------~----~----~----~------~----~------~--~---


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4