burakkeskin / phpquery

Automatically exported from code.google.com/p/phpquery
0 stars 0 forks source link

Wrong parsed HTML from HTML cleared by Tidy #229

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
Accidentally found a pattern in which losted part of HTML - a whole table with 
id = dgBldgs!

What steps will reproduce the problem?
1. Clean HTML using Tidy
2. Create phpQuery object from this HTML
3. Use html() function to get whole parsed HTML (in which will no table) or use 
 find('#dgBldgs') function to get table (no table too)

System:
Windows NT 6.1 build 7601 (Windows 7x64 Ultimate Edition Service Pack 1)
XAMPP 1.7.4 
PHP: 5.4.8
Tidy: 2.0,  25 March 2009

Attached files: 
checktidy.php - code sample
test.html - original HTML with table
test-after-phpQuery.html - parsed HTML after phpQuery with table
test-after-tidy.html - cleaned HTML after Tidy with table
test-after-tidy-phpQuery.html - wrong HTML after Tidy+phpQuery without table

Original issue reported on code.google.com by vovan.gr...@gmail.com on 16 May 2013 at 4:50

Attachments: