Yes, parsing real-world (broken) HTML is not an easy task. You could try HTML Tidy, but if you're already using Qt I would advise against it; use what Qt already provides instead. QtWebKit together with QWebElement, which is new in Qt 4.6, will give you a ready-to-use DOM in about 15 minutes.



