]> git.phdru.name Git - bookmarks_db.git/history - Robots/parse_html_html5.py
Moved lxml-based parser after BeautifulSoup - it doesn't accept charset.
[bookmarks_db.git] / Robots / parse_html_html5.py
2010-08-13 Oleg BroytmanLookup title in html if not found in head.
2010-08-12 Oleg BroytmanTry parser in order until the first one finds a title.
2010-08-12 Oleg BroytmanDo not parse meta charset if there is HTTP charset.
2010-08-11 Oleg BroytmanFixed a bug - check if childNodes not empty.
2010-08-11 Oleg BroytmanAdded HTML Parser based on html5 library.