]> git.phdru.name Git - bookmarks_db.git/history - parse_html
Feat(subproc.py): Get rid of module string
[bookmarks_db.git] / parse_html /
2017-08-13 Oleg BroytmanFix(parse_html): encode title to utf-8 as the last...
2017-08-13 Oleg BroytmanFix(parse_html): Fix BS4 parser: encode title to utf...
2017-06-25 Oleg BroytmanHTML parser based on BeautifulSoup4
2017-06-16 Oleg BroytmanAdd script parse_html/count-parsers
2017-05-13 Oleg BroytmanCleanup code: use 4 spaces
2017-05-13 Oleg BroytmanFeat(Python3): `<>` -> `!=`
2017-05-13 Oleg BroytmanFeat(Python3): Use print() function
2015-12-01 Oleg BroytmanFix a bug: html_tree may have no childNodes
2014-04-30 Oleg BroytmanSplit parse_html/__init__.py into __main__.py
2014-04-30 Oleg BroytmanChange parse_html to parse strings, not files
2013-12-04 Oleg BroytmanParse <meta charset="...">
2012-09-23 Oleg BroytmanExtract html redirect even if ther is no title
2012-04-14 Oleg BroytmanRemoved svn:keywords. Extended copyright to 2012.
2011-12-18 Oleg BroytmanChanged the order or parser according to their success...
2011-12-11 Oleg BroytmanDebug HTML parsers.
2011-04-14 Oleg BroytmanFixed a bug - restored DEFAULT_CHARSET.
2011-01-06 Oleg BroytmanFixed grammar.
2011-01-06 Oleg BroytmanExport main. Import universal_charset only in main.
2011-01-06 Oleg BroytmanFixed logging.
2011-01-06 Oleg BroytmanAdded code to collect statistics on parsers;
2011-01-05 Oleg BroytmanRenamed parse_html modules to bkmk_ph_* to avoid name...
2011-01-04 Oleg BroytmanReport what parser is in use.
2011-01-04 Oleg BroytmanAdded __all__.
2011-01-04 Oleg BroytmanAdded docstrings, __{version,revision,etc}__ boilerplates.
2011-01-03 Oleg BroytmanMoved parse_html.py and its submodules to a separate...