]> git.phdru.name Git - bookmarks_db.git/history - parse_html
Fix(Robot): Stop splitting and un-splitting URLs
[bookmarks_db.git] / parse_html /
2024-03-03 Oleg BroytmanFix(parse_html/bkmk_ph_beautifulsoup4): Find "shortcut...
2023-11-20 Oleg BroytmanFix(Py3): `html.parser` cannot parse bytes
2023-11-20 Oleg BroytmanFix(parse_html CLI): Report encodings and the title
2023-11-20 Oleg BroytmanFix(parse_html/bkmk_parse_html.py): Open the file with...
2023-11-20 Oleg BroytmanFix(parse_html/bkmk_ph_beautifulsoup4): Fix title recom...
2023-11-20 Oleg BroytmanFix(Py3): Remove forgotten `.decode()`/`.encode()`
2023-11-20 Oleg BroytmanFeat: Remove some HTML parsers
2023-11-17 Oleg BroytmanFix(parse_html): Do not parse empty strings
2023-11-17 Oleg BroytmanFix(parse_html): Do not parse empty strings
2023-11-15 Oleg BroytmanFix(Py3): Fix HTML parsers
2023-11-13 Oleg BroytmanChore: Explicitly open text files in text mode
2023-11-12 Oleg BroytmanFix(Py3): Fix `htmlentities` import
2023-11-12 Oleg BroytmanFix(Py3): Stop encoding unicode to bytes
2023-11-12 Oleg BroytmanFix(parse_html): Fix import
2023-11-12 Oleg BroytmanStyle: Fix flake8 W605 invalid escape sequence
2023-11-12 Oleg BroytmanStyle: Fix flake8 F821 undefined name 'unichr'
2023-11-12 Oleg BroytmanFix(Py3): Fix `basestring` compatibility
2023-11-12 Oleg BroytmanFix(parse_html/bkmk_parse_html): Fix imports
2023-11-12 Oleg BroytmanFix(Py3): Fix `unicode` compatibility
2023-11-12 Oleg BroytmanFix(Py3): Replace `unicode()` with `.decode()`
2023-11-12 Oleg BroytmanStyle: Fix flake8 F401 module imported but unused
2023-11-12 Oleg BroytmanStyle: Fix flake8 E741 ambiguous variable name 'l'
2023-11-12 Oleg BroytmanStyle: Fix flake8 E501 line too long
2023-11-12 Oleg BroytmanStyle: Fix flake8 E402 module level import not at top...
2023-11-12 Oleg BroytmanStyle: Fix flake8 E305 expected 2 blank lines, found 1
2023-11-12 Oleg BroytmanStyle: Fix flake8 E302 expected 2 blank lines, found 1
2023-11-12 Oleg BroytmanStyle: Fix flake8 E265 block comment should start with...
2023-11-12 Oleg BroytmanStyle: Fix flake8 E261 at least two spaces before inlin...
2023-11-12 Oleg BroytmanStyle: Silent flake8 E227 missing whitespace around...
2021-05-23 Oleg BroytmanStyle: Fix `flake8` E114
2017-08-13 Oleg BroytmanFix(parse_html): encode title to utf-8 as the last...
2017-08-13 Oleg BroytmanFix(parse_html): Fix BS4 parser: encode title to utf...
2017-06-25 Oleg BroytmanHTML parser based on BeautifulSoup4
2017-06-16 Oleg BroytmanAdd script parse_html/count-parsers
2017-05-13 Oleg BroytmanCleanup code: use 4 spaces
2017-05-13 Oleg BroytmanFeat(Python3): `<>` -> `!=`
2017-05-13 Oleg BroytmanFeat(Python3): Use print() function
2015-12-01 Oleg BroytmanFix a bug: html_tree may have no childNodes
2014-04-30 Oleg BroytmanSplit parse_html/__init__.py into __main__.py
2014-04-30 Oleg BroytmanChange parse_html to parse strings, not files
2013-12-04 Oleg BroytmanParse <meta charset="...">
2012-09-23 Oleg BroytmanExtract html redirect even if ther is no title
2012-04-14 Oleg BroytmanRemoved svn:keywords. Extended copyright to 2012.
2011-12-18 Oleg BroytmanChanged the order or parser according to their success...
2011-12-11 Oleg BroytmanDebug HTML parsers.
2011-04-14 Oleg BroytmanFixed a bug - restored DEFAULT_CHARSET.
2011-01-06 Oleg BroytmanFixed grammar.
2011-01-06 Oleg BroytmanExport main. Import universal_charset only in main.
2011-01-06 Oleg BroytmanFixed logging.
2011-01-06 Oleg BroytmanAdded code to collect statistics on parsers;
2011-01-05 Oleg BroytmanRenamed parse_html modules to bkmk_ph_* to avoid name...
2011-01-04 Oleg BroytmanReport what parser is in use.
2011-01-04 Oleg BroytmanAdded __all__.
2011-01-04 Oleg BroytmanAdded docstrings, __{version,revision,etc}__ boilerplates.
2011-01-03 Oleg BroytmanMoved parse_html.py and its submodules to a separate...