2023-11-12 |
Oleg Broytman | Style: Fix flake8 E261 at least two spaces before inlin... |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Fix flake8 E231 missing whitespace after ',' |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Silent flake8 E221 multiple spaces before operator |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Fix flake8 E131 continuation line unaligned... |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Fix flake8 E128 continuation line under-indented... |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Fix flake8 E127 continuation line over-indented... |
tree | commitdiff |
2023-11-12 |
Oleg Broytman | Style: Fix flake8 warning E116 unexpected indentation... |
tree | commitdiff |
2021-05-23 |
Oleg Broytman | Style: Fix `flake8` E114 |
tree | commitdiff |
2017-05-13 |
Oleg Broytman | Cleanup code: use 4 spaces |
tree | commitdiff |
2017-05-13 |
Oleg Broytman | Feat(Python3): `except Error, value` -> `except Error... |
tree | commitdiff |
2017-05-13 |
Oleg Broytman | Feat(Python3): `raise Error, value` -> `raise Error... |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Change default subprocess robot to urllib2 |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Handle ftp - get welcome message |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Merge bkmk_rurllib_to.py into bkmk_robot_base.py |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Handle HTTPException and IOError (socket errors) |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Minor refactoring: rename msg to e |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Add robot based on urllib2 |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Don't use urllib._urlopener - it isn't available with... |
tree | commitdiff |
2014-07-06 |
Oleg Broytman | Minor refactoring |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Remove self.cleanup |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Minor refactoring |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Change default subprocess robot to urllib_to |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Pass subproc_* parameters to the subprocess |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Allow to set default timeout from parameters |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Return redirect code/destination URL |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Minor refactoring: reorder return values |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Rename urlretrieve to get |
tree | commitdiff |
2014-07-04 |
Oleg Broytman | Remove Accept-Charset even in case of error |
tree | commitdiff |
2014-06-29 |
Oleg Broytman | Pass subproc parameter to the subprocess to allow diffe... |
tree | commitdiff |
2014-06-22 |
Oleg Broytman | Fix simple robot with timeout |
tree | commitdiff |
2014-06-12 |
Oleg Broytman | Fix comments |
tree | commitdiff |
2014-06-12 |
Oleg Broytman | Handle HTTP Error 303 redirects |
tree | commitdiff |
2014-05-31 |
Oleg Broytman | Do not assign icon errors to bookmark.error |
tree | commitdiff |
2014-05-31 |
Oleg Broytman | Split simple robot |
tree | commitdiff |
2014-04-30 |
Oleg Broytman | Change parse_html to parse strings, not files |
tree | commitdiff |
2012-09-22 |
Oleg Broytman | Handle redirect 303 |
tree | commitdiff |
2012-09-21 |
Oleg Broytman | Handle HTTP Redirect 307. |
tree | commitdiff |
2012-04-14 |
Oleg Broytman | Removed svn:keywords. Extended copyright to 2012. |
tree | commitdiff |
2011-12-01 |
Oleg Broytman | Moved robot_simple into run(). |
tree | commitdiff |
2011-01-06 |
Oleg Broytman | Unindented log texts. |
tree | commitdiff |
2011-01-06 |
Oleg Broytman | Fixed comment. |
tree | commitdiff |
2011-01-04 |
Oleg Broytman | Added __all__. |
tree | commitdiff |
2011-01-04 |
Oleg Broytman | Added docstrings, __{version,revision,etc}__ boilerplates. |
tree | commitdiff |
2011-01-03 |
Oleg Broytman | Moved parse_html.py and its submodules to a separate... |
tree | commitdiff |
2011-01-03 |
Oleg Broytman | ElementTidy often segfaults. |
tree | commitdiff |
2011-01-03 |
Oleg Broytman | 2011. |
tree | commitdiff |
2011-01-03 |
Oleg Broytman | Changed wording. |
tree | commitdiff |
2011-01-03 |
Oleg Broytman | Get favicon even if it's of a wrong type. |
tree | commitdiff |
2011-01-02 |
Oleg Broytman | Get favicon before HTML redirect (refresh). |
tree | commitdiff |
2011-01-02 |
Oleg Broytman | Encode icon's URL from unicode. |
tree | commitdiff |
2010-10-07 |
Oleg Broytman | Remove all temporary files with urlcleanup(). |
tree | commitdiff |
2010-10-07 |
Oleg Broytman | Fixed a bug. |
tree | commitdiff |
2010-10-07 |
Oleg Broytman | Robots no longer have one global temporary file - there... |
tree | commitdiff |
2010-08-24 |
Oleg Broytman | No need to call .lower() two times. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Fixed encoding. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Fixed a bug. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Test for completely broken HTML. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Moved lxml-based parser after BeautifulSoup - it doesn... |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Insert lxml-based parser at the beginning. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Added HTML Parser based on lxml. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Lookup title in html if not found in head. |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Fixed a bug - moved the code where meta_charset is... |
tree | commitdiff |
2010-08-13 |
Oleg Broytman | Lookup title in the root if not found in head. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Nicer logging. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Fixed a bug - don't do a double encode. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Try parser in order until the first one finds a title. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Parser could be None. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Test if m_lib is available. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Do not parse meta charset if there is HTTP charset. |
tree | commitdiff |
2010-08-12 |
Oleg Broytman | Move charset to the beginning of the list. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Fixed a bug - check if childNodes not empty. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Store icon's URIs. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Added HTML Parser based on html5 library. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Removed parse_html_etreetidy - TidyHTMLTreeBuilder... |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Added HTML Parser based on TidyHTMLTreeBuilder. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Moved HTMLParser from parse_html_beautifulsoup.py to... |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Fixed a bug in case there are more than one Content... |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | More logging. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | Set timeout to 60 seconds. |
tree | commitdiff |
2010-08-11 |
Oleg Broytman | 2010. |
tree | commitdiff |
2010-08-08 |
Oleg Broytman | Fixed a bug. |
tree | commitdiff |
2010-08-08 |
Oleg Broytman | Fixed parsing in case of unknown entity. |
tree | commitdiff |
2010-08-08 |
Oleg Broytman | Fixed a bug. |
tree | commitdiff |
2010-08-08 |
Oleg Broytman | Fixed a bug - parse "HTTP-Equiv" without content. |
tree | commitdiff |
2009-09-27 |
Oleg Broytman | "BroytMann" => "Broytman". |
tree | commitdiff |
2008-06-29 |
Oleg Broytman | Process http error 307 as a temporary redirect. |
tree | commitdiff |
2008-03-09 |
Oleg Broytman | Title (and refresh) can be None. |
tree | commitdiff |
2008-03-07 |
Oleg Broytman | Fixed a misspelled HTML entity. |
tree | commitdiff |
2008-03-07 |
Oleg Broytman | Split the title into subparts, reassemble the subparts... |
tree | commitdiff |
2008-03-07 |
Oleg Broytman | Lookup TITLE in HEAD, in HTML and in the root; test... |
tree | commitdiff |
2008-03-07 |
Oleg Broytman | Fixed a misspelling. |
tree | commitdiff |
2008-03-07 |
Oleg Broytman | Pass charset from the command line. |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | Extract charset from "text/html; foo; charset=UTF-8... |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | Full name for "IGNORECASE". |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | Ignore case for DOCTYPE. |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | Check root. |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | application/xhtml+xml is HTML, too. |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | There could be more than one semicolon in Content-Type... |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | Reparse the HTML if the charset was changed. |
tree | commitdiff |
2008-03-04 |
Oleg Broytman | I have never saw pages in MacCyriliic. |
tree | commitdiff |
next |