git.phdru.name Git - bookmarks_db.git/history - Robots
Refactor update script
2014-07-06 Oleg Broytman Change default subprocess robot to urllib2
2014-07-06 Oleg Broytman Handle ftp - get welcome message
2014-07-06 Oleg Broytman Merge bkmk_rurllib_to.py into bkmk_robot_base.py
2014-07-06 Oleg Broytman Handle HTTPException and IOError (socket errors)
2014-07-06 Oleg Broytman Minor refactoring: rename msg to e
2014-07-06 Oleg Broytman Add robot based on urllib2
2014-07-06 Oleg Broytman Don't use urllib._urlopener - it isn't available with...
2014-07-06 Oleg Broytman Minor refactoring
2014-07-04 Oleg Broytman Remove self.cleanup
2014-07-04 Oleg Broytman Minor refactoring
2014-07-04 Oleg Broytman Change default subprocess robot to urllib_to
2014-07-04 Oleg Broytman Pass subproc_* parameters to the subprocess
2014-07-04 Oleg Broytman Allow to set default timeout from parameters
2014-07-04 Oleg Broytman Return redirect code/destination URL
2014-07-04 Oleg Broytman Minor refactoring: reorder return values
2014-07-04 Oleg Broytman Rename urlretrieve to get
2014-07-04 Oleg Broytman Remove Accept-Charset even in case of error
2014-06-29 Oleg Broytman Pass subproc parameter to the subprocess to allow diffe...
2014-06-22 Oleg Broytman Fix simple robot with timeout
2014-06-12 Oleg Broytman Fix comments
2014-06-12 Oleg Broytman Handle HTTP Error 303 redirects
2014-05-31 Oleg Broytman Do not assign icon errors to bookmark.error
2014-05-31 Oleg Broytman Split simple robot
2014-04-30 Oleg Broytman Change parse_html to parse strings, not files
2012-09-22 Oleg Broytman Handle redirect 303
2012-09-21 Oleg Broytman Handle HTTP Redirect 307.
2012-04-14 Oleg Broytman Removed svn:keywords. Extended copyright to 2012.
2011-12-01 Oleg Broytman Moved robot_simple into run().
2011-01-06 Oleg Broytman Unindented log texts.
2011-01-06 Oleg Broytman Fixed comment.
2011-01-04 Oleg Broytman Added __all__.
2011-01-04 Oleg Broytman Added docstrings, __{version,revision,etc}__ boilerplates.
2011-01-03 Oleg Broytman Moved parse_html.py and its submodules to a separate...
2011-01-03 Oleg Broytman ElementTidy often segfaults.
2011-01-03 Oleg Broytman 2011.
2011-01-03 Oleg Broytman Changed wording.
2011-01-03 Oleg Broytman Get favicon even if it's of a wrong type.
2011-01-02 Oleg Broytman Get favicon before HTML redirect (refresh).
2011-01-02 Oleg Broytman Encode icon's URL from unicode.
2010-10-07 Oleg Broytman Remove all temporary files with urlcleanup().
2010-10-07 Oleg Broytman Fixed a bug.
2010-10-07 Oleg Broytman Robots no longer have one global temporary file - there...
2010-08-24 Oleg Broytman No need to call .lower() two times.
2010-08-13 Oleg Broytman Fixed encoding.
2010-08-13 Oleg Broytman Fixed a bug.
2010-08-13 Oleg Broytman Test for completely broken HTML.
2010-08-13 Oleg Broytman Moved lxml-based parser after BeautifulSoup - it doesn...
2010-08-13 Oleg Broytman Insert lxml-based parser at the beginning.
2010-08-13 Oleg Broytman Added HTML Parser based on lxml.
2010-08-13 Oleg Broytman Lookup title in html if not found in head.
2010-08-13 Oleg Broytman Fixed a bug - moved the code where meta_charset is...
2010-08-13 Oleg Broytman Lookup title in the root if not found in head.
2010-08-12 Oleg Broytman Nicer logging.
2010-08-12 Oleg Broytman Fixed a bug - don't do a double encode.
2010-08-12 Oleg Broytman Try parser in order until the first one finds a title.
2010-08-12 Oleg Broytman Parser could be None.
2010-08-12 Oleg Broytman Test if m_lib is available.
2010-08-12 Oleg Broytman Do not parse meta charset if there is HTTP charset.
2010-08-12 Oleg Broytman Move charset to the beginning of the list.
2010-08-11 Oleg Broytman Fixed a bug - check if childNodes not empty.
2010-08-11 Oleg Broytman Store icon's URIs.
2010-08-11 Oleg Broytman Added HTML Parser based on html5 library.
2010-08-11 Oleg Broytman Removed parse_html_etreetidy - TidyHTMLTreeBuilder...
2010-08-11 Oleg Broytman Added HTML Parser based on TidyHTMLTreeBuilder.
2010-08-11 Oleg Broytman Moved HTMLParser from parse_html_beautifulsoup.py to...
2010-08-11 Oleg Broytman Fixed a bug in case there are more than one Content...
2010-08-11 Oleg Broytman More logging.
2010-08-11 Oleg Broytman Set timeout to 60 seconds.
2010-08-11 Oleg Broytman 2010.
2010-08-08 Oleg Broytman Fixed a bug.
2010-08-08 Oleg Broytman Fixed parsing in case of unknown entity.
2010-08-08 Oleg Broytman Fixed a bug.
2010-08-08 Oleg Broytman Fixed a bug - parse "HTTP-Equiv" without content.
2009-09-27 Oleg Broytman "BroytMann" => "Broytman".
2008-06-29 Oleg Broytman Process http error 307 as a temporary redirect.
2008-03-09 Oleg Broytman Title (and refresh) can be None.
2008-03-07 Oleg Broytman Fixed a misspelled HTML entity.
2008-03-07 Oleg Broytman Split the title into subparts, reassemble the subparts...
2008-03-07 Oleg Broytman Lookup TITLE in HEAD, in HTML and in the root; test...
2008-03-07 Oleg Broytman Fixed a misspelling.
2008-03-07 Oleg Broytman Pass charset from the command line.
2008-03-04 Oleg Broytman Extract charset from "text/html; foo; charset=UTF-8...
2008-03-04 Oleg Broytman Full name for "IGNORECASE".
2008-03-04 Oleg Broytman Ignore case for DOCTYPE.
2008-03-04 Oleg Broytman Check root.
2008-03-04 Oleg Broytman application/xhtml+xml is HTML, too.
2008-03-04 Oleg Broytman There could be more than one semicolon in Content-Type...
2008-03-04 Oleg Broytman Reparse the HTML if the charset was changed.
2008-03-04 Oleg Broytman I have never saw pages in MacCyriliic.
2008-03-04 Oleg Broytman Replace ISO-8859-2 to the default encoding.
2008-03-04 Oleg Broytman Do not log TypeError.
2008-03-03 Oleg Broytman In the default hierarchy "root > html > head > title...
2008-03-03 Oleg Broytman Log more parsers errors.
2008-03-03 Oleg Broytman Always log guessed charset even if it's utf-8.
2008-03-03 Oleg Broytman Charset was guessed if it is not from META and not...
2008-03-03 Oleg Broytman Fixed a bug in case there is no charset in META Content...
2008-03-03 Oleg Broytman Test meta charset by looking in META HTTP-Equiv.
2008-03-03 Oleg Broytman Create the list of charsets outside of the parsers...
2008-02-25 Oleg Broytman &nbsp; is an entity that needs to be encoded.
2008-02-24 Oleg Broytman Used name2codepoint directly; recode it.