]> git.phdru.name Git - bookmarks_db.git/history - Robots
Add ChangeLog
[bookmarks_db.git] / Robots /
2012-09-22 Oleg BroytmanHandle redirect 303
2012-09-21 Oleg BroytmanHandle HTTP Redirect 307.
2012-04-14 Oleg BroytmanRemoved svn:keywords. Extended copyright to 2012.
2011-12-01 Oleg BroytmanMoved robot_simple into run().
2011-01-06 Oleg BroytmanUnindented log texts.
2011-01-06 Oleg BroytmanFixed comment.
2011-01-04 Oleg BroytmanAdded __all__.
2011-01-04 Oleg BroytmanAdded docstrings, __{version,revision,etc}__ boilerplates.
2011-01-03 Oleg BroytmanMoved parse_html.py and its submodules to a separate...
2011-01-03 Oleg BroytmanElementTidy often segfaults.
2011-01-03 Oleg Broytman2011.
2011-01-03 Oleg BroytmanChanged wording.
2011-01-03 Oleg BroytmanGet favicon even if it's of a wrong type.
2011-01-02 Oleg BroytmanGet favicon before HTML redirect (refresh).
2011-01-02 Oleg BroytmanEncode icon's URL from unicode.
2010-10-07 Oleg BroytmanRemove all temporary files with urlcleanup().
2010-10-07 Oleg BroytmanFixed a bug.
2010-10-07 Oleg BroytmanRobots no longer have one global temporary file - there...
2010-08-24 Oleg BroytmanNo need to call .lower() two times.
2010-08-13 Oleg BroytmanFixed encoding.
2010-08-13 Oleg BroytmanFixed a bug.
2010-08-13 Oleg BroytmanTest for completely broken HTML.
2010-08-13 Oleg BroytmanMoved lxml-based parser after BeautifulSoup - it doesn...
2010-08-13 Oleg BroytmanInsert lxml-based parser at the beginning.
2010-08-13 Oleg BroytmanAdded HTML Parser based on lxml.
2010-08-13 Oleg BroytmanLookup title in html if not found in head.
2010-08-13 Oleg BroytmanFixed a bug - moved the code where meta_charset is...
2010-08-13 Oleg BroytmanLookup title in the root if not found in head.
2010-08-12 Oleg BroytmanNicer logging.
2010-08-12 Oleg BroytmanFixed a bug - don't do a double encode.
2010-08-12 Oleg BroytmanTry parser in order until the first one finds a title.
2010-08-12 Oleg BroytmanParser could be None.
2010-08-12 Oleg BroytmanTest if m_lib is available.
2010-08-12 Oleg BroytmanDo not parse meta charset if there is HTTP charset.
2010-08-12 Oleg BroytmanMove charset to the beginning of the list.
2010-08-11 Oleg BroytmanFixed a bug - check if childNodes not empty.
2010-08-11 Oleg BroytmanStore icon's URIs.
2010-08-11 Oleg BroytmanAdded HTML Parser based on html5 library.
2010-08-11 Oleg BroytmanRemoved parse_html_etreetidy - TidyHTMLTreeBuilder...
2010-08-11 Oleg BroytmanAdded HTML Parser based on TidyHTMLTreeBuilder.
2010-08-11 Oleg BroytmanMoved HTMLParser from parse_html_beautifulsoup.py to...
2010-08-11 Oleg BroytmanFixed a bug in case there are more than one Content...
2010-08-11 Oleg BroytmanMore logging.
2010-08-11 Oleg BroytmanSet timeout to 60 seconds.
2010-08-11 Oleg Broytman2010.
2010-08-08 Oleg BroytmanFixed a bug.
2010-08-08 Oleg BroytmanFixed parsing in case of unknown entity.
2010-08-08 Oleg BroytmanFixed a bug.
2010-08-08 Oleg BroytmanFixed a bug - parse "HTTP-Equiv" without content.
2009-09-27 Oleg Broytman"BroytMann" => "Broytman".
2008-06-29 Oleg BroytmanProcess http error 307 as a temporary redirect.
2008-03-09 Oleg BroytmanTitle (and refresh) can be None.
2008-03-07 Oleg BroytmanFixed a misspelled HTML entity.
2008-03-07 Oleg BroytmanSplit the title into subparts, reassemble the subparts...
2008-03-07 Oleg BroytmanLookup TITLE in HEAD, in HTML and in the root; test...
2008-03-07 Oleg BroytmanFixed a misspelling.
2008-03-07 Oleg BroytmanPass charset from the command line.
2008-03-04 Oleg BroytmanExtract charset from "text/html; foo; charset=UTF-8...
2008-03-04 Oleg BroytmanFull name for "IGNORECASE".
2008-03-04 Oleg BroytmanIgnore case for DOCTYPE.
2008-03-04 Oleg BroytmanCheck root.
2008-03-04 Oleg Broytmanapplication/xhtml+xml is HTML, too.
2008-03-04 Oleg BroytmanThere could be more than one semicolon in Content-Type...
2008-03-04 Oleg BroytmanReparse the HTML if the charset was changed.
2008-03-04 Oleg BroytmanI have never saw pages in MacCyriliic.
2008-03-04 Oleg BroytmanReplace ISO-8859-2 to the default encoding.
2008-03-04 Oleg BroytmanDo not log TypeError.
2008-03-03 Oleg BroytmanIn the default hierarchy "root > html > head > title...
2008-03-03 Oleg BroytmanLog more parsers errors.
2008-03-03 Oleg BroytmanAlways log guessed charset even if it's utf-8.
2008-03-03 Oleg BroytmanCharset was guessed if it is not from META and not...
2008-03-03 Oleg BroytmanFixed a bug in case there is no charset in META Content...
2008-03-03 Oleg BroytmanTest meta charset by looking in META HTTP-Equiv.
2008-03-03 Oleg BroytmanCreate the list of charsets outside of the parsers...
2008-02-25 Oleg Broytman  is an entity that needs to be encoded.
2008-02-24 Oleg BroytmanUsed name2codepoint directly; recode it.
2008-02-24 Oleg BroytmanCombined two "if"s.
2008-02-24 Oleg BroytmanDo not unquote standard HTML entities.
2008-02-24 Oleg BroytmanEmulate log.
2008-02-23 Oleg BroytmanFixed a bug - break out of the loop after finding the...
2008-02-23 Oleg BroytmanIt is not HTTP charset, it is guessed charset.
2008-02-23 Oleg BroytmanTry a list of charsets, including the universal (utf...
2008-02-13 Oleg BroytmanStop meddling with cp1252.
2008-02-13 Oleg BroytmanReplace BeautifulSoup's guessed cp1252 with DEFAULT_CHA...
2008-02-12 Oleg Broytmancurrent_charset is only needed in main.
2008-02-11 Oleg BroytmanTimeout 5 minutes is enough (Squid uses 3 minutes).
2008-02-11 Oleg BroytmanRecode entities before num. entities.
2008-02-11 Oleg BroytmanSwitched to utf-8.
2008-02-11 Oleg BroytmanRecode HTML entities.
2008-02-11 Oleg BroytmanPython 2.5 allows try/except/finally in one clause.
2008-02-10 Oleg BroytmanNo need to catch BaseException.
2008-02-10 Oleg Broytmanfp could be None in http_error_default().
2008-02-10 Oleg BroytmanCatch and print all exceptions.
2008-01-09 Oleg BroytmanDo the second check for title only if there is HEAD.
2008-01-08 Oleg BroytmanSome sites put TITLE in HTML outside of HEAD.
2008-01-08 Oleg BroytmanSome sites put TITLE in HTML without HEAD.
2008-01-08 Oleg BroytmanDo not return an empty string - pass it to BSoupParser.
2008-01-08 Oleg BroytmanIf there is HEAD but no TITLE - return empty title.
2007-12-31 Oleg BroytmanFixed a bug created by removing module 'string'.
2007-12-31 Oleg BroytmanCache icons.
next