X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;f=doc%2FANNOUNCE;h=3a63157ec356d2d1c426ae4246865925093e6ab2;hb=8140acc7570a4cbd5673a7a74009bfe5caa41816;hp=3dcc08f3441b186b2c3ca95aa4055202d9c0172d;hpb=634e9adc02b0697c9731e127288663ec37dac561;p=bookmarks_db.git

diff --git a/doc/ANNOUNCE b/doc/ANNOUNCE
index 3dcc08f..3a63157 100644
--- a/doc/ANNOUNCE
+++ b/doc/ANNOUNCE
@@ -2,75 +2,47 @@ Bookmarks Database and Internet Robot
 WHAT IS IT
-   There is a set of classes, libraries, programs and plugins I use to
-manipulate my bookmarks.html. I like Netscape Navigator, but I need more
-features, so I write and maintain these programs for my needs. I need to
-extend Navigator's "What's new" feature (Navigator 4 calls it "Update
-bookmarks").
+   A set of classes, libraries, programs and plugins I use to manipulate my
+bookmarks.html.
+WHAT'S NEW in version 4.4.0 (2011-01-07).

-WHAT'S NEW in version 3.3.2
-   parse_html.py can now recode unicode entities in titles.
+Moved BeautifulSoup.py and subproc.py from Robots/ to the top-level directory.
+Moved parse_html.py and its submodules to a separate parse_html package.

-WHAT'S NEW in version 3.3.0
-   Required Python 2.2.
-   HTML parser. If the protocol is HTTP, and there is Content-Type header, and
-content type is text/html, the object is parsed to extract its title; if the
-Content-Type header has charset, or if the HTML has <META> with charset, the
-title is converted from the given charset to the default charset. The object is
-also parsed to extract <META> tag with redirect.
+Added statistics code to parse_html, gathered statistics on parser
+success/failure rate, reordered parsers.
+Removed old cruft.

-WHAT'S NEW in version 3.0
-   Complete rewrite from scratch. Created mechanism for pluggable storage
-managers, writers (DB dumpers/exporters) and robots.
+WHAT'S NEW in version 4.3.1 (2011-01-03).

-WHERE TO GET
-   Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
-
-   Faster mirrors: http://phd.by.ru/Software/Python/#bookmarks_db
-                   http://phd2.chat.ru/Software/Python/#bookmarks_db
-
-
-AUTHOR
-   Oleg Broytmann
-
-COPYRIGHT
-   Copyright (C) 1997-2002 PhiloSoft Design
+Get favicon before HTML redirect (refresh).

-LICENSE
-   GPL
+Get favicon even if it's of a wrong type; many sites return favicon as
+text/plain or application/*; the only exception is text/html which is usually
+an error page instead of error 404.

-STATUS
-   Storage managers: pickle, FLAD (Flat ASCII Database).
-   Writers: HTML, text, FLAD (full database or only errors).
-   Robots (URL checker): simple, simple+timeoutscoket, forking.
-TODO
-   Parse downloaded file and get some additional information out of headers
-   and parsed data - title, for example. Or redirects using <META>.
-   (Partially done - now extracting title).
+WHAT'S NEW in version 4.3.0 (2011-01-01).

-   Documentation.
+Robots no longer have one global temporary file - there are at least two
+(html and favicon), and in the future there will be more for
+asynchronous robot(s) that will test many URLs in parallel.

-   Merge "writers" to storage managers.
-   New storage managers: shelve, SQL, ZODB, MetaKit.
-   Robots (URL checkers): threading, asyncore-based.
-   Aliases in bookmarks.html.
-   Configuration file for configuring defaults - global defaults for the system
-   and local defaults for subsystems.
+WHERE TO GET
+   Master site: http://phdru.name/Software/Python/#bookmarks_db
+   Mirrors: http://phd.webhost.ru/Software/Python/#bookmarks_db
+            http://phd.by.ru/Software/Python/#bookmarks_db

-   Ruleset-based mechanisms to filter out what types of URLs to check: checking
-   based on URL schema, host, port, path, filename, extension, etc.
+AUTHOR
+   Oleg Broytman

-   Detailed reports on robot run - what's old, what's new, what was moved,
-   errors, etc.
-   WWW-interface to the report.
+COPYRIGHT
+   Copyright (C) 1997-2011 PhiloSoft Design

-   Bigger database. Multiuser database. Robot should operate on a part of
-   the DB.
-   WWW-interface to the database. User will import/export/edit bookmarks,
-   schedule robot run, etc.
+LICENSE
+   GPL