X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;ds=sidebyside;f=doc%2FANNOUNCE;h=fffcad90ce0fa81e23250abc105b8a5784f95fbf;hb=35deb8415d80e17370776581187838ce00f510ca;hp=b4974dc416246b7f04552ba20922adffdede4956;hpb=fb5c3b2b91aeeb615d6d6d890491af3fdff69556;p=bookmarks_db.git

diff --git a/doc/ANNOUNCE b/doc/ANNOUNCE
index b4974dc..fffcad9 100644
--- a/doc/ANNOUNCE
+++ b/doc/ANNOUNCE
@@ -2,75 +2,49 @@
                 Bookmarks Database and Internet Robot
 
 WHAT IS IT
-   There is a set of classes, libraries, programs and plugins I use to
-manipulate my bookmarks.html. I like Netscape Navigator, but I need more
-features, so I write and maintain these programs for my needs. I need to
-extend Navigator's "What's new" feature (Navigator 4 calls it "Update
+   A set of classes, libraries, programs and plugins I use to manipulate my
+bookmarks.html. I like Mozilla, but I need more features. I want to extend
+Mozilla's "Check for updates" feature (Navigator4 called it "Update
 bookmarks").
 
 
-WHAT'S NEW in version 3.3.1
-   New shell scripts in the example area.
+WHAT'S NEW in version 4.1.0 (2008-01-14)
+   Parser for HTML based on BeautifulSoup.
 
+   Changed User-agent header: I saw a number of sites that forbid
+   "Mozilla compatible" browsers. Added a number of fake headers to pretend
+   this is a real web-browser - there are still stupid sites
+   that are trying to protect themselves from robots by analyzing headers.
 
-WHAT'S NEW in version 3.3.0
-   Required Python 2.2.
-   HTML parser. If the protocol is HTTP, and there is Content-Type header, and
-content type is text/html, the object is parsed to extract its title; if the
-Content-Type header has charset, or if the HTML has <META> with charset, the
-title is converted from the given charset to the default charset. The object is
-also parsed to extract <META> tag with redirect.
+   Handle redirects while looking for the icon.
+   Handle float timeouts in HTML redirects.
 
 
-WHAT'S NEW in version 3.0
-   Complete rewrite from scratch. Created mechanism for pluggable storage
-managers, writers (DB dumpers/exporters) and robots.
+   Minimal required version of Python is 2.5 now.
 
 
-WHERE TO GET
-   Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
+WHAT'S NEW in version 4.0.0 (2007-10-20)
+   Extended support for Mozilla: charset and icon in bookmarks.
+   Use the charset to add Accept-Charset header.
+   Retrieve favicon.ico (or whatever <link> points to) and store it.
+
+   The project celebrates 10th anniversary!
 
-   Faster mirrors: http://phd.by.ru/Software/Python/#bookmarks_db
-                   http://phd2.chat.ru/Software/Python/#bookmarks_db
+
+WHAT'S NEW in version 3.4.1 (2005-01-29)
+   Updated to Python 2.4. Switched from CVS to Subversion.
+
+
+WHERE TO GET
+   Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
+   A mirror: http://phd.webhost.ru/Software/Python/#bookmarks_db
 
 
 AUTHOR
    Oleg Broytmann
 
 COPYRIGHT
-   Copyright (C) 1997-2002 PhiloSoft Design
+   Copyright (C) 1997-2007 PhiloSoft Design
 
 LICENSE
    GPL
-
-STATUS
-   Storage managers: pickle, FLAD (Flat ASCII Database).
-   Writers: HTML, text, FLAD (full database or only errors).
-   Robots (URL checker): simple, simple+timeoutscoket, forking.
-
-TODO
-   Parse downloaded file and get some additional information out of headers
-   and parsed data - title, for example. Or redirects using <META>.
-   (Partially done - now extracting title).
-
-   Documentation.
-
-   Merge "writers" to storage managers.
-   New storage managers: shelve, SQL, ZODB, MetaKit.
-   Robots (URL checkers): threading, asyncore-based.
-   Aliases in bookmarks.html.
-
-   Configuration file for configuring defaults - global defaults for the system
-   and local defaults for subsystems.
-
-   Ruleset-based mechanisms to filter out what types of URLs to check: checking
-   based on URL schema, host, port, path, filename, extension, etc.
-
-   Detailed reports on robot run - what's old, what's new, what was moved,
-   errors, etc.
-   WWW-interface to the report.
-
-   Bigger database. Multiuser database. Robot should operate on a part of
-   the DB.
-   WWW-interface to the database. User will import/export/edit bookmarks,
-   schedule robot run, etc.