X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;f=doc%2FANNOUNCE;h=d970b0ed6d86ad5cc669c4dd351aae0f28156a03;hb=65f5fb4ab18a3386b9fe1ffaad10f4a23c4410c0;hp=5160ef7106d0c55093bb581ceadf0e15125d979b;hpb=387f77d110986aa12967c9cd788ab0e4f41f2be2;p=bookmarks_db.git
diff --git a/doc/ANNOUNCE b/doc/ANNOUNCE
index 5160ef7..d970b0e 100644
--- a/doc/ANNOUNCE
+++ b/doc/ANNOUNCE
@@ -2,79 +2,62 @@
Bookmarks Database and Internet Robot
WHAT IS IT
- There is a set of classes, libraries, programs and plugins I use to
-manipulate my bookmarks.html. I like Netscape Navigator, but I need more
-features, so I write and maintain these programs for my needs. I need to
-extend Navigator's "What's new" feature (Navigator 4 calls it "Update
+ A set of classes, libraries, programs and plugins I use to manipulate my
+bookmarks.html. I like Mozilla, but I need more features. I want to extend
+Mozilla's "Check for updates" feature (Navigator4 called it "Update
bookmarks").
+WHAT'S NEW in version 4.1.2
-WHAT'S NEW in version 3.4.0
- Updated to m_lib version 1.2. Extended support for Mozilla.
+ Process http error 307 as a temporary redirect.
-WHAT'S NEW in version 3.3.2
- parse_html.py can now recode unicode entities in titles.
+WHAT'S NEW in version 4.1.1 (2008-03-10)
+ Catch and report all errors.
+ Consider application/xhtml+xml as HTML.
-WHAT'S NEW in version 3.3.0
- Required Python 2.2.
- HTML parser. If the protocol is HTTP, and there is Content-Type header, and
-content type is text/html, the object is parsed to extract its title; if the
-Content-Type header has charset, or if the HTML has with charset, the
-title is converted from the given charset to the default charset. The object is
-also parsed to extract tag with redirect.
+ Better handling of exceptions while looking up the icon.
+ Recode HTML entities.
-WHAT'S NEW in version 3.0
- Complete rewrite from scratch. Created mechanism for pluggable storage
-managers, writers (DB dumpers/exporters) and robots.
+ Always use utf-8 as the default encoding.
-WHERE TO GET
- Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
+WHAT'S NEW in version 4.1.0 (2008-01-14)
+ Parser for HTML based on BeautifulSoup.
- Faster mirrors: http://phd.by.ru/Software/Python/#bookmarks_db
- http://phd2.chat.ru/Software/Python/#bookmarks_db
+ Changed User-agent header: I saw a number of sites that forbid
+ "Mozilla compatible" browsers. Added a number of fake headers to pretend
+ this is a real web-browser - there are still stupid sites
+ that are trying to protect themselves from robots by analyzing headers.
+ Handle redirects while looking for the icon.
-AUTHOR
- Oleg Broytmann
+ Handle float timeouts in HTML redirects.
-COPYRIGHT
- Copyright (C) 1997-2002 PhiloSoft Design
+ Minimal required version of Python is 2.5 now.
-LICENSE
- GPL
-STATUS
- Storage managers: pickle, FLAD (Flat ASCII Database).
- Writers: HTML, text, FLAD (full database or only errors).
- Robots (URL checker): simple, simple+timeoutscoket, forking.
+WHAT'S NEW in version 4.0.0 (2007-10-20)
+ Extended support for Mozilla: charset and icon in bookmarks.
+ Use the charset to add Accept-Charset header.
+ Retrieve favicon.ico (or whatever points to) and store it.
-TODO
- Parse downloaded file and get some additional information out of headers
- and parsed data - title, for example. Or redirects using .
- (Partially done - now extracting title).
+ The project celebrates 10th anniversary!
- Documentation.
- Merge "writers" to storage managers.
- New storage managers: shelve, SQL, ZODB, MetaKit.
- Robots (URL checkers): threading, asyncore-based.
- Aliases in bookmarks.html.
+WHERE TO GET
+ Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
+ Mirrors: http://phd.webhost.ru/Software/Python/#bookmarks_db
+ http://phd.by.ru/Software/Python/#bookmarks_db
- Configuration file for configuring defaults - global defaults for the system
- and local defaults for subsystems.
- Ruleset-based mechanisms to filter out what types of URLs to check: checking
- based on URL schema, host, port, path, filename, extension, etc.
+AUTHOR
+ Oleg Broytmann
- Detailed reports on robot run - what's old, what's new, what was moved,
- errors, etc.
- WWW-interface to the report.
+COPYRIGHT
+ Copyright (C) 1997-2008 PhiloSoft Design
- Bigger database. Multiuser database. Robot should operate on a part of
- the DB.
- WWW-interface to the database. User will import/export/edit bookmarks,
- schedule robot run, etc.
+LICENSE
+ GPL