2 Bookmarks Database and Internet Robot
5 There is a set of classes, libraries, programs and plugins I use to
6 manipulate my bookmarks.html. I like Netscape Navigator, but I need more
7 features, so I write and maintain these programs for my needs. I need to
8 extend Navigator's "What's new" feature (Navigator 4 calls it "Update
12 WHAT'S NEW in version 3.3.1
13 New shell scripts in the example area.
16 WHAT'S NEW in version 3.3.0
18 HTML parser. If the protocol is HTTP, and there is Content-Type header, and
19 content type is text/html, the object is parsed to extract its title; if the
20 Content-Type header has charset, or if the HTML has <META> with charset, the
21 title is converted from the given charset to the default charset. The object is
22 also parsed to extract <META> tag with redirect.
25 WHAT'S NEW in version 3.0
26 Complete rewrite from scratch. Created mechanism for pluggable storage
27 managers, writers (DB dumpers/exporters) and robots.
31 Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
33 Faster mirrors: http://phd.by.ru/Software/Python/#bookmarks_db
34 http://phd2.chat.ru/Software/Python/#bookmarks_db
38 Oleg Broytmann <phd@phd.pp.ru>
41 Copyright (C) 1997-2002 PhiloSoft Design
47 Storage managers: pickle, FLAD (Flat ASCII Database).
48 Writers: HTML, text, FLAD (full database or only errors).
49 Robots (URL checker): simple, simple+timeoutscoket, forking.
52 Parse downloaded file and get some additional information out of headers
53 and parsed data - title, for example. Or redirects using <META HTTP-Equiv>.
54 (Partially done - now extracting title).
58 Merge "writers" to storage managers.
59 New storage managers: shelve, SQL, ZODB, MetaKit.
60 Robots (URL checkers): threading, asyncore-based.
61 Aliases in bookmarks.html.
63 Configuration file for configuring defaults - global defaults for the system
64 and local defaults for subsystems.
66 Ruleset-based mechanisms to filter out what types of URLs to check: checking
67 based on URL schema, host, port, path, filename, extension, etc.
69 Detailed reports on robot run - what's old, what's new, what was moved,
71 WWW-interface to the report.
73 Bigger database. Multiuser database. Robot should operate on a part of
75 WWW-interface to the database. User will import/export/edit bookmarks,
76 schedule robot run, etc.