Bookmarks Database and Internet Robot
WHAT IS IT
- There is a set of classes, libraries, programs and plugins I use to
-manipulate my bookmarks.html. I like Netscape Navigator, but I need more
-features, so I write and maintain these programs for my needs. I need to
-extend Navigator's "What's new" feature (Navigator 4 calls it "Update
-bookmarks").
+ A set of classes, libraries, programs and plugins I use to manipulate my
+bookmarks.html.
-WHAT'S NEW in version 3.3.1
- New shell scripts in the example area.
+WHAT'S NEW in version 4.5.7
+ Change parse_html to parse strings, not files.
-WHAT'S NEW in version 3.3.0
- Required Python 2.2.
- HTML parser. If the protocol is HTTP, and there is Content-Type header, and
-content type is text/html, the object is parsed to extract its title; if the
-Content-Type header has charset, or if the HTML has <META> with charset, the
-title is converted from the given charset to the default charset. The object is
-also parsed to extract <META> tag with redirect.
+ Split parse_html/__init__.py into __main__.py.
+ Add ChangeLog.
-WHAT'S NEW in version 3.0
- Complete rewrite from scratch. Created mechanism for pluggable storage
-managers, writers (DB dumpers/exporters) and robots.
+WHAT'S NEW in version 4.5.6 (2014-01-14)
+ Remove absolute directory ~/lib to make it portable.
-WHERE TO GET
- Master site: http://phd.pp.ru/Software/Python/#bookmarks_db
- Faster mirrors: http://phd.by.ru/Software/Python/#bookmarks_db
- http://phd2.chat.ru/Software/Python/#bookmarks_db
+WHAT'S NEW in version 4.5.5 (2013-12-05)
+ Parse <meta charset="...">.
-AUTHOR
- Oleg Broytmann <phd@phd.pp.ru>
+WHAT'S NEW in version 4.5.4 (2013-11-23)
+ Published through git/gitweb.
-COPYRIGHT
- Copyright (C) 1997-2002 PhiloSoft Design
-LICENSE
- GPL
+WHAT'S NEW in version 4.5.3 (2013-07-26)
+ Minor tweak in Makefile.
+
+ Switched to git.
+
+
+WHAT'S NEW in version 4.5.2 (2012-09-24)
+ Removed svn:keywords.
+
+ Handle redirects with codes 303 and 307.
+
+ Fixed a bug in handling place: URIs (do not append '//').
-STATUS
- Storage managers: pickle, FLAD (Flat ASCII Database).
- Writers: HTML, text, FLAD (full database or only errors).
- Robots (URL checker): simple, simple+timeoutscoket, forking.
+ Extract html redirect even if ther is no title.
-TODO
- Parse downloaded file and get some additional information out of headers
- and parsed data - title, for example. Or redirects using <META HTTP-Equiv>.
- (Partially done - now extracting title).
- Documentation.
+WHAT'S NEW in version 4.5.1 (2011-12-28).
+ Read/write mozilla-specific date/time format in json storage.
- Merge "writers" to storage managers.
- New storage managers: shelve, SQL, ZODB, MetaKit.
- Robots (URL checkers): threading, asyncore-based.
- Aliases in bookmarks.html.
- Configuration file for configuring defaults - global defaults for the system
- and local defaults for subsystems.
+WHAT'S NEW in version 4.5.0 (2011-12-18).
+ Encode international domain names with IDNA encoding.
- Ruleset-based mechanisms to filter out what types of URLs to check: checking
- based on URL schema, host, port, path, filename, extension, etc.
+ Adapted to different Mozilla 'place' URIs.
- Detailed reports on robot run - what's old, what's new, what was moved,
- errors, etc.
- WWW-interface to the report.
- Bigger database. Multiuser database. Robot should operate on a part of
- the DB.
- WWW-interface to the database. User will import/export/edit bookmarks,
- schedule robot run, etc.
+
+WHERE TO GET
+ Home page: http://phdru.name/Software/Python/#bookmarks_db
+ git clone http://git.phdru.name/bookmarks_db.git
+ git clone git://git.phdru.name/bookmarks_db.git
+
+AUTHOR
+ Oleg Broytman <phd@phdru.name>
+
+COPYRIGHT
+ Copyright (C) 1997-2014 PhiloSoft Design
+
+LICENSE
+ GPL