From 037c50a4a20df82a51375d9fcb075d4ac5add0b8 Mon Sep 17 00:00:00 2001 From: Oleg Broytman Date: Wed, 22 Sep 2004 23:14:41 +0000 Subject: [PATCH] Updated TODO. git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@47 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23 --- doc/TODO | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/doc/TODO b/doc/TODO index 6b4e748..a74684b 100644 --- a/doc/TODO +++ b/doc/TODO @@ -1,4 +1,4 @@ - Cleanup HTML using BeautifulSoap or Tidy. + Cleanup HTML before parsing using BeautifulSoap or Tidy. Parse downloaded file and get javascript redirects. More and better documentation. @@ -7,7 +7,7 @@ New storage managers: shelve, SQL, ZODB, MetaKit. More robots (URL checkers): threading, asyncore-based. - Configuration file for configuring defaults - global defaults for the system + Configuration file to configure defaults - global defaults for the system and local defaults for subsystems. Ruleset-based mechanisms to filter out what types of URLs to check: checking -- 2.39.5