X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;f=doc%2FTODO;h=8fef568f6f1bd4d630e61e5074bde12ae2679690;hb=4ea75aed268e3d34863a0002a76329202ffe75ec;hp=2bbe37ddef6f54fb674f86b75f4a08fe5f116b28;hpb=e479d3793f93442f6ae002a6127d8c15ae8c5d27;p=bookmarks_db.git diff --git a/doc/TODO b/doc/TODO index 2bbe37d..8fef568 100644 --- a/doc/TODO +++ b/doc/TODO @@ -1,11 +1,11 @@ -Parse - +Configuration file to configure defaults - global defaults for the system +and local defaults for subsystems. -Switch simple robot to urllib2. +Robot based on requests. -A new robot based on PycURL. +Robot based on PycURL. -HTML parser based on BeautifulSoup4. +Robot based on Scrapy. A program to publish bookmarks with icons. @@ -21,8 +21,6 @@ Merge "writers" to storage managers. New storage managers: shelve, SQL, ZODB, MetaKit. More robots (URL checkers): threading, asyncore-based; robots that test many URLs in parallel. -Configuration file to configure defaults - global defaults for the system -and local defaults for subsystems. Ruleset-based mechanisms to filter out what types of URLs to check: checking based on URL schema, host, port, path, filename, extension, etc. @@ -35,3 +33,5 @@ Bigger database. Multiuser database. Robot should operates on a part of the DB. WWW-interface to the database. User should import/export/edit bookmarks, schedule robot run, etc. + +A program to collect and check links from a site.