X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;f=doc%2FTODO;h=deb79cab111efc8b36cdb6fa0fe12560c1c461ab;hb=11632b7623b2b2e01995f013bc6d8ba01c20cf74;hp=72008993c548b6ea1747e9f91127e25c25095b20;hpb=7c67f205b840eae593185f531d6d5729be3360cb;p=bookmarks_db.git

diff --git a/doc/TODO b/doc/TODO
index 7200899..deb79ca 100644
--- a/doc/TODO
+++ b/doc/TODO
@@ -1,8 +1,15 @@
-Robot based on urllib2: handle ftp (welcome message).
+Replace subproc.py with some IPC. Or update for Python 3.
 
-A new robot based on PycURL.
+Python 3.
 
-HTML parser based on BeautifulSoup4.
+Forbid external names to resolve to internal addresses (127.0.0.1, etc).
+
+Configuration file to configure defaults - global defaults for the system
+and local defaults for subsystems.
+
+Robot based on PycURL.
+
+Robot based on Scrapy.
 
 A program to publish bookmarks with icons.
 
@@ -18,8 +25,6 @@ Merge "writers" to storage managers.
 New storage managers: shelve, SQL, ZODB, MetaKit.
 More robots (URL checkers): threading, asyncore-based;
 robots that test many URLs in parallel.
-Configuration file to configure defaults - global defaults for the system
-and local defaults for subsystems.
 
 Ruleset-based mechanisms to filter out what types of URLs to check: checking
 based on URL schema, host, port, path, filename, extension, etc.
@@ -32,3 +37,5 @@ Bigger database. Multiuser database.
 Robot should operates on a part of the DB.
 WWW-interface to the database. User should import/export/edit bookmarks,
 schedule robot run, etc.
+
+A program to collect and check links from a site.