X-Git-Url: https://git.phdru.name/?a=blobdiff_plain;f=doc%2FTODO;h=164aa33fb2124479e19f5863c6808634c390f3c0;hb=ca111b9044c2b413f6dd84a686a544e058d3550b;hp=f7210018a32f5433f1b7fc3c846b800a095bc530;hpb=c067541ef9e42e33742ad90e1a51c9aea2bcdc51;p=bookmarks_db.git diff --git a/doc/TODO b/doc/TODO index f721001..164aa33 100644 --- a/doc/TODO +++ b/doc/TODO @@ -1,14 +1,24 @@ -Cleanup HTML before parsing using BeautifulSoap or Tidy. +Configuration file to configure defaults - global defaults for the system +and local defaults for subsystems. + +A new robot based on PycURL. + +HTML parser based on BeautifulSoup4. + +A program to publish bookmarks with icons. + +Fetch description from and store it in +bookmark.description if the description is empty. (How to update old +descriptions without replacing my own comments?) + Parse (or interpret) downloaded file and get javascript redirects. More and better documentation. Merge "writers" to storage managers. New storage managers: shelve, SQL, ZODB, MetaKit. -More robots (URL checkers): threading, asyncore-based. - -Configuration file to configure defaults - global defaults for the system -and local defaults for subsystems. +More robots (URL checkers): threading, asyncore-based; +robots that test many URLs in parallel. Ruleset-based mechanisms to filter out what types of URLs to check: checking based on URL schema, host, port, path, filename, extension, etc. @@ -21,3 +31,5 @@ Bigger database. Multiuser database. Robot should operates on a part of the DB. WWW-interface to the database. User should import/export/edit bookmarks, schedule robot run, etc. + +A program to collect and check links from a site.