]> git.phdru.name Git - bookmarks_db.git/log
bookmarks_db.git
2 months agoFix(Robots): Only proceed if `wait` was called
Oleg Broytman [Tue, 10 Sep 2024 14:16:15 +0000 (17:16 +0300)]
Fix(Robots): Only proceed if `wait` was called

2 months agoRefactor(Robots): Most robots do not store FTP welcome message
Oleg Broytman [Tue, 10 Sep 2024 13:04:48 +0000 (16:04 +0300)]
Refactor(Robots): Most robots do not store FTP welcome message

2 months agoRefactor(Robots): Make `robot_base` the parent of `multi_mixin`
Oleg Broytman [Tue, 10 Sep 2024 12:58:52 +0000 (15:58 +0300)]
Refactor(Robots): Make `robot_base` the parent of `multi_mixin`

2 months agoFix(Robots): Remove extra second
Oleg Broytman [Tue, 10 Sep 2024 12:43:47 +0000 (15:43 +0300)]
Fix(Robots): Remove extra second

2 months agoFeat(Robots): Removed ftp_timeout
Oleg Broytman [Tue, 10 Sep 2024 12:32:54 +0000 (15:32 +0300)]
Feat(Robots): Removed ftp_timeout

2 months agoFeat(bkmk_rtwisted): HTTP proxy
Oleg Broytman [Tue, 10 Sep 2024 12:15:37 +0000 (15:15 +0300)]
Feat(bkmk_rtwisted): HTTP proxy

2 months agoFeat(Robots): Robot based on `twisted` and `concurrent.futures`
Oleg Broytman [Mon, 9 Sep 2024 21:18:11 +0000 (00:18 +0300)]
Feat(Robots): Robot based on `twisted` and `concurrent.futures`

Processes multiple URLs in parallel (multithreaded).

2 months agoRefactor(Robots): Split `robot_curl` into `cf_multithread`
Oleg Broytman [Mon, 9 Sep 2024 20:25:23 +0000 (23:25 +0300)]
Refactor(Robots): Split `robot_curl` into `cf_multithread`

2 months agoRefactor(Robots): Split `robot_multirequests` into `cf_multiprocess`
Oleg Broytman [Mon, 9 Sep 2024 20:07:24 +0000 (23:07 +0300)]
Refactor(Robots): Split `robot_multirequests` into `cf_multiprocess`

2 months agoRefactor(Robots): Move `encode_url` to `base.py`
Oleg Broytman [Mon, 9 Sep 2024 20:10:56 +0000 (23:10 +0300)]
Refactor(Robots): Move `encode_url` to `base.py`

2 months agoRelease 6.1.0 6.1.0
Oleg Broytman [Sun, 8 Sep 2024 07:36:12 +0000 (10:36 +0300)]
Release 6.1.0

2 months agoFeat(Robots): Combined curl with curlmulti
Oleg Broytman [Sun, 8 Sep 2024 07:36:12 +0000 (10:36 +0300)]
Feat(Robots): Combined curl with curlmulti

The combined robot is named just curl.

2 months agoFeat(Robots): Robot based on curl_multi
Oleg Broytman [Sat, 7 Sep 2024 14:28:54 +0000 (17:28 +0300)]
Feat(Robots): Robot based on curl_multi

Processes multiple URLs in parallel using concurrent.futures (multithreaded).

2 months agoFeat(aio): Combine `aiohttp` with `multiaio`
Oleg Broytman [Sat, 7 Sep 2024 11:39:21 +0000 (14:39 +0300)]
Feat(aio): Combine `aiohttp` with `multiaio`

The combined robot is named just `aio`.

2 months agoRefactor(Robots): Split `bkmk_rmultirequests` into `concurrent_futures` mix-in
Oleg Broytman [Thu, 5 Sep 2024 14:32:55 +0000 (17:32 +0300)]
Refactor(Robots): Split `bkmk_rmultirequests` into `concurrent_futures` mix-in

2 months agoFeat(bkmk_rmultirequests): Fix `concurrent_class` to `ProcessPoolExecutor`
Oleg Broytman [Thu, 5 Sep 2024 14:03:04 +0000 (17:03 +0300)]
Feat(bkmk_rmultirequests): Fix `concurrent_class` to `ProcessPoolExecutor`

2 months agoUpdate docs
Oleg Broytman [Wed, 4 Sep 2024 18:03:31 +0000 (21:03 +0300)]
Update docs

2 months agoFix(base): Fix reporting proxy error
Oleg Broytman [Thu, 22 Aug 2024 22:22:39 +0000 (01:22 +0300)]
Fix(base): Fix reporting proxy error

2 months agoFeat(Robots): Simplify `.get()`
Oleg Broytman [Thu, 22 Aug 2024 21:46:45 +0000 (00:46 +0300)]
Feat(Robots): Simplify `.get()`

Return data without processing,
let's `get_url()` process it in one place.

2 months agoRefactor(bkmk_rcurl): Split off `CurlWrapper`
Oleg Broytman [Thu, 22 Aug 2024 16:54:36 +0000 (19:54 +0300)]
Refactor(bkmk_rcurl): Split off `CurlWrapper`

Will be used in multi-curl robot(s).

2 months agoRefactor(Robots): Separate `headers` into `req_headers` and `resp_headers`
Oleg Broytman [Wed, 21 Aug 2024 14:32:25 +0000 (17:32 +0300)]
Refactor(Robots): Separate `headers` into `req_headers` and `resp_headers`

2 months agoRefactor(bkmk_rmultiaio): Split off `multi_async_mixin`
Oleg Broytman [Wed, 21 Aug 2024 14:04:46 +0000 (17:04 +0300)]
Refactor(bkmk_rmultiaio): Split off `multi_async_mixin`

2 months agoFeat(bkmk_rcurl): Lower SSL security settings
Oleg Broytman [Tue, 20 Aug 2024 22:30:28 +0000 (01:30 +0300)]
Feat(bkmk_rcurl): Lower SSL security settings

2 months agoRefactor(Robots): Pass headers instead of charset
Oleg Broytman [Tue, 20 Aug 2024 22:21:26 +0000 (01:21 +0300)]
Refactor(Robots): Pass headers instead of charset

2 months agoChore(Robots): Report "Checked: <URL>" but avoid duplicates
Oleg Broytman [Tue, 20 Aug 2024 17:28:15 +0000 (20:28 +0300)]
Chore(Robots): Report "Checked: <URL>" but avoid duplicates

Robots that process multiple URLs in parallel report it themselves.

2 months agoChore(check_urls): Improve output
Oleg Broytman [Tue, 20 Aug 2024 17:27:08 +0000 (20:27 +0300)]
Chore(check_urls): Improve output

2 months agoRefactor(Robots): `get`/`get_url` don't need `bookmark`, only charset
Oleg Broytman [Mon, 19 Aug 2024 19:28:41 +0000 (22:28 +0300)]
Refactor(Robots): `get`/`get_url` don't need `bookmark`, only charset

2 months agoFix(get_url): Adapt `get_url` to the new shiny async world
Oleg Broytman [Mon, 19 Aug 2024 16:26:29 +0000 (19:26 +0300)]
Fix(get_url): Adapt `get_url` to the new shiny async world

2 months agoRefactor(bkmk_rmultirequests): Change parent to `robot_base`
Oleg Broytman [Mon, 19 Aug 2024 14:51:20 +0000 (17:51 +0300)]
Refactor(bkmk_rmultirequests): Change parent to `robot_base`

`bkmk_rmultirequests` instantiates `bkmk_rrequests` in the workers
but itself doesn't really use anything from `bkmk_rrequests`,
so it can be just a base robot.

2 months agoDocs: Fix forgotten docs
Oleg Broytman [Mon, 19 Aug 2024 13:30:59 +0000 (16:30 +0300)]
Docs: Fix forgotten docs

2 months agoFeat(bkmk_raiohttp): Lower SSL cert validation strictness
Oleg Broytman [Mon, 19 Aug 2024 12:43:54 +0000 (15:43 +0300)]
Feat(bkmk_raiohttp): Lower SSL cert validation strictness

2 months agoFix(robots.py): Fix robot name
Oleg Broytman [Sat, 7 Sep 2024 10:58:16 +0000 (13:58 +0300)]
Fix(robots.py): Fix robot name

2 months agoFeat(Robots): Robot based on aiohttp, processes multiple URLs in parallel 6.0.0
Oleg Broytman [Mon, 19 Aug 2024 00:59:24 +0000 (03:59 +0300)]
Feat(Robots): Robot based on aiohttp, processes multiple URLs in parallel

2 months agoFeat(Robots): Make all robots async
Oleg Broytman [Sun, 18 Aug 2024 20:52:21 +0000 (23:52 +0300)]
Feat(Robots): Make all robots async

Split check_bookmark() into sync and async variants.

2 months agoRefactor(Robots): Split off `multi_mixin`
Oleg Broytman [Sun, 18 Aug 2024 20:38:52 +0000 (23:38 +0300)]
Refactor(Robots): Split off `multi_mixin`

2 months agoStyle(bkmk_rmultirequests): Renamed max_workers to max_urls
Oleg Broytman [Sun, 18 Aug 2024 20:28:56 +0000 (23:28 +0300)]
Style(bkmk_rmultirequests): Renamed max_workers to max_urls

2 months agoRefactor(Robots/base): Simplify `X-User-Agent`
Oleg Broytman [Sun, 18 Aug 2024 19:57:57 +0000 (22:57 +0300)]
Refactor(Robots/base): Simplify `X-User-Agent`

2 months agoRefactor(Robots): Rename `bkmk_robot_base.py` -> `base.py`
Oleg Broytman [Sun, 18 Aug 2024 19:42:02 +0000 (22:42 +0300)]
Refactor(Robots): Rename `bkmk_robot_base.py` -> `base.py`

2 months agoBuild: Use Python 3 5.7.0
Oleg Broytman [Fri, 16 Aug 2024 16:40:28 +0000 (19:40 +0300)]
Build: Use Python 3

2 months agoVersion 5.7.0
Oleg Broytman [Fri, 16 Aug 2024 16:39:04 +0000 (19:39 +0300)]
Version 5.7.0

2 months agoFix(bkmk_robot_base): Redraw progress bar after unhandled exception
Oleg Broytman [Fri, 16 Aug 2024 14:05:48 +0000 (17:05 +0300)]
Fix(bkmk_robot_base): Redraw progress bar after unhandled exception

2 months agoFix(bkmk_rmultirequests): Limit number of URLs to load into workers
Oleg Broytman [Fri, 16 Aug 2024 13:53:31 +0000 (16:53 +0300)]
Fix(bkmk_rmultirequests): Limit number of URLs to load into workers

2 months agoFeat(bkmk_robot_base): Send our version in `X-User-Agent` header
Oleg Broytman [Fri, 16 Aug 2024 13:37:42 +0000 (16:37 +0300)]
Feat(bkmk_robot_base): Send our version in `X-User-Agent` header

2 months agoRefactor: Move version from `setup.py` to `bkmk_objects.py`
Oleg Broytman [Fri, 16 Aug 2024 13:36:55 +0000 (16:36 +0300)]
Refactor: Move version from `setup.py` to `bkmk_objects.py`

2 months agoBuild: Install `setuptools` for `setup.py`
Oleg Broytman [Fri, 16 Aug 2024 13:36:09 +0000 (16:36 +0300)]
Build: Install `setuptools` for `setup.py`

2 months agoFeat(bkmk_raiohttp): Use siosocks for aioftp
Oleg Broytman [Fri, 16 Aug 2024 13:21:51 +0000 (16:21 +0300)]
Feat(bkmk_raiohttp): Use siosocks for aioftp

2 months agoFeat(bkmk_raiohttp): Use aiohttp-socks
Oleg Broytman [Fri, 16 Aug 2024 13:15:06 +0000 (16:15 +0300)]
Feat(bkmk_raiohttp): Use aiohttp-socks

2 months agoFeat(Robots): Removed connect_timeout, added ftp_timeout
Oleg Broytman [Fri, 16 Aug 2024 12:46:50 +0000 (15:46 +0300)]
Feat(Robots): Removed connect_timeout, added ftp_timeout

2 months agoFix(bkmk_raiohttp): Don't list FTP recursively
Oleg Broytman [Thu, 15 Aug 2024 22:59:22 +0000 (01:59 +0300)]
Fix(bkmk_raiohttp): Don't list FTP recursively

2 months agoFeat(bkmk_rrequests): Use ftplib directly, without requests_ftp
Oleg Broytman [Thu, 15 Aug 2024 22:36:07 +0000 (01:36 +0300)]
Feat(bkmk_rrequests): Use ftplib directly, without requests_ftp

2 months agoRefactor(bkmk_raiohttp): Remove unused values
Oleg Broytman [Thu, 15 Aug 2024 21:59:06 +0000 (00:59 +0300)]
Refactor(bkmk_raiohttp): Remove unused values

3 months agoRefactor(bkmk_robot_base):
Oleg Broytman [Thu, 15 Aug 2024 20:05:29 +0000 (23:05 +0300)]
Refactor(bkmk_robot_base):

3 months agoChore(bkmk_parser): Fix year
Oleg Broytman [Thu, 15 Aug 2024 20:04:46 +0000 (23:04 +0300)]
Chore(bkmk_parser): Fix year

3 months agoVersion 5.6.1: Minor fixes 5.6.1
Oleg Broytman [Thu, 15 Aug 2024 17:52:45 +0000 (20:52 +0300)]
Version 5.6.1: Minor fixes

3 months agoFix(bkmk_ph_lxml): Catch `ParserError`
Oleg Broytman [Sun, 11 Aug 2024 18:24:52 +0000 (21:24 +0300)]
Fix(bkmk_ph_lxml): Catch `ParserError`

3 months agoFix(bkmk_robot_base): Decode base64 bytes to unicode
Oleg Broytman [Sun, 11 Aug 2024 18:12:43 +0000 (21:12 +0300)]
Fix(bkmk_robot_base): Decode base64 bytes to unicode

3 months agoFix(bkmk_rrequests): Not all error codes have messages
Oleg Broytman [Thu, 8 Aug 2024 17:52:48 +0000 (20:52 +0300)]
Fix(bkmk_rrequests): Not all error codes have messages

3 months agoFeat(Robots): Robot based on requests and concurrent.futures 5.6.0
Oleg Broytman [Thu, 8 Aug 2024 13:25:33 +0000 (16:25 +0300)]
Feat(Robots): Robot based on requests and concurrent.futures

Processes multiple URLs in parallel.

3 months agoFeat: Dropped support for Python 2
Oleg Broytman [Thu, 8 Aug 2024 04:45:58 +0000 (07:45 +0300)]
Feat: Dropped support for Python 2

3 months agoFeat(Robots): Remove urllib-based robots
Oleg Broytman [Thu, 8 Aug 2024 04:31:23 +0000 (07:31 +0300)]
Feat(Robots): Remove urllib-based robots

3 months agoStyle(Writers/bkmk_wflad): Rename loop variables
Oleg Broytman [Wed, 7 Aug 2024 22:17:09 +0000 (01:17 +0300)]
Style(Writers/bkmk_wflad): Rename loop variables

3 months agoRefactor: Extract the common list of attributes; `copy_bkmk()`
Oleg Broytman [Wed, 7 Aug 2024 22:14:28 +0000 (01:14 +0300)]
Refactor: Extract the common list of attributes; `copy_bkmk()`

3 months agoFeat(check_urls): Separately report redirects
Oleg Broytman [Wed, 7 Aug 2024 21:51:21 +0000 (00:51 +0300)]
Feat(check_urls): Separately report redirects

3 months agoFeat(Robots): Stop the robot ASAP
Oleg Broytman [Wed, 7 Aug 2024 17:00:53 +0000 (20:00 +0300)]
Feat(Robots): Stop the robot ASAP

Process bookmarks after the robot stopped. In the future
there will be robots that check multiple URLs in parallel
so bookmarks cannot be processed inside check loop
but can be queried after.

3 months agoStyle(Robots): Rename `check_url` to `check_bookmark`
Oleg Broytman [Wed, 7 Aug 2024 15:28:49 +0000 (18:28 +0300)]
Style(Robots): Rename `check_url` to `check_bookmark`

Also rename `smart_get` to `get_url`.

3 months agoFix(bkmk-add): Stop the robot
Oleg Broytman [Tue, 6 Aug 2024 17:52:43 +0000 (20:52 +0300)]
Fix(bkmk-add): Stop the robot

3 months agoStyle(bkmk-add): Rename `_robot` -> `robot`
Oleg Broytman [Tue, 6 Aug 2024 16:18:36 +0000 (19:18 +0300)]
Style(bkmk-add): Rename `_robot` -> `robot`

3 months agoFeat(Robots): Do not return error from `check_url()`
Oleg Broytman [Tue, 6 Aug 2024 16:01:37 +0000 (19:01 +0300)]
Feat(Robots): Do not return error from `check_url()`

Break the entire program with `Ctrl-C`.

3 months agoFeat(bkmk_rrequests): Install socks dependency
Oleg Broytman [Tue, 6 Aug 2024 13:43:58 +0000 (16:43 +0300)]
Feat(bkmk_rrequests): Install socks dependency

3 months agoFeat(bkmk_raiohttp): Use aioftp
Oleg Broytman [Tue, 6 Aug 2024 11:27:46 +0000 (14:27 +0300)]
Feat(bkmk_raiohttp): Use aioftp

3 months agoFix(Robots): Do not route ftp requests via http(s) proxy
Oleg Broytman [Tue, 6 Aug 2024 10:07:50 +0000 (13:07 +0300)]
Fix(Robots): Do not route ftp requests via http(s) proxy

socks5 proxies are ok.

3 months agoFeat(Robots): Robot based on aiohttp 5.5.0
Oleg Broytman [Mon, 5 Aug 2024 12:00:55 +0000 (15:00 +0300)]
Feat(Robots): Robot based on aiohttp

3 months agoFeat(Writers/bkmk_whtml): Mark special folders
Oleg Broytman [Tue, 6 Aug 2024 06:16:46 +0000 (09:16 +0300)]
Feat(Writers/bkmk_whtml): Mark special folders

3 months agoFeat: Delete `root_folder.linear` before storing
Oleg Broytman [Mon, 5 Aug 2024 20:19:03 +0000 (23:19 +0300)]
Feat: Delete `root_folder.linear` before storing

3 months agoFix(bkmk_robot_base): Convert environment parameters to integer
Oleg Broytman [Mon, 5 Aug 2024 15:19:26 +0000 (18:19 +0300)]
Fix(bkmk_robot_base): Convert environment parameters to integer

3 months agoFeat(bkmk_robot_base) Cut long `data:` icon URLs for logs
Oleg Broytman [Mon, 5 Aug 2024 14:33:33 +0000 (17:33 +0300)]
Feat(bkmk_robot_base) Cut long `data:` icon URLs for logs

3 months agoFix(bkmk_rcurl): IDNA-encode URLs 5.4.1
Oleg Broytman [Mon, 5 Aug 2024 12:56:23 +0000 (15:56 +0300)]
Fix(bkmk_rcurl): IDNA-encode URLs

PycURL doesn't encode URLs itself
and requires URLs to be in ASCII encoding.

3 months agoRefactor(Robots): Connect timeout
Oleg Broytman [Mon, 5 Aug 2024 10:41:08 +0000 (13:41 +0300)]
Refactor(Robots): Connect timeout

3 months agoVersion 5.4.0: Robot based on PycURL 5.4.0
Oleg Broytman [Fri, 2 Aug 2024 10:42:11 +0000 (13:42 +0300)]
Version 5.4.0: Robot based on PycURL

3 months agoBuild: Add `devscripts`
Oleg Broytman [Fri, 2 Aug 2024 10:44:15 +0000 (13:44 +0300)]
Build: Add `devscripts`

3 months agoFix(bkmk_robot_base): Fix reporting proxy error
Oleg Broytman [Thu, 1 Aug 2024 16:10:30 +0000 (19:10 +0300)]
Fix(bkmk_robot_base): Fix reporting proxy error

3 months agoFeat(Robots): Update X-User-Agent header
Oleg Broytman [Thu, 1 Aug 2024 10:03:16 +0000 (13:03 +0300)]
Feat(Robots): Update X-User-Agent header

3 months agoFeat(Robots): Upgrade headers
Oleg Broytman [Thu, 1 Aug 2024 10:03:03 +0000 (13:03 +0300)]
Feat(Robots): Upgrade headers

3 months agoDocs(bkmk_robot_base): List robots
Oleg Broytman [Thu, 1 Aug 2024 08:40:32 +0000 (11:40 +0300)]
Docs(bkmk_robot_base): List robots

3 months agoFix(bkmk_rrequests): Check `r is not None`
Oleg Broytman [Thu, 1 Aug 2024 07:26:13 +0000 (10:26 +0300)]
Fix(bkmk_rrequests): Check `r is not None`

It seems `if r` is not enough --
`bool(r)` returns `False` in case there was an HTTP error.

3 months agoStyle(setup.py): Remove unused import
Oleg Broytman [Thu, 1 Aug 2024 05:06:27 +0000 (08:06 +0300)]
Style(setup.py): Remove unused import

3 months agoFeat(get_url): Parse args, save/print headers/body
Oleg Broytman [Thu, 1 Aug 2024 04:34:11 +0000 (07:34 +0300)]
Feat(get_url): Parse args, save/print headers/body

3 months agoFeat(robots): Report robot being used
Oleg Broytman [Thu, 1 Aug 2024 04:19:34 +0000 (07:19 +0300)]
Feat(robots): Report robot being used

3 months agoFeat(bkmk_robot_base): Report error on getting icon
Oleg Broytman [Wed, 31 Jul 2024 22:47:45 +0000 (01:47 +0300)]
Feat(bkmk_robot_base): Report error on getting icon

3 months agoFeat(get_url): Print headers
Oleg Broytman [Wed, 31 Jul 2024 22:25:07 +0000 (01:25 +0300)]
Feat(get_url): Print headers

3 months agoUpdate docs
Oleg Broytman [Wed, 31 Jul 2024 18:21:48 +0000 (21:21 +0300)]
Update docs

3 months agoFear(robots): Try robots from a list
Oleg Broytman [Wed, 31 Jul 2024 18:12:59 +0000 (21:12 +0300)]
Fear(robots): Try robots from a list

Default list is curl,requests,forking.

3 months agoFeat(Robots): Robot based on PycURL
Oleg Broytman [Wed, 31 Jul 2024 17:29:29 +0000 (20:29 +0300)]
Feat(Robots): Robot based on PycURL

3 months agoFix(bkmk_robot_base): Do not pass `localhost` via proxy
Oleg Broytman [Wed, 31 Jul 2024 16:23:22 +0000 (19:23 +0300)]
Fix(bkmk_robot_base): Do not pass `localhost` via proxy

3 months agoFeat(bkmk_rurllib): Use proxy
Oleg Broytman [Wed, 31 Jul 2024 16:22:58 +0000 (19:22 +0300)]
Feat(bkmk_rurllib): Use proxy

3 months agoStyle(bkmk_rurllib2): Remove unused import
Oleg Broytman [Wed, 31 Jul 2024 16:21:58 +0000 (19:21 +0300)]
Style(bkmk_rurllib2): Remove unused import

Found by `flake8`.

3 months agoRefactor(Robots): Move proxy handling to base class
Oleg Broytman [Wed, 31 Jul 2024 15:49:11 +0000 (18:49 +0300)]
Refactor(Robots): Move proxy handling to base class

This greatly simplifies robots.

3 months agoFeat(Robots): Return HTTP status code
Oleg Broytman [Wed, 31 Jul 2024 15:14:05 +0000 (18:14 +0300)]
Feat(Robots): Return HTTP status code

3 months agoDocs(TODO): Robot(s) that test many URLs in parallel
Oleg Broytman [Fri, 26 Jul 2024 10:12:14 +0000 (13:12 +0300)]
Docs(TODO): Robot(s) that test many URLs in parallel

Increase task priority.

3 months agoDocs(TODO): Robot based on aiohttp
Oleg Broytman [Fri, 26 Jul 2024 10:11:22 +0000 (13:11 +0300)]
Docs(TODO): Robot based on aiohttp