]>
git.phdru.name Git - bookmarks_db.git/log
Oleg Broytman [Mon, 19 Dec 2011 22:40:43 +0000 (22:40 +0000)]
Write netscape-formatted datetime.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@355
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 19 Dec 2011 22:34:53 +0000 (22:34 +0000)]
2039!
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@354
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 19 Dec 2011 22:29:52 +0000 (22:29 +0000)]
Process mozilla-specific date/time representation.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@353
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 18 Dec 2011 17:48:28 +0000 (17:48 +0000)]
Changed the order or parser according to their success rate.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@352
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 18 Dec 2011 17:36:10 +0000 (17:36 +0000)]
Version 4.5.0.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@351
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 18 Dec 2011 17:35:19 +0000 (17:35 +0000)]
Pretty-print last_modified.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@350
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 12 Dec 2011 09:30:12 +0000 (09:30 +0000)]
Added a shell wrapper for set-title-list.py.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@349
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 12 Dec 2011 09:29:03 +0000 (09:29 +0000)]
Adapted to different Mozilla place URIs.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@348
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 11 Dec 2011 03:43:17 +0000 (03:43 +0000)]
Debug HTML parsers.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@347
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 11 Dec 2011 03:29:12 +0000 (03:29 +0000)]
Newer Mozilla versions use 'places://'.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@346
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 11 Dec 2011 03:27:44 +0000 (03:27 +0000)]
Encode values from unicode.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@345
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 11 Dec 2011 03:27:18 +0000 (03:27 +0000)]
Convert to unicode and back again to unescape unichr'd entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@344
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 1 Dec 2011 12:21:32 +0000 (12:21 +0000)]
Moved robot_simple into run().
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@343
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 1 Dec 2011 12:20:34 +0000 (12:20 +0000)]
urllib2
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@342
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 1 Dec 2011 12:20:01 +0000 (12:20 +0000)]
Removed mirror at phd.by.ru.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@341
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 13 Jul 2011 13:43:23 +0000 (13:43 +0000)]
Underline IDNA encoding.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@340
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 12 Jul 2011 15:18:10 +0000 (15:18 +0000)]
Do not split/decode query and tag - only split host and path and recode the host.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@339
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 12 Jul 2011 14:37:25 +0000 (14:37 +0000)]
Split hrefs into domain and path components; recode only domain.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@338
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Jul 2011 18:48:24 +0000 (18:48 +0000)]
Insert a space.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@337
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Jul 2011 18:42:33 +0000 (18:42 +0000)]
Recode hrefs (due to international domain names) to the current charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@336
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Jul 2011 18:19:22 +0000 (18:19 +0000)]
Fixed bugs related to keywords/positional parameters.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@335
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Jul 2011 18:05:58 +0000 (18:05 +0000)]
last_visit could be None.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@334
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 14 Apr 2011 15:25:28 +0000 (15:25 +0000)]
Moved DEFAULT_CHARSET to bkmk_objects.py.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@333
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 14 Apr 2011 15:23:14 +0000 (15:23 +0000)]
m_lib.defenc is always available.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@332
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 14 Apr 2011 15:20:20 +0000 (15:20 +0000)]
Get default charset from m_lib, if available.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@331
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 14 Apr 2011 15:15:07 +0000 (15:15 +0000)]
Fixed a bug - restored DEFAULT_CHARSET.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@330
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Jan 2011 12:07:15 +0000 (12:07 +0000)]
Version 4.4.0 was released 2011-01-07.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@329
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Jan 2011 12:06:30 +0000 (12:06 +0000)]
Changed docstring.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@328
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:38:11 +0000 (19:38 +0000)]
Fixed grammar.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@327
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:15:55 +0000 (19:15 +0000)]
Unindented log texts.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@326
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:12:59 +0000 (19:12 +0000)]
Export main. Import universal_charset only in main.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@325
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:11:54 +0000 (19:11 +0000)]
Fixed logging.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@324
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:04:30 +0000 (19:04 +0000)]
Skip bookmarks with keyword.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@323
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 19:00:27 +0000 (19:00 +0000)]
Added code to collect statistics on parsers;
sort parser according to the statistics; commented out statistics code.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@322
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 6 Jan 2011 18:57:52 +0000 (18:57 +0000)]
Fixed comment.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@321
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 5 Jan 2011 19:01:21 +0000 (19:01 +0000)]
Renamed parse_html modules to bkmk_ph_* to avoid name clashes.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@320
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Jan 2011 19:34:49 +0000 (19:34 +0000)]
Report what parser is in use.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@319
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Jan 2011 19:31:21 +0000 (19:31 +0000)]
Added __all__.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@318
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Jan 2011 17:13:55 +0000 (17:13 +0000)]
Preparing version 4.4.0.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@317
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Jan 2011 17:12:18 +0000 (17:12 +0000)]
2011.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@316
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Jan 2011 17:10:17 +0000 (17:10 +0000)]
Added docstrings, __{version,revision,etc}__ boilerplates.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@315
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 19:44:37 +0000 (19:44 +0000)]
Copy COPYING to doc.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@314
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 19:42:40 +0000 (19:42 +0000)]
Stopped tracking the text of the GPL license. Moved it to doc subdirectory.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@313
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 19:36:30 +0000 (19:36 +0000)]
Moved BeautifulSoup.py and subproc.py from Robots/ to the top-level directory.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@312
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 19:35:08 +0000 (19:35 +0000)]
Moved parse_html.py and its submodules to a separate parse_html module.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@311
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 19:30:42 +0000 (19:30 +0000)]
Removed old cruft.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@310
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 17:25:18 +0000 (17:25 +0000)]
Removed old unused scripts and docs.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@309
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 17:20:45 +0000 (17:20 +0000)]
ElementTidy often segfaults.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@308
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 09:53:21 +0000 (09:53 +0000)]
Version 4.3.1 was released 2011-01-03.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@307
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 09:51:42 +0000 (09:51 +0000)]
2011.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@306
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 00:34:35 +0000 (00:34 +0000)]
Changed wording.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@305
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Jan 2011 00:07:46 +0000 (00:07 +0000)]
Get favicon even if it's of a wrong type.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@304
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 2 Jan 2011 23:56:53 +0000 (23:56 +0000)]
Get favicon before HTML redirect (refresh).
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@303
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 2 Jan 2011 00:41:31 +0000 (00:41 +0000)]
2011.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@302
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 2 Jan 2011 00:40:46 +0000 (00:40 +0000)]
Encode icon's URL from unicode.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@301
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:56:43 +0000 (13:56 +0000)]
Version 4.3.0 (2011-01-01).
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@300
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:56:02 +0000 (13:56 +0000)]
Removed excessive empty lines.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@299
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:54:17 +0000 (13:54 +0000)]
Copy third-party modules to the source tree.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@298
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:51:50 +0000 (13:51 +0000)]
Stop tracking a third-party program.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@297
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:18:17 +0000 (13:18 +0000)]
Do not append timestamp.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@296
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:16:51 +0000 (13:16 +0000)]
Do it in tmp.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@295
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 1 Jan 2011 13:04:29 +0000 (13:04 +0000)]
phd.pp.ru => phdru.name.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@294
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 24 Dec 2010 19:18:12 +0000 (19:18 +0000)]
phd.pp.ru => phdru.name.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@293
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 7 Oct 2010 18:30:33 +0000 (18:30 +0000)]
Remove all temporary files with urlcleanup().
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@292
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 7 Oct 2010 18:29:54 +0000 (18:29 +0000)]
Fixed a bug.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@291
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 7 Oct 2010 18:10:46 +0000 (18:10 +0000)]
Robots no longer have one global temporary file - there are at least two
(html and favicon), and in the future there will be more for
asynchronous robot(s) that would test many URLs in parallel.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@290
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 2 Oct 2010 22:35:50 +0000 (22:35 +0000)]
More robots (URL checkers): PycURL multi.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@289
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 2 Oct 2010 16:35:44 +0000 (16:35 +0000)]
Minor optimization (free an object). Changed error message.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@288
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 24 Aug 2010 17:40:11 +0000 (17:40 +0000)]
No need to call .lower() two times.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@287
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 14 Aug 2010 22:32:35 +0000 (22:32 +0000)]
Write icon's URIs.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@286
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 18:54:29 +0000 (18:54 +0000)]
Fixed encoding.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@285
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 15:46:30 +0000 (15:46 +0000)]
Fixed a bug.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@284
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 14:54:28 +0000 (14:54 +0000)]
HTML parser based on lxml was implemnted.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@283
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 14:53:58 +0000 (14:53 +0000)]
Test for completely broken HTML.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@282
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:38:06 +0000 (13:38 +0000)]
Moved lxml-based parser after BeautifulSoup - it doesn't accept charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@281
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:19:27 +0000 (13:19 +0000)]
Insert lxml-based parser at the beginning.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@280
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:18:04 +0000 (13:18 +0000)]
Next version will be 4.2.2.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@279
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:17:33 +0000 (13:17 +0000)]
Added HTML Parser based on lxml.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@278
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:06:20 +0000 (13:06 +0000)]
Lookup title in html if not found in head.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@277
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:04:15 +0000 (13:04 +0000)]
Fixed a bug - moved the code where meta_charset is defined.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@276
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 13 Aug 2010 13:03:06 +0000 (13:03 +0000)]
Lookup title in the root if not found in head.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@275
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 19:56:25 +0000 (19:56 +0000)]
Minor doc updates.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@274
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 19:45:49 +0000 (19:45 +0000)]
Version 4.2.1 was released 2010-08-12.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@273
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 19:45:02 +0000 (19:45 +0000)]
Nicer logging.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@272
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 15:15:38 +0000 (15:15 +0000)]
Fixed a bug - don't do a double encode.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@271
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 15:09:34 +0000 (15:09 +0000)]
Try parser in order until the first one finds a title.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@270
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 09:13:59 +0000 (09:13 +0000)]
Parser could be None.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@269
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 09:11:06 +0000 (09:11 +0000)]
Test if m_lib is available.
Return None instead of a parser if there are no parsers.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@268
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 09:05:40 +0000 (09:05 +0000)]
Do not parse meta charset if there is HTTP charset.
Find html in tree.childNodes skipping DocType.
Use meta charset if there is no HTTP charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@267
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 09:02:06 +0000 (09:02 +0000)]
Move charset to the beginning of the list.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@266
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:44:03 +0000 (20:44 +0000)]
Fixed a bug - check if childNodes not empty.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@265
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:08:41 +0000 (20:08 +0000)]
Store icon's URIs.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@264
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:07:57 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@263
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:07:29 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@262
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 19:18:26 +0000 (19:18 +0000)]
Removed parse_html_etreetidy - TidyHTMLTreeBuilder segfaults. Ouch! :(
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@261
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:24:36 +0000 (18:24 +0000)]
Version 4.2.1.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@260
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:23:40 +0000 (18:23 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@259
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:22:25 +0000 (18:22 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@258
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 17:26:11 +0000 (17:26 +0000)]
Moved HTMLParser from parse_html_beautifulsoup.py to parse_html_util.py.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@257
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:59:40 +0000 (16:59 +0000)]
Fixed a bug in case there are more than one Content-Type headers.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@256
fdd5c36f -1aea-0310-aeeb-
c58d7e2b6c23