git.phdru.name Git - bookmarks_db.git/log
12 years ago Write netscape-formatted datetime.
Oleg Broytman [Mon, 19 Dec 2011 22:40:43 +0000 (22:40 +0000)]
Write netscape-formatted datetime.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@355 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago 2039!
Oleg Broytman [Mon, 19 Dec 2011 22:34:53 +0000 (22:34 +0000)]
2039!

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@354 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Process mozilla-specific date/time representation.
Oleg Broytman [Mon, 19 Dec 2011 22:29:52 +0000 (22:29 +0000)]
Process mozilla-specific date/time representation.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@353 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Changed the order of parsers according to their success rate.
Oleg Broytman [Sun, 18 Dec 2011 17:48:28 +0000 (17:48 +0000)]
Changed the order of parsers according to their success rate.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@352 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Version 4.5.0.
Oleg Broytman [Sun, 18 Dec 2011 17:36:10 +0000 (17:36 +0000)]
Version 4.5.0.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@351 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Pretty-print last_modified.
Oleg Broytman [Sun, 18 Dec 2011 17:35:19 +0000 (17:35 +0000)]
Pretty-print last_modified.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@350 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Added a shell wrapper for set-title-list.py.
Oleg Broytman [Mon, 12 Dec 2011 09:30:12 +0000 (09:30 +0000)]
Added a shell wrapper for set-title-list.py.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@349 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Adapted to different Mozilla place URIs.
Oleg Broytman [Mon, 12 Dec 2011 09:29:03 +0000 (09:29 +0000)]
Adapted to different Mozilla place URIs.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@348 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Debug HTML parsers.
Oleg Broytman [Sun, 11 Dec 2011 03:43:17 +0000 (03:43 +0000)]
Debug HTML parsers.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@347 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Newer Mozilla versions use 'places://'.
Oleg Broytman [Sun, 11 Dec 2011 03:29:12 +0000 (03:29 +0000)]
Newer Mozilla versions use 'places://'.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@346 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Encode values from unicode.
Oleg Broytman [Sun, 11 Dec 2011 03:27:44 +0000 (03:27 +0000)]
Encode values from unicode.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@345 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Convert to unicode and back again to unescape unichr'd entities.
Oleg Broytman [Sun, 11 Dec 2011 03:27:18 +0000 (03:27 +0000)]
Convert to unicode and back again to unescape unichr'd entities.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@344 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Moved robot_simple into run().
Oleg Broytman [Thu, 1 Dec 2011 12:21:32 +0000 (12:21 +0000)]
Moved robot_simple into run().

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@343 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago urllib2
Oleg Broytman [Thu, 1 Dec 2011 12:20:34 +0000 (12:20 +0000)]
urllib2

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@342 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

12 years ago Removed mirror at phd.by.ru.
Oleg Broytman [Thu, 1 Dec 2011 12:20:01 +0000 (12:20 +0000)]
Removed mirror at phd.by.ru.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@341 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Underline IDNA encoding.
Oleg Broytman [Wed, 13 Jul 2011 13:43:23 +0000 (13:43 +0000)]
Underline IDNA encoding.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@340 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Do not split/decode query and tag - only split host and path and recode the host.
Oleg Broytman [Tue, 12 Jul 2011 15:18:10 +0000 (15:18 +0000)]
Do not split/decode query and tag - only split host and path and recode the host.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@339 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23
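
The idea in the log entry above (recode only the host, leave path, query and tag alone) can be sketched roughly as follows. This is an illustration in Python 3, not the code from the repository, which targeted Python 2 at the time; the function name `recode_host` is hypothetical, and using the built-in `idna` codec is an assumption about how the host could be recoded.

```python
from urllib.parse import urlsplit, urlunsplit


def recode_host(href):
    """Return href with only the host IDNA-encoded (hypothetical helper).

    Path, query and fragment are passed through untouched.
    IPv6 literal hosts are not handled in this sketch.
    """
    parts = urlsplit(href)
    host = parts.hostname
    if not host:
        return href  # nothing to recode (e.g. a relative link)

    # Python's built-in "idna" codec turns a Unicode host into its ASCII form.
    netloc = host.encode("idna").decode("ascii")

    # Put back the port and any user:password prefix around the host.
    if parts.port is not None:
        netloc += ":%d" % parts.port
    if "@" in parts.netloc:
        netloc = parts.netloc.rsplit("@", 1)[0] + "@" + netloc

    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


print(recode_host("http://bücher.de/straße?q=ü#x"))
```

Only the network location changes; a plain ASCII URL comes back unmodified.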

13 years ago Split hrefs into domain and path components; recode only domain.
Oleg Broytman [Tue, 12 Jul 2011 14:37:25 +0000 (14:37 +0000)]
Split hrefs into domain and path components; recode only domain.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@338 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Insert a space.
Oleg Broytman [Mon, 11 Jul 2011 18:48:24 +0000 (18:48 +0000)]
Insert a space.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@337 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Recode hrefs (due to international domain names) to the current charset.
Oleg Broytman [Mon, 11 Jul 2011 18:42:33 +0000 (18:42 +0000)]
Recode hrefs (due to international domain names) to the current charset.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@336 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Fixed bugs related to keywords/positional parameters.
Oleg Broytman [Mon, 11 Jul 2011 18:19:22 +0000 (18:19 +0000)]
Fixed bugs related to keywords/positional parameters.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@335 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago last_visit could be None.
Oleg Broytman [Mon, 11 Jul 2011 18:05:58 +0000 (18:05 +0000)]
last_visit could be None.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@334 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Moved DEFAULT_CHARSET to bkmk_objects.py.
Oleg Broytman [Thu, 14 Apr 2011 15:25:28 +0000 (15:25 +0000)]
Moved DEFAULT_CHARSET to bkmk_objects.py.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@333 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago m_lib.defenc is always available.
Oleg Broytman [Thu, 14 Apr 2011 15:23:14 +0000 (15:23 +0000)]
m_lib.defenc is always available.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@332 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Get default charset from m_lib, if available.
Oleg Broytman [Thu, 14 Apr 2011 15:20:20 +0000 (15:20 +0000)]
Get default charset from m_lib, if available.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@331 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Fixed a bug - restored DEFAULT_CHARSET.
Oleg Broytman [Thu, 14 Apr 2011 15:15:07 +0000 (15:15 +0000)]
Fixed a bug - restored DEFAULT_CHARSET.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@330 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Version 4.4.0 was released 2011-01-07.
Oleg Broytman [Fri, 7 Jan 2011 12:07:15 +0000 (12:07 +0000)]
Version 4.4.0 was released 2011-01-07.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@329 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Changed docstring.
Oleg Broytman [Fri, 7 Jan 2011 12:06:30 +0000 (12:06 +0000)]
Changed docstring.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@328 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Fixed grammar.
Oleg Broytman [Thu, 6 Jan 2011 19:38:11 +0000 (19:38 +0000)]
Fixed grammar.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@327 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Unindented log texts.
Oleg Broytman [Thu, 6 Jan 2011 19:15:55 +0000 (19:15 +0000)]
Unindented log texts.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@326 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Export main. Import universal_charset only in main.
Oleg Broytman [Thu, 6 Jan 2011 19:12:59 +0000 (19:12 +0000)]
Export main. Import universal_charset only in main.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@325 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Fixed logging.
Oleg Broytman [Thu, 6 Jan 2011 19:11:54 +0000 (19:11 +0000)]
Fixed logging.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@324 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Skip bookmarks with keyword.
Oleg Broytman [Thu, 6 Jan 2011 19:04:30 +0000 (19:04 +0000)]
Skip bookmarks with keyword.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@323 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Added code to collect statistics on parsers;
Oleg Broytman [Thu, 6 Jan 2011 19:00:27 +0000 (19:00 +0000)]
Added code to collect statistics on parsers;
sort parsers according to the statistics; commented out statistics code.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@322 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23
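
One way to picture the statistics-driven ordering mentioned in the entry above is the sketch below. It is only an assumption about the approach: the parser names and the `order_parsers` helper are hypothetical, and the log does not say how the statistics were actually stored.

```python
from collections import Counter


def order_parsers(parser_names, successes):
    """Return parser names sorted by success count, most successful first.

    sorted() is stable, so parsers with equal counts keep their
    original relative order. Names missing from the counter count as 0.
    """
    return sorted(parser_names, key=lambda name: successes[name], reverse=True)


# Hypothetical tally: each item records one page where that parser found a title.
successes = Counter(["beautifulsoup", "lxml", "beautifulsoup", "beautifulsoup"])
print(order_parsers(["html5", "lxml", "beautifulsoup"], successes))
```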

13 years ago Fixed comment.
Oleg Broytman [Thu, 6 Jan 2011 18:57:52 +0000 (18:57 +0000)]
Fixed comment.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@321 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Renamed parse_html modules to bkmk_ph_* to avoid name clashes.
Oleg Broytman [Wed, 5 Jan 2011 19:01:21 +0000 (19:01 +0000)]
Renamed parse_html modules to bkmk_ph_* to avoid name clashes.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@320 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Report what parser is in use.
Oleg Broytman [Tue, 4 Jan 2011 19:34:49 +0000 (19:34 +0000)]
Report what parser is in use.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@319 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Added __all__.
Oleg Broytman [Tue, 4 Jan 2011 19:31:21 +0000 (19:31 +0000)]
Added __all__.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@318 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Preparing version 4.4.0.
Oleg Broytman [Tue, 4 Jan 2011 17:13:55 +0000 (17:13 +0000)]
Preparing version 4.4.0.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@317 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago 2011.
Oleg Broytman [Tue, 4 Jan 2011 17:12:18 +0000 (17:12 +0000)]
2011.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@316 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Added docstrings, __{version,revision,etc}__ boilerplates.
Oleg Broytman [Tue, 4 Jan 2011 17:10:17 +0000 (17:10 +0000)]
Added docstrings, __{version,revision,etc}__ boilerplates.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@315 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Copy COPYING to doc.
Oleg Broytman [Mon, 3 Jan 2011 19:44:37 +0000 (19:44 +0000)]
Copy COPYING to doc.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@314 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Stopped tracking the text of the GPL license. Moved it to doc subdirectory.
Oleg Broytman [Mon, 3 Jan 2011 19:42:40 +0000 (19:42 +0000)]
Stopped tracking the text of the GPL license. Moved it to doc subdirectory.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@313 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Moved BeautifulSoup.py and subproc.py from Robots/ to the top-level directory.
Oleg Broytman [Mon, 3 Jan 2011 19:36:30 +0000 (19:36 +0000)]
Moved BeautifulSoup.py and subproc.py from Robots/ to the top-level directory.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@312 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Moved parse_html.py and its submodules to a separate parse_html module.
Oleg Broytman [Mon, 3 Jan 2011 19:35:08 +0000 (19:35 +0000)]
Moved parse_html.py and its submodules to a separate parse_html module.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@311 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Removed old cruft.
Oleg Broytman [Mon, 3 Jan 2011 19:30:42 +0000 (19:30 +0000)]
Removed old cruft.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@310 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Removed old unused scripts and docs.
Oleg Broytman [Mon, 3 Jan 2011 17:25:18 +0000 (17:25 +0000)]
Removed old unused scripts and docs.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@309 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago ElementTidy often segfaults.
Oleg Broytman [Mon, 3 Jan 2011 17:20:45 +0000 (17:20 +0000)]
ElementTidy often segfaults.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@308 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Version 4.3.1 was released 2011-01-03.
Oleg Broytman [Mon, 3 Jan 2011 09:53:21 +0000 (09:53 +0000)]
Version 4.3.1 was released 2011-01-03.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@307 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago 2011.
Oleg Broytman [Mon, 3 Jan 2011 09:51:42 +0000 (09:51 +0000)]
2011.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@306 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Changed wording.
Oleg Broytman [Mon, 3 Jan 2011 00:34:35 +0000 (00:34 +0000)]
Changed wording.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@305 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Get favicon even if it's of a wrong type.
Oleg Broytman [Mon, 3 Jan 2011 00:07:46 +0000 (00:07 +0000)]
Get favicon even if it's of a wrong type.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@304 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Get favicon before HTML redirect (refresh).
Oleg Broytman [Sun, 2 Jan 2011 23:56:53 +0000 (23:56 +0000)]
Get favicon before HTML redirect (refresh).

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@303 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago 2011.
Oleg Broytman [Sun, 2 Jan 2011 00:41:31 +0000 (00:41 +0000)]
2011.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@302 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Encode icon's URL from unicode.
Oleg Broytman [Sun, 2 Jan 2011 00:40:46 +0000 (00:40 +0000)]
Encode icon's URL from unicode.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@301 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Version 4.3.0 (2011-01-01).
Oleg Broytman [Sat, 1 Jan 2011 13:56:43 +0000 (13:56 +0000)]
Version 4.3.0 (2011-01-01).

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@300 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Removed excessive empty lines.
Oleg Broytman [Sat, 1 Jan 2011 13:56:02 +0000 (13:56 +0000)]
Removed excessive empty lines.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@299 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Copy third-party modules to the source tree.
Oleg Broytman [Sat, 1 Jan 2011 13:54:17 +0000 (13:54 +0000)]
Copy third-party modules to the source tree.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@298 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Stop tracking a third-party program.
Oleg Broytman [Sat, 1 Jan 2011 13:51:50 +0000 (13:51 +0000)]
Stop tracking a third-party program.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@297 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Do not append timestamp.
Oleg Broytman [Sat, 1 Jan 2011 13:18:17 +0000 (13:18 +0000)]
Do not append timestamp.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@296 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago Do it in tmp.
Oleg Broytman [Sat, 1 Jan 2011 13:16:51 +0000 (13:16 +0000)]
Do it in tmp.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@295 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago phd.pp.ru => phdru.name.
Oleg Broytman [Sat, 1 Jan 2011 13:04:29 +0000 (13:04 +0000)]
phd.pp.ru => phdru.name.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@294 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

13 years ago phd.pp.ru => phdru.name.
Oleg Broytman [Fri, 24 Dec 2010 19:18:12 +0000 (19:18 +0000)]
phd.pp.ru => phdru.name.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@293 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Remove all temporary files with urlcleanup().
Oleg Broytman [Thu, 7 Oct 2010 18:30:33 +0000 (18:30 +0000)]
Remove all temporary files with urlcleanup().

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@292 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug.
Oleg Broytman [Thu, 7 Oct 2010 18:29:54 +0000 (18:29 +0000)]
Fixed a bug.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@291 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Robots no longer have one global temporary file - there are at least two
Oleg Broytman [Thu, 7 Oct 2010 18:10:46 +0000 (18:10 +0000)]
Robots no longer have one global temporary file - there are at least two
(html and favicon), and in the future there will be more for
asynchronous robot(s) that would test many URLs in parallel.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@290 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago More robots (URL checkers): PycURL multi.
Oleg Broytman [Sat, 2 Oct 2010 22:35:50 +0000 (22:35 +0000)]
More robots (URL checkers): PycURL multi.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@289 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Minor optimization (free an object). Changed error message.
Oleg Broytman [Sat, 2 Oct 2010 16:35:44 +0000 (16:35 +0000)]
Minor optimization (free an object). Changed error message.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@288 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago No need to call .lower() two times.
Oleg Broytman [Tue, 24 Aug 2010 17:40:11 +0000 (17:40 +0000)]
No need to call .lower() two times.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@287 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Write icon's URIs.
Oleg Broytman [Sat, 14 Aug 2010 22:32:35 +0000 (22:32 +0000)]
Write icon's URIs.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@286 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed encoding.
Oleg Broytman [Fri, 13 Aug 2010 18:54:29 +0000 (18:54 +0000)]
Fixed encoding.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@285 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug.
Oleg Broytman [Fri, 13 Aug 2010 15:46:30 +0000 (15:46 +0000)]
Fixed a bug.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@284 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago HTML parser based on lxml was implemented.
Oleg Broytman [Fri, 13 Aug 2010 14:54:28 +0000 (14:54 +0000)]
HTML parser based on lxml was implemented.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@283 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Test for completely broken HTML.
Oleg Broytman [Fri, 13 Aug 2010 14:53:58 +0000 (14:53 +0000)]
Test for completely broken HTML.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@282 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Moved lxml-based parser after BeautifulSoup - it doesn't accept charset.
Oleg Broytman [Fri, 13 Aug 2010 13:38:06 +0000 (13:38 +0000)]
Moved lxml-based parser after BeautifulSoup - it doesn't accept charset.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@281 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Insert lxml-based parser at the beginning.
Oleg Broytman [Fri, 13 Aug 2010 13:19:27 +0000 (13:19 +0000)]
Insert lxml-based parser at the beginning.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@280 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Next version will be 4.2.2.
Oleg Broytman [Fri, 13 Aug 2010 13:18:04 +0000 (13:18 +0000)]
Next version will be 4.2.2.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@279 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Added HTML Parser based on lxml.
Oleg Broytman [Fri, 13 Aug 2010 13:17:33 +0000 (13:17 +0000)]
Added HTML Parser based on lxml.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@278 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Look up title in html if not found in head.
Oleg Broytman [Fri, 13 Aug 2010 13:06:20 +0000 (13:06 +0000)]
Look up title in html if not found in head.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@277 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug - moved the code where meta_charset is defined.
Oleg Broytman [Fri, 13 Aug 2010 13:04:15 +0000 (13:04 +0000)]
Fixed a bug - moved the code where meta_charset is defined.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@276 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Look up title in the root if not found in head.
Oleg Broytman [Fri, 13 Aug 2010 13:03:06 +0000 (13:03 +0000)]
Look up title in the root if not found in head.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@275 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Minor doc updates.
Oleg Broytman [Thu, 12 Aug 2010 19:56:25 +0000 (19:56 +0000)]
Minor doc updates.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@274 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Version 4.2.1 was released 2010-08-12.
Oleg Broytman [Thu, 12 Aug 2010 19:45:49 +0000 (19:45 +0000)]
Version 4.2.1 was released 2010-08-12.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@273 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Nicer logging.
Oleg Broytman [Thu, 12 Aug 2010 19:45:02 +0000 (19:45 +0000)]
Nicer logging.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@272 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug - don't do a double encode.
Oleg Broytman [Thu, 12 Aug 2010 15:15:38 +0000 (15:15 +0000)]
Fixed a bug - don't do a double encode.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@271 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Try parsers in order until the first one finds a title.
Oleg Broytman [Thu, 12 Aug 2010 15:09:34 +0000 (15:09 +0000)]
Try parsers in order until the first one finds a title.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@270 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23
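
The "first parser that finds a title wins" strategy from the entry above might look like the sketch below. The two regex-based parsers and the `find_title` function are toy stand-ins invented for illustration; the real project used library-backed parsers (BeautifulSoup, lxml, html5) whose interfaces are not shown in this log.

```python
import re


def title_in_head(html):
    """Toy parser: accept a <title> only when it appears inside <head>."""
    m = re.search(r"<head\b[^>]*>.*?<title[^>]*>(.*?)</title>", html, re.I | re.S)
    return m.group(1).strip() if m else None


def title_anywhere(html):
    """Toy parser: accept a <title> anywhere in the document."""
    m = re.search(r"<title[^>]*>(.*?)</title>", html, re.I | re.S)
    return m.group(1).strip() if m else None


def find_title(html, parsers):
    """Try each parser in order; return (parser name, title) for the first hit."""
    for parser in parsers:
        try:
            title = parser(html)
        except Exception:
            continue  # a crashing parser should not stop the chain
        if title:
            return parser.__name__, title
    return None, None


print(find_title("<html><title> Hi </title></html>", [title_in_head, title_anywhere]))
```

Returning the parser name alongside the title makes it easy to report which parser was in use, or to feed a success tally.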

14 years ago Parser could be None.
Oleg Broytman [Thu, 12 Aug 2010 09:13:59 +0000 (09:13 +0000)]
Parser could be None.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@269 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Test if m_lib is available.
Oleg Broytman [Thu, 12 Aug 2010 09:11:06 +0000 (09:11 +0000)]
Test if m_lib is available.
Return None instead of a parser if there are no parsers.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@268 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Do not parse meta charset if there is HTTP charset.
Oleg Broytman [Thu, 12 Aug 2010 09:05:40 +0000 (09:05 +0000)]
Do not parse meta charset if there is HTTP charset.
Find html in tree.childNodes skipping DocType.
Use meta charset if there is no HTTP charset.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@267 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23
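
The charset precedence described in the entry above (HTTP charset first, meta charset only as a fallback) can be illustrated with a small sketch. The function name `pick_charset`, the regular expressions and the `utf-8` default are assumptions made for this example, not the project's code.

```python
import re


def pick_charset(http_content_type, html_bytes, default="utf-8"):
    """Choose a charset: HTTP header first, then <meta>, then a default."""
    # 1. A charset in the HTTP Content-Type header wins; <meta> is not parsed.
    if http_content_type:
        m = re.search(r"charset=([\w.:-]+)", http_content_type, re.I)
        if m:
            return m.group(1).lower()

    # 2. Otherwise fall back to a charset declared in a <meta> tag.
    m = re.search(rb"<meta[^>]+charset=[\"']?([\w.:-]+)", html_bytes, re.I)
    if m:
        return m.group(1).decode("ascii").lower()

    # 3. Nothing declared anywhere: use the (assumed) default.
    return default


print(pick_charset("text/html; charset=KOI8-R", b'<meta charset="utf-8">'))
```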

14 years ago Move charset to the beginning of the list.
Oleg Broytman [Thu, 12 Aug 2010 09:02:06 +0000 (09:02 +0000)]
Move charset to the beginning of the list.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@266 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug - check if childNodes not empty.
Oleg Broytman [Wed, 11 Aug 2010 20:44:03 +0000 (20:44 +0000)]
Fixed a bug - check if childNodes not empty.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@265 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Store icon's URIs.
Oleg Broytman [Wed, 11 Aug 2010 20:08:41 +0000 (20:08 +0000)]
Store icon's URIs.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@264 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Added HTML Parser based on html5 library.
Oleg Broytman [Wed, 11 Aug 2010 20:07:57 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@263 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Added HTML Parser based on html5 library.
Oleg Broytman [Wed, 11 Aug 2010 20:07:29 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@262 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Removed parse_html_etreetidy - TidyHTMLTreeBuilder segfaults. Ouch! :(
Oleg Broytman [Wed, 11 Aug 2010 19:18:26 +0000 (19:18 +0000)]
Removed parse_html_etreetidy - TidyHTMLTreeBuilder segfaults. Ouch! :(

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@261 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Version 4.2.1.
Oleg Broytman [Wed, 11 Aug 2010 18:24:36 +0000 (18:24 +0000)]
Version 4.2.1.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@260 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Added HTML Parser based on TidyHTMLTreeBuilder.
Oleg Broytman [Wed, 11 Aug 2010 18:23:40 +0000 (18:23 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@259 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Added HTML Parser based on TidyHTMLTreeBuilder.
Oleg Broytman [Wed, 11 Aug 2010 18:22:25 +0000 (18:22 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@258 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Moved HTMLParser from parse_html_beautifulsoup.py to parse_html_util.py.
Oleg Broytman [Wed, 11 Aug 2010 17:26:11 +0000 (17:26 +0000)]
Moved HTMLParser from parse_html_beautifulsoup.py to parse_html_util.py.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@257 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23

14 years ago Fixed a bug in case there is more than one Content-Type header.
Oleg Broytman [Wed, 11 Aug 2010 16:59:40 +0000 (16:59 +0000)]
Fixed a bug in case there is more than one Content-Type header.

git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@256 fdd5c36f-1aea-0310-aeeb-c58d7e2b6c23