]>
git.phdru.name Git - bookmarks_db.git/log
Oleg Broytman [Thu, 12 Aug 2010 09:05:40 +0000 (09:05 +0000)]
Do not parse meta charset if there is HTTP charset.
Find html in tree.childNodes skipping DocType.
Use meta charset if there is no HTTP charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@267
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 12 Aug 2010 09:02:06 +0000 (09:02 +0000)]
Move charset to the beginning of the list.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@266
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:44:03 +0000 (20:44 +0000)]
Fixed a bug - check if childNodes not empty.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@265
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:08:41 +0000 (20:08 +0000)]
Store icon's URIs.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@264
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:07:57 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@263
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 20:07:29 +0000 (20:07 +0000)]
Added HTML Parser based on html5 library.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@262
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 19:18:26 +0000 (19:18 +0000)]
Removed parse_html_etreetidy - TidyHTMLTreeBuilder segfaults. Ouch! :(
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@261
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:24:36 +0000 (18:24 +0000)]
Version 4.2.1.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@260
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:23:40 +0000 (18:23 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@259
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 18:22:25 +0000 (18:22 +0000)]
Added HTML Parser based on TidyHTMLTreeBuilder.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@258
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 17:26:11 +0000 (17:26 +0000)]
Moved HTMLParser from parse_html_beautifulsoup.py to parse_html_util.py.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@257
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:59:40 +0000 (16:59 +0000)]
Fixed a bug in case there are more than one Content-Type headers.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@256
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:48:09 +0000 (16:48 +0000)]
More logging.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@255
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:47:57 +0000 (16:47 +0000)]
Set timeout to 60 seconds.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@254
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:38:30 +0000 (16:38 +0000)]
2010.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@253
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 11 Aug 2010 16:18:41 +0000 (16:18 +0000)]
Version 4.2.0 released 11 Aug 2010.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@252
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 10 Aug 2010 09:49:21 +0000 (09:49 +0000)]
Print real title, not a saved one.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@251
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 10 Aug 2010 09:46:14 +0000 (09:46 +0000)]
Added check_url.py to check URLs one by one from the command line verbosely.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@250
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 9 Aug 2010 16:25:17 +0000 (16:25 +0000)]
New storage: json; it allows to load and store Mozilla (Firefox) backup files.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@249
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 9 Aug 2010 14:51:18 +0000 (14:51 +0000)]
Unquote single quote.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@248
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 9 Aug 2010 14:49:31 +0000 (14:49 +0000)]
Skip Firefox SmartBookmarks.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@247
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 9 Aug 2010 11:59:24 +0000 (11:59 +0000)]
Skip Firefox SmartBookmarks.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@246
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 20:55:58 +0000 (20:55 +0000)]
Fixed a bug.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@245
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 19:52:53 +0000 (19:52 +0000)]
Fixed a bug.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@244
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 19:52:35 +0000 (19:52 +0000)]
Fixed parsing in case of unknown entity.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@243
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 19:46:59 +0000 (19:46 +0000)]
Fixed a bug.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@242
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 16:47:46 +0000 (16:47 +0000)]
Use Firefox backup instead of bookmarks.html.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@241
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 16:46:55 +0000 (16:46 +0000)]
Fixed a bug - parse "HTTP-Equiv" without content.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@240
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 8 Aug 2010 15:54:56 +0000 (15:54 +0000)]
Comments are always strings.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@239
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 4 Aug 2010 17:20:46 +0000 (17:20 +0000)]
Make annos for SmartBookmarks.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@238
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 4 Aug 2010 17:10:39 +0000 (17:10 +0000)]
Load/store folder's root.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@237
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 1 Aug 2010 20:18:44 +0000 (20:18 +0000)]
Fixed bugs in loading folders.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@236
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 25 Jul 2010 20:22:18 +0000 (20:22 +0000)]
Load/store keywords.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@235
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 25 Jul 2010 20:19:23 +0000 (20:19 +0000)]
Load/store comments.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@234
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 23 Jul 2010 18:38:01 +0000 (18:38 +0000)]
Store json.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@233
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 22 Jul 2010 19:02:38 +0000 (19:02 +0000)]
Implemented JSON loading.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@232
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 22 Jul 2010 19:02:08 +0000 (19:02 +0000)]
Preparing release 4.2.0. New storage: json.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@231
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 22 Jul 2010 19:01:09 +0000 (19:01 +0000)]
Copyright 2010.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@230
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 27 Sep 2009 16:07:12 +0000 (16:07 +0000)]
"BroytMann" => "Broytman".
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@229
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 11 Sep 2009 10:01:21 +0000 (10:01 +0000)]
New robot based on PycURL.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@228
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Thu, 5 Mar 2009 10:43:22 +0000 (10:43 +0000)]
Use lxml to parse broken HTML.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@227
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 21 Oct 2008 17:24:51 +0000 (17:24 +0000)]
Changed Desktop back to .mozilla.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@226
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 18 Oct 2008 19:16:44 +0000 (19:16 +0000)]
bookmarks.html in ~/Desktop.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@225
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 18 Oct 2008 19:16:11 +0000 (19:16 +0000)]
Fixed a bug - return to the STARTDIR.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@224
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 5 Oct 2008 11:22:05 +0000 (11:22 +0000)]
Strip bookmarks.html off of icons to make the file smaller.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@223
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 4 Oct 2008 20:04:27 +0000 (20:04 +0000)]
Do not use bookmarks.html from Firefox3 directory - use ~/.mozilla/ instead.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@222
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 4 Oct 2008 19:53:41 +0000 (19:53 +0000)]
Remotely sync and publish bookmarks in Mozilla format.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@221
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 4 Oct 2008 19:30:12 +0000 (19:30 +0000)]
2008.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@220
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 4 Oct 2008 19:25:45 +0000 (19:25 +0000)]
Updated for Firefox3.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@219
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 29 Jun 2008 12:54:28 +0000 (12:54 +0000)]
Process http error 307 as a temporary redirect.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@218
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 2 Apr 2008 14:55:07 +0000 (14:55 +0000)]
Renamed bkmk-set to bkmk-publish.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@217
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 1 Apr 2008 13:57:29 +0000 (13:57 +0000)]
by.ru has revived phd.by.ru mirror.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@216
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 22:16:13 +0000 (22:16 +0000)]
Fixed a bug - cd ..
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@215
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 22:15:22 +0000 (22:15 +0000)]
rm -rf bookmarks_db
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@214
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 22:14:08 +0000 (22:14 +0000)]
Version 4.1.1.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@213
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 22:11:11 +0000 (22:11 +0000)]
Version 4.1.1 (2008-03-10).
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@212
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 15:55:20 +0000 (15:55 +0000)]
Title (and refresh) can be None.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@211
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 9 Mar 2008 15:40:42 +0000 (15:40 +0000)]
Minor optimization.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@210
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 22:39:37 +0000 (22:39 +0000)]
Fixed a misspelled HTML entity.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@209
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 20:56:06 +0000 (20:56 +0000)]
Split the title into subparts, reassemble the subparts and recode.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@208
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 20:15:20 +0000 (20:15 +0000)]
New database format: pyyaml.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@207
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 20:14:12 +0000 (20:14 +0000)]
Lookup TITLE in HEAD, in HTML and in the root; test for None instead of AttributeError.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@206
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 19:44:57 +0000 (19:44 +0000)]
Fixed a misspelling.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@205
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Fri, 7 Mar 2008 19:42:54 +0000 (19:42 +0000)]
Pass charset from the command line.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@204
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 12:07:52 +0000 (12:07 +0000)]
Extract charset from "text/html; foo; charset=UTF-8, bar; baz;"
("bar" is in case there are few Content-Type headers.)
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@203
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 11:45:39 +0000 (11:45 +0000)]
Full name for "IGNORECASE".
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@202
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 11:42:38 +0000 (11:42 +0000)]
Consider application/xhtml+xml as HTML.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@201
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:55:43 +0000 (10:55 +0000)]
Ignore case for DOCTYPE.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@200
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:52:30 +0000 (10:52 +0000)]
Check root.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@199
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:52:17 +0000 (10:52 +0000)]
application/xhtml+xml is HTML, too.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@198
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:40:14 +0000 (10:40 +0000)]
There could be more than one semicolon in Content-Type header.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@197
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:32:18 +0000 (10:32 +0000)]
Reparse the HTML if the charset was changed.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@196
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 10:14:56 +0000 (10:14 +0000)]
I have never saw pages in MacCyriliic.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@195
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 09:42:31 +0000 (09:42 +0000)]
Replace ISO-8859-2 to the default encoding.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@194
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 4 Mar 2008 09:37:45 +0000 (09:37 +0000)]
Do not log TypeError.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@193
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 21:23:58 +0000 (21:23 +0000)]
In the default hierarchy "root > html > head > title" any part can be absent.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@192
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 21:13:38 +0000 (21:13 +0000)]
Log more parsers errors.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@191
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 21:03:35 +0000 (21:03 +0000)]
Always log guessed charset even if it's utf-8.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@190
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 17:50:06 +0000 (17:50 +0000)]
Charset was guessed if it is not from META and not from HTTP.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@189
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 15:22:14 +0000 (15:22 +0000)]
Fixed a bug in case there is no charset in META Content-Type.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@188
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 14:30:54 +0000 (14:30 +0000)]
Test meta charset by looking in META HTTP-Equiv.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@187
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 3 Mar 2008 14:30:09 +0000 (14:30 +0000)]
Create the list of charsets outside of the parsers test loop.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@186
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 25 Feb 2008 02:44:02 +0000 (02:44 +0000)]
is an entity that needs to be encoded.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@185
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 24 Feb 2008 21:37:24 +0000 (21:37 +0000)]
Used name2codepoint directly; recode it.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@184
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 24 Feb 2008 21:19:19 +0000 (21:19 +0000)]
Combined two "if"s.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@183
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 24 Feb 2008 13:02:24 +0000 (13:02 +0000)]
Do not unquote standard HTML entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@182
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sun, 24 Feb 2008 12:27:21 +0000 (12:27 +0000)]
Emulate log.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@181
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 23 Feb 2008 22:58:56 +0000 (22:58 +0000)]
Fixed a bug - break out of the loop after finding the first working charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@180
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 23 Feb 2008 22:40:56 +0000 (22:40 +0000)]
It is not HTTP charset, it is guessed charset.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@179
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Sat, 23 Feb 2008 22:34:24 +0000 (22:34 +0000)]
Try a list of charsets, including the universal (utf-8) and the default ones.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@178
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 13 Feb 2008 18:18:18 +0000 (18:18 +0000)]
Stop meddling with cp1252.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@177
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Wed, 13 Feb 2008 18:17:53 +0000 (18:17 +0000)]
Replace BeautifulSoup's guessed cp1252 with DEFAULT_CHARSET.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@176
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Tue, 12 Feb 2008 21:44:10 +0000 (21:44 +0000)]
current_charset is only needed in main.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@175
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 22:42:19 +0000 (22:42 +0000)]
Timeout 5 minutes is enough (Squid uses 3 minutes).
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@174
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 20:11:30 +0000 (20:11 +0000)]
Encode entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@173
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 20:11:05 +0000 (20:11 +0000)]
Removed dead code.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@172
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 19:58:39 +0000 (19:58 +0000)]
Recode entities before num. entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@171
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 19:37:12 +0000 (19:37 +0000)]
Switched to utf-8.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@170
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 16:16:11 +0000 (16:16 +0000)]
Recode HTML entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@169
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23
Oleg Broytman [Mon, 11 Feb 2008 16:15:49 +0000 (16:15 +0000)]
Recode HTML entities.
git-svn-id: file:///home/phd/archive/SVN/bookmarks_db/trunk@168
fdd5c36f-1aea-0310-aeeb-
c58d7e2b6c23