06-09-2010, 06:48 PM | #1 |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
DR800/DR1000 Website archive browser (website in .ZIP file)
As you all probably know, the DR800, and the DR1000 with 2.0 firmware, have a fairly decent HTML browser based on WebKit. Unfortunately, storing whole websites on the reader is not really feasible, since they usually contain a large number of files, making the indexing phase of the boot process much slower.
As an alternative, I have built a modified browser, based on the iRex one, that allows you to browse within a .ZIP file. The idea is that you first download the site (with wget or your favourite downloading tool), make a ZIP archive of the contents and copy that file to the reader. When you open the ZIP file from the file browser, it opens the modified browser, which looks for a /index.htm or /index.html in the ZIP file. If such a file is found, it is shown like a regular website. Images, CSS files and all other pages are also loaded from the same ZIP file.
I currently have this working on the DR800 emulator, but I want to wait for the 2.0 sources to be released before releasing the final version. If they don't release the sources soon (I'm a bit worried about the Chapter 11 situation), I'll probably do a release based on the old sources.
I now have a few questions for you guys:
- Do you think this is useful?
- What extension would you want to be associated with the modified browser? Using .ZIP of course means that all .ZIP files will be opened by this program.
- Any other feature requests or suggestions?
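The "make a ZIP archive with the contents" step can be sketched in Python as follows. This is only an illustration, not part of the modified browser: the function name `zip_site` and the directory layout are assumptions. The point it shows is that member names are stored relative to the downloaded site's top directory, so that index.html ends up at the top level of the archive.

```python
import zipfile
from pathlib import Path


def zip_site(site_dir, zip_path):
    """Pack every file under site_dir into zip_path.

    Member names are stored relative to site_dir, so site_dir/index.html
    becomes index.html at the top level of the archive.
    Returns the list of member names written.
    """
    root = Path(site_dir)
    # Collect the file list before creating the archive;
    # zip_path is assumed to live outside site_dir.
    files = sorted(p for p in root.rglob("*") if p.is_file())
    names = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # Forward-slash paths relative to the site root.
            arcname = path.relative_to(root).as_posix()
            archive.write(path, arcname)
            names.append(arcname)
    return names
```

Calling `zip_site("mirror", "site.zip")` on a downloaded copy in a (hypothetical) `mirror` directory would produce an archive that can then be copied to the reader.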
06-09-2010, 07:01 PM | #2 |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
For the people who want to know how it works:
I have modified the browser to start an embedded web server. This web server is based on libmicrohttpd ( http://www.gnu.org/software/libmicrohttpd/ ) and automatically extracts each requested file from the zip file and delivers it to the browser. The unzip part uses minizip ( http://www.winimage.com/zLibDll/minizip.html ).
The URLs look like this: http://127.0.0.1:7766/media/mmcblk0p...LES/index.html
It would be fairly easy to modify the browser to show files from other kinds of archives (it requires modifying only one function).
I've seen a few requests for an offline Wikipedia browser. Does anyone know a good archive format for Wikipedia?
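To illustrate the idea of a local web server that pulls each requested file out of a zip archive, here is a minimal Python sketch. It is not the actual implementation, which uses libmicrohttpd and minizip in the browser itself; Python's standard `http.server` and `zipfile` modules stand in for them. The names `read_member`, `make_handler` and `serve_zip` are made up for this example, and the sketch serves a single archive with the URL path mapped directly to a member name, whereas the real URLs apparently also carry the path of the archive on the device. Port 7766 is taken from the URL above.

```python
import mimetypes
import zipfile
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import unquote, urlsplit


def read_member(archive, url_path):
    """Return (body, content_type) for a URL path, or None if not in the archive."""
    # Drop any query string, decode %xx escapes, strip the leading slash.
    name = unquote(urlsplit(url_path).path).lstrip("/")
    try:
        body = archive.read(name)
    except KeyError:
        return None
    content_type = mimetypes.guess_type(name)[0] or "application/octet-stream"
    return body, content_type


def make_handler(archive):
    """Build a request handler class bound to one open ZipFile."""

    class ZipHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            found = read_member(archive, self.path)
            if found is None:
                self.send_error(404, "Not in archive")
                return
            body, content_type = found
            self.send_response(200)
            self.send_header("Content-Type", content_type)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    return ZipHandler


def serve_zip(zip_path, port=7766):
    """Serve the archive on 127.0.0.1 until interrupted."""
    with zipfile.ZipFile(zip_path) as archive:
        HTTPServer(("127.0.0.1", port), make_handler(archive)).serve_forever()
```

In this sketch, supporting another archive format would mean replacing only `read_member`, which mirrors the remark that just one function needs to change.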
06-10-2010, 03:37 AM | #3 |
Addict
Posts: 234
Karma: 214
Join Date: Nov 2008
Device: Galaxy Note 3, Galaxy NotePro 12.2, InkBook
|
That's definitely a useful piece of software. Thanks!
|
06-10-2010, 04:19 AM | #4 |
ex-IRX developer
Posts: 158
Karma: 224
Join Date: Oct 2008
Device: Irex DR800S, DR1000S, iLiad
|
It's always nice to have choices, but the DR800 (1.0/2.0) and DR1000 (2.0) already support MHT (MHTML) web archives natively.
|
06-10-2010, 04:42 AM | #5 | ||
本の虫
Posts: 242
Karma: 344
Join Date: May 2008
Location: USA
Device: Kindle3, DXG, Fire, iPad, iPod
|
Resizing with resampling for image archives would be nice. Something like how Kolenka's CBZ Document Plugin does it. (Much better looking than the built-in shrink-to-fit software.) |
||
06-10-2010, 04:46 AM | #6 |
本の虫
Posts: 242
Karma: 344
Join Date: May 2008
Location: USA
Device: Kindle3, DXG, Fire, iPad, iPod
|
|
06-10-2010, 04:47 AM | #7 |
Addict
Posts: 234
Karma: 214
Join Date: Nov 2008
Device: Galaxy Note 3, Galaxy NotePro 12.2, InkBook
|
There are a lot of ebooks and ezines that aren't in .mht but consist of hundreds of individual .html files.
Thanks, luite! |
06-10-2010, 06:22 AM | #8 | |
Addict
Posts: 225
Karma: 72
Join Date: Jan 2010
Device: irex DR-800S, M92S
|
As you know, those books cannot be read by the DR800 or DR1000, but they can easily be converted into HTML pages; they are essentially an archive of HTML pages, so this could be one use of your program. By the way, why don't you try making a CHM reader directly?
|
06-10-2010, 08:59 AM | #9 |
Tatütatuter
Posts: 390
Karma: 3148
Join Date: Oct 2008
Location: Germany, Baden, Karlsruhe
Device: Aura H2O, Glo HD, Paperwhite, PRS-950
|
Another possibility is to save the "website" or the "HTML ebook" as an MHT file, e.g. with http://maf.mozdev.org/
|
06-10-2010, 03:31 PM | #10 |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
Ah, I have to admit that I didn't know that. I was under the impression that MHT files are used for a single page, while I intend to use this for complete sites (for example Doxygen or Javadoc documentation, which often consists of hundreds of HTML files).
|
06-10-2010, 03:47 PM | #11 | |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
I have to admit that it was only a very small modification of the erbrowser code; zip was just the easiest format to support. CHM is a good suggestion, and could be done with chmlib ( http://www.jedrea.com/chmlib/ ), but I'm not sure how much work it is to properly support the table of contents and index features of CHM files.
|
|
06-12-2010, 12:01 PM | #12 | |
Guru
Posts: 730
Karma: 72743
Join Date: Feb 2008
Location: Here or there
Device: iRex iLiad, iRex DR800S. K4NT. Kobo Aura, Aura One, Libra 2.
|
|
|
06-12-2010, 01:34 PM | #13 | |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
And as promised, now that the 2.0 source code is available, here's a preview of the zip browser. It should work on both the DR800S(G) and DR1000, with the 2.0 firmware. Installing associates .web and .zip files with the program.
Keep in mind that zipbrowser looks for a file called index.html or index.htm in the root directory of the zip file. You can zip complete websites with multiple HTML files, images and other resources. MAFF/maf files are not supported yet, but I plan to add support in the future (it requires parsing some RDF files).
Installation:
- Enable "Folder based navigation" in Advanced Settings
- Connect your device via USB
- Extract the zip file to the root of your device
- Disconnect USB
- Go to SD Card / Programs / Install, click on AssociateZipbrowser.desktop
- Wait for the device to reboot
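Since zipbrowser expects index.html or index.htm in the root directory of the zip file, it may be worth checking an archive on the PC before copying it to the device. The following Python sketch does that; the function name `find_index` is made up for this example, and the order in which the two file names are tried is an arbitrary choice here, not something the post specifies.

```python
import zipfile


def find_index(zip_path):
    """Return the name of the root-level index file, or None if there is none."""
    with zipfile.ZipFile(zip_path) as archive:
        names = set(archive.namelist())
    # Only names without a directory prefix count as "in the root".
    for candidate in ("index.html", "index.htm"):
        if candidate in names:
            return candidate
    return None
```

A result of `None` often means the enclosing folder was zipped instead of its contents, so the file sits at something like `mysite/index.html` rather than in the root.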
|
06-13-2010, 12:16 AM | #14 |
Connoisseur
Posts: 82
Karma: 256
Join Date: Feb 2010
Location: Netherlands
Device: dr1000
|
Oops, I found a bug. The installation script incorrectly associated the djvu file format with zipbrowser if you had installed the preview of the djvu plugin for the new firmware ( https://www.mobileread.com/forums/sho...t=73790&page=2 ) before you installed zipbrowser. If you experience this problem, just reinstall the djvu plugin.
Here's an updated zipbrowser that fixes this. |
06-14-2010, 12:25 PM | #15 | |
Junior Member
Posts: 6
Karma: 10
Join Date: Apr 2010
Device: dr1000s
|
Thanks.
|
|
|