11-06-2023, 06:44 PM | #1 |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
How-To get Gutenberg-DE HTML to calibre (EPUB)?
Hi, I just bought the Gutenberg-DE edition 16.
I want to convert my favorite books to EPUB and transfer them to my Kobo Libra 2. Sounds easy? Well, http://www.epub2go.eu/ does the job for the online Gutenberg-DE repository, but not for the locally installed version. I added the index.html of one book to calibre; it reports 'importing metadata' and then stalls (after 20 min at 0%, I gave up). Any experience or tricks on how I can do it? Note: I have already asked the Gutenberg-DE service why they cannot make this a bit easier, e.g., with a calibre plugin. |
11-06-2023, 10:09 PM | #2 |
creator of calibre
Posts: 44,565
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I don't know what Gutenberg-DE is, but assuming it's a collection of HTML files, just let calibre run. It has to parse all the HTML files looking for images, stylesheets, links, etc., so for a large collection it will take time.
|
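To illustrate why link-following gets expensive on a large collection, here is a minimal sketch of a breadth-first walk over local links. It is not calibre's actual implementation; `LinkCollector`, `reachable_files`, and the `max_levels` parameter are hypothetical names chosen for this example.

```python
from collections import deque
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import unquote, urldefrag, urlparse


class LinkCollector(HTMLParser):
    """Collect href/src attribute values from one HTML document."""

    def __init__(self):
        super().__init__()
        self.targets = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.targets.append(value)


def reachable_files(root_html, max_levels=5):
    """Return every local file reachable from root_html by following links.

    max_levels is a hypothetical depth limit: links found on a page at
    that depth (the root is depth 0) are no longer followed.
    """
    root = Path(root_html).resolve()
    seen = {root}
    queue = deque([(root, 0)])
    while queue:
        page, level = queue.popleft()
        if page.suffix.lower() not in (".html", ".htm", ".xhtml"):
            continue  # images, stylesheets, PDFs: recorded but not parsed
        if level >= max_levels:
            continue
        parser = LinkCollector()
        parser.feed(page.read_text(encoding="utf-8", errors="replace"))
        for target in parser.targets:
            url, _fragment = urldefrag(target)
            parsed = urlparse(url)
            if not url or parsed.scheme or parsed.netloc:
                continue  # skip pure anchors and remote links
            candidate = (page.parent / unquote(url)).resolve()
            if candidate.is_file() and candidate not in seen:
                seen.add(candidate)
                queue.append((candidate, level + 1))
    return seen
```

Running `len(reachable_files("index.html"))` on a book's entry page gives a rough idea of how many files a link-following importer might have to touch. If that number is far larger than the book itself, the page probably links out into the rest of the collection.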
11-07-2023, 02:30 AM | #3 |
Belgian Pommes Frites
Posts: 116
Karma: 532
Join Date: Jan 2012
Device: Pocketbook Touch HD
|
It can be found here: https://www.projekt-gutenberg.org/
11789 Books from 2476 Authors in HTML-Format will take a LONG time to convert to Epub. |
11-07-2023, 03:35 PM | #4 | |
Connoisseur
Posts: 67
Karma: 143132
Join Date: Sep 2010
Device: Kindle Keyboard 3G
|
Quote:
a) Have you tried opening the index.html with the calibre E-book editor (right-click, Open with, E-book editor)? b) The books are on CD, right? Perhaps you must first copy the book from the CD to a local drive. Greetings, Maria |
|
11-07-2023, 04:24 PM | #5 |
Bibliophagist
Posts: 40,603
Karma: 157444382
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
If Gutenberg.de supplies the HTML books in .zip files, which Gutenberg.org is fond of doing, I've found that I need to unpack each .zip file into a temporary directory and then import it into calibre. The one time I tried to import ~200 books, I ended up using 7-Zip and calibre from the command line, with a loop in a batch file to step through the directories.
|
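The unpack-then-loop approach described above could be sketched roughly as follows. This is not the batch file mentioned in the post: it swaps 7-Zip for Python's standard `zipfile` module, assumes each archive holds one book whose entry page is named index.html, and uses the `ebook-convert` call shown elsewhere in this thread instead of a library import. The function names are hypothetical.

```python
import subprocess
import tempfile
import zipfile
from pathlib import Path


def unpack_archives(zip_dir, work_dir):
    """Extract every .zip in zip_dir into its own subdirectory of work_dir."""
    extracted = []
    for archive in sorted(Path(zip_dir).glob("*.zip")):
        target = Path(work_dir) / archive.stem
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)
    return extracted


def find_entry_page(book_dir):
    """Return the shallowest index.html below book_dir, or None if absent."""
    candidates = sorted(
        Path(book_dir).rglob("index.html"),
        key=lambda p: (len(p.parts), str(p)),
    )
    return candidates[0] if candidates else None


def convert_all(zip_dir, out_dir):
    """Unpack each archive into a temporary directory and convert it to EPUB."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with tempfile.TemporaryDirectory() as work_dir:
        for book_dir in unpack_archives(zip_dir, work_dir):
            entry = find_entry_page(book_dir)
            if entry is None:
                print(f"no index.html in {book_dir.name}, skipping")
                continue
            output = out_dir / f"{book_dir.name}.epub"
            # Assumes calibre's ebook-convert is on the PATH.
            subprocess.run(["ebook-convert", str(entry), str(output)], check=False)
```

Calling `convert_all("zips", "epubs")` would process every archive in a `zips` folder. The temporary directory is deleted afterwards, so only the EPUB files remain.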
11-07-2023, 04:44 PM | #6 | |
the rook, bossing Never.
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
I suppose we are lucky Gutenberg aren't solely offering plain text, which they did before adding HTML and mobi (not called old Kindle till later). |
|
11-07-2023, 08:02 PM | #7 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
Quote:
|
|
11-07-2023, 08:05 PM | #8 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
Quote:
Yes, I first copied the .zip download (7.6 GB) to my SSD and unzipped it. Now I am trying to open it with the calibre E-book editor. Quite annoying. Why don't they offer EPUBs, or a decent converter or calibre plugin à la epub2go? Best, Robert |
|
11-09-2023, 03:34 PM | #9 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
Quote:
~~~
rob@robert-winkler-Lenovo ~/d/g/a/judas> ebook-convert index.html andrejew_judas.epub
1% Eingabe wird zu HTML konvertiert*…
InputFormatPlugin: HTML Input running on /home/rob/dataspace/gutenberg-edition16/andrejew/judas/index.html
Language not specified
Building file list...
IgnoreFile('/home/rob/dataspace/gutenberg-edition16/info/texte/Vom-Antiquariat-zum-E-Text.pdf is a binary file')
IgnoreFile('/home/rob/dataspace/gutenberg-edition16/plautus/asinaria/asinaria.pdf is a binary file')
IgnoreFile('/home/rob/dataspace/gutenberg-edition16/plautus/epidicus/epidicus.pdf is a binary file')
IgnoreFile('/home/rob/dataspace/gutenberg-edition16/plautus/mercator/mercator.pdf is a binary file')
IgnoreFile('/home/rob/dataspace/gutenberg-edition16/plautus/mostell1/mostell1.pdf is a binary file')
~~~
etc.
This indicates that calibre is scanning through all the directories? By chance, do you have a working script to process a single directory? |
|
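One possible way to process a single directory is to copy that book into an otherwise empty scratch folder before converting, so that relative links pointing back into the rest of the collection no longer resolve to existing files. The sketch below rests on the assumption that such outward links are what make calibre visit the other directories; the log above suggests this but does not prove it. The function names are hypothetical. `--max-levels` is calibre's HTML input option for limiting how deep links are followed, and it is left off by default here.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def isolate_book(book_dir, scratch_root):
    """Copy one book directory into scratch_root and return the copy's path."""
    source = Path(book_dir)
    copy = Path(scratch_root) / source.name
    shutil.copytree(source, copy)
    return copy


def build_command(index_html, output_epub, max_levels=None):
    """Assemble the ebook-convert argument list without running it."""
    command = ["ebook-convert", str(index_html), str(output_epub)]
    if max_levels is not None:
        # Optional depth limit for link-following in the HTML input.
        command += ["--max-levels", str(max_levels)]
    return command


def convert_single_book(book_dir, output_epub, max_levels=None):
    """Convert one book from an isolated copy so outward links dangle."""
    with tempfile.TemporaryDirectory() as scratch_root:
        copy = isolate_book(book_dir, scratch_root)
        # Assumes the entry page is named index.html, as in the log above.
        command = build_command(
            copy / "index.html", Path(output_epub).resolve(), max_levels
        )
        return subprocess.run(command, check=False).returncode
```

For the example in this post, that would be `convert_single_book("andrejew/judas", "andrejew_judas.epub")`, run from the collection's top-level folder. Links written as absolute paths would still resolve, so this only helps with relative links.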
11-09-2023, 04:19 PM | #10 |
Bibliophagist
Posts: 40,603
Karma: 157444382
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
What does index.html contain? Can you post an image of the structure of the .zip file? One publisher used to include an index.html on their CDs that listed all the ebooks and formats on the CD, so you needed to import from the subdirectories to get the ebooks.
|
11-12-2023, 01:38 PM | #11 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
Quote:
An example index.html (in the subdirectory of the book) looks as follows. Yes, I would import the books one by one from the subdirectory. Spoiler:
Last edited by theducks; 11-12-2023 at 07:24 PM. Reason: Spoiler logs. |
|
11-13-2023, 02:28 AM | #12 |
Belgian Pommes Frites
Posts: 116
Karma: 532
Join Date: Jan 2012
Device: Pocketbook Touch HD
|
When I look at the "Impressum" webpage of the website where you bought it, it is clear that it is not gutenberg.org that manages it, but a private publisher. On this page they state they can deliver the stuff in epub - see https://www.abc.de/
So, ask them to send it all to you in epub format instead of html. |
11-14-2023, 03:42 PM | #13 |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2023
Device: Kobo Libra 2
|
SOLVED: conversion of Gutenberg-DE HTML to epub
With https://github.com/JohnButzel/Gutenberg2Epub you can extract a book from Gutenberg-DE, either online or from a local copy. The resulting output is compatible with calibre and an e-book reader (tested on a Kobo Libra 2).
Thanks a lot to the author!!! |
Tags |
calibre, epub, gutenberg-de, html, kobo libra 2 |
|