Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 03-09-2022, 12:56 PM   #1
cow
Just a Cow
cow began at the beginning.
 
cow's Avatar
 
Posts: 11
Karma: 10
Join Date: Mar 2022
Location: Pasture
Device: Kobo Sage
Arrow Large book with a 2-column structure, PDF -> EPUB .. any tips on this one?

Hello

I'm looking for a tips converting this PDF to EPUB (if it is even realistically possible). My Kobo Sage can handle PDFs just fine, but they are much slower than EPUB

https://theholychristianchurch.org/d...iffe-bible.pdf

The file is very clean and well-structured, but I'm having difficulties getting a nice epub conversion out of it.

The entire document is comprised of two columns, which the epub converter doesn't seem to like. Any tips on converting this?

Also, is it possible to only convert, for example, the first 50 pages so I can get an idea of how how the conversion is going without waiting for the entire 1100-page book to process? I've looked through all the settings over and over and I don't see any option to do this. It takes quite a long time

Any tips appreciated. Thank you
cow is offline   Reply With Quote
Old 03-09-2022, 01:32 PM   #2
cow
Just a Cow
cow began at the beginning.
 
cow's Avatar
 
Posts: 11
Karma: 10
Join Date: Mar 2022
Location: Pasture
Device: Kobo Sage
I can't seem to edit my post, I wanted to update before anyone here spent time replying.

I may have found a way to pull this off. I found a utility on Linux called "pdf2htmlex" and got a brilliant, near-perfect HTML conversion out of it. Maybe this can help someone else in the future.

I am now converting the HTML to EPUB in Calibre to see if that works better (I assume it should). I will post again with screenshots again once it finishes and report back - taking a very long time.

This is how the HTML conversion came out with pdf2htmlex:

Click image for larger version

Name:	ytf3v5.png
Views:	157
Size:	208.2 KB
ID:	192655

Last edited by BetterRed; 03-09-2022 at 03:32 PM. Reason: image too big - thumbnailed
cow is offline   Reply With Quote
Advert
Old 03-09-2022, 03:06 PM   #3
retiredbiker
Evangelist
retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.
 
retiredbiker's Avatar
 
Posts: 410
Karma: 2289864
Join Date: May 2013
Location: Ontario, Canada
Device: Kindle KB, Oasis, Pop_Os!, Jutoh, Kobo Forma
In general, pdfs are the worst files to convert. See the sticky post for a lot of details: Read this before Posting PDF Questions

This particular file does seem to have excellent text. You can select and paste it, and it seems to be complete and in order. Unlike so many pdfs. So in theory it might convert well.

But the two columns are the killer. Even if the text comes out of the two columns un-garbled in a conversion, epub does not have any easy two column display. And it's pretty obvious you want this book to preserve the columns as they are, not only as columns but with the entries aligned as in a table.

So to make this into an epub you would have to put the text into tables. That would be a manual job, a huge one, (unless someone knows of a tool to automate it). And tables have real troubles on an e-reader device if a user changes text size, for example, or if the table entries don't fit the reader screen size.

A book like this is best left as a pdf, IMHO.
retiredbiker is offline   Reply With Quote
Old 03-09-2022, 03:46 PM   #4
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,772
Karma: 27405072
Join Date: Mar 2012
Location: Sydney Australia
Device: none
@cow - you need make some more posts before you can edit your own posts - play some games in the lounge. Your second post went into the moderation queue. Re the image size see ==>> Guideline #9

BR
BetterRed is offline   Reply With Quote
Old 03-09-2022, 04:29 PM   #5
cow
Just a Cow
cow began at the beginning.
 
cow's Avatar
 
Posts: 11
Karma: 10
Join Date: Mar 2022
Location: Pasture
Device: Kobo Sage
Quote:
Originally Posted by BetterRed View Post
@cow - you need make some more posts before you can edit your own posts - play some games in the lounge. Your second post went into the moderation queue. Re the image size see ==>> Guideline #9

BR
Understood, thanks!

Quote:
Originally Posted by retiredbiker View Post
In general, pdfs are the worst files to convert. See the sticky post for a lot of details: Read this before Posting PDF Questions

This particular file does seem to have excellent text. You can select and paste it, and it seems to be complete and in order. Unlike so many pdfs. So in theory it might convert well.

But the two columns are the killer. Even if the text comes out of the two columns un-garbled in a conversion, epub does not have any easy two column display. And it's pretty obvious you want this book to preserve the columns as they are, not only as columns but with the entries aligned as in a table.

So to make this into an epub you would have to put the text into tables. That would be a manual job, a huge one, (unless someone knows of a tool to automate it). And tables have real troubles on an e-reader device if a user changes text size, for example, or if the table entries don't fit the reader screen size.

A book like this is best left as a pdf, IMHO.
Yeah, the only reason I wanted to keep the columns is because of the side-by-side comparison format that the book is built upon. My PDF conversion did indeed fail spectacularly. Once I got the epub added to my reader, I realized why the side by side format wouldn't work.. still makes for tiny text and requires zooming, same as the PDF. I guess I was expecting the impossible and didn't think enough about it.

I suppose one possibility would be to break these columns out into alternating rows and use white background color for Wycliffe and light gray background color for KJV.. or something to that nature. If I was able to get a coherent conversion from PDF to EPUB, I could probably do this work in Sigil with regular expressions and other tools.

But the problem is first getting that initial conversion so I can start editing it manually.

It turns out that the converter I used put EVERY SINGLE LETTER IN THE BOOK into individual <span>S</span> objects, each with their own pixel-perfect absolute positioning. So there wasn't much I could do with it.

Maybe I'd be better off just buying a hard copy
cow is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Ultimate PDF to Epub/Mobi conversion tips sinan Workshop 43 08-01-2017 12:46 AM
Epub to Pdf results in very large font size. Joan M Conversion 5 05-12-2015 08:15 AM
large textbooks epub or pdf? gris Onyx Boox 1 01-03-2014 07:19 PM
best output epub to pdf large monitor CineMan Conversion 14 11-25-2013 10:35 AM
2 column PDF book to 1 column possible? SeaBookGuy Calibre 19 07-01-2013 02:30 AM


All times are GMT -4. The time now is 03:33 PM.


MobileRead.com is a privately owned, operated and funded community.