#1 |
Enthusiast
Posts: 36
Karma: 1510058
Join Date: Dec 2008
Device: Various Kobo readers, Sony readers from the past.
|
Extracting text
I prefer to format my own ebooks using OpenOffice. Is there a quick and easy way of extracting the text [either as plain text, but preferably as RTF] from ebooks?
|
#2 |
Liseuse Lover
Posts: 869
Karma: 1035404
Join Date: Jul 2008
Location: Netherlands
Device: PRS-505
|
"Ebooks" is a rather broad term encompassing a metric crapload of formats. Calibre can convert many formats to and from many formats. Not all, and not all will look as dashing.
|
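For DRM-free files, calibre ships a command-line converter, `ebook-convert`, which infers the input and output formats from the file extensions. A minimal sketch of driving it from Python, assuming calibre is installed and `ebook-convert` is on the PATH; the helper names `build_convert_command` and `convert` and the file names are hypothetical, made up for illustration:

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(source, target_ext="rtf"):
    """Return the ebook-convert argument list for converting `source`.

    The output file sits next to the input and differs only in extension.
    """
    src = Path(source)
    dst = src.with_suffix("." + target_ext.lstrip("."))
    return ["ebook-convert", str(src), str(dst)]


def convert(source, target_ext="rtf"):
    """Run the conversion; raises if calibre is missing or the conversion fails."""
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found; is calibre installed?")
    cmd = build_convert_command(source, target_ext)
    subprocess.run(cmd, check=True)
    return cmd[-1]  # path of the converted file
```

Calling `convert("book.epub")` would write `book.rtf` beside the original; passing `"txt"` as the second argument would ask for plain text instead. How good the result looks depends on the source file.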
#3 |
Wizard
Posts: 3,490
Karma: 5239563
Join Date: Jan 2008
Location: Denmark
Device: Kindle 3|iPad air|iPhone 4S
|
At the moment I prefer to read PDFs on my iREx digital reader, so I have some of the same issues as you do.
How to extract the text depends on what type your source files are. First, you will need to remove the DRM. AFAIK this is not possible with .LRF (BBeB) files, but it is with many others, such as epub, prc and lit. With a DRM-free file, you can do a number of things. The easiest way I've found is to open the file in Stanza (a reader application), copy all, and paste into OpenOffice. There are other ways to get at the text, but most often the source is a collection of HTML files, and with Stanza you get it all in one go. I haven't tried this with DRM'd files, but I doubt it will work. calibre can also convert to a number of formats, but not as many as Stanza. As far as I remember you can also convert to RTF directly in calibre, but the quality was not usable for me; perhaps it is for you.
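Since a DRM-free epub is a zip archive of HTML files, the text can also be pulled out with a few lines of Python. This is only a rough sketch using the standard library: the names `TextExtractor`, `html_to_text` and `epub_to_text` are made up for illustration, and it assumes that sorting the file names gives roughly the reading order (the real order is listed in the book's OPF file, which this sketch does not read).

```python
import zipfile
from html.parser import HTMLParser

BLOCK_TAGS = {"p", "div", "br", "h1", "h2", "h3", "h4", "h5", "h6", "li", "tr"}
SKIP_TAGS = {"script", "style", "head"}


class TextExtractor(HTMLParser):
    """Collect visible text, putting line breaks around block-level tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0  # > 0 while inside <head>, <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth:
            self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(markup):
    """Strip tags from one HTML document and drop empty lines."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    lines = (line.strip() for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)


def epub_to_text(path):
    """Concatenate the text of every HTML file inside a DRM-free epub."""
    chunks = []
    with zipfile.ZipFile(path) as book:
        # Assumption: name order approximates reading order.
        for name in sorted(book.namelist()):
            if name.lower().endswith((".html", ".xhtml", ".htm")):
                markup = book.read(name).decode("utf-8", errors="replace")
                chunks.append(html_to_text(markup))
    return "\n\n".join(chunk for chunk in chunks if chunk)
```

The result of `epub_to_text("book.epub")` is plain text that can be pasted or opened in OpenOffice; all formatting (italics, headings) is lost, so this suits people who restyle the book from scratch anyway.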
#4 |
Enthusiast
Posts: 36
Karma: 1510058
Join Date: Dec 2008
Device: Various Kobo readers, Sony readers from the past.
|
Thanks for that. I too convert my files to PDF. I'll have a look at Stanza.
|