05-25-2017, 05:18 PM | #1 |
Junior Member
Posts: 3
Karma: 31958
Join Date: Apr 2017
Device: Kindle PW3
|
Convert Image PDF to PDF with text or other ebook format.
Hello,
I have been lurking these forums for a few months now, and I'm proud to say I have finally joined the ranks as someone who reads on an e-ink display! It's great I'm not wasting my phone battery or hunched over my laptop anymore. However, most of my books I wish to read are books about the game of go. What I want to do is take my PDFs and convert them to PDFs with text. (instead of an image on each page its text and an image) This will make the text available for me to highlight and make notes (I own a kindle paperwhite) and will make it easier for me to format using k2pdfopt. (when I try to reflow a PDF it makes the diagram text below a diagram unreadable because it's so small) Here is an example of what I'm working with: http://imgur.com/a/18fPm Thank you very much for you time and I'm sorry if this question was already answered. P.S. I don't want to extract only the text I want to keep the whole of the document. Last edited by Memes; 05-25-2017 at 05:20 PM. |
05-26-2017, 10:51 PM | #2 | |
Fuzzball, the purple cat
Posts: 1,286
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
Code:
k2pdfopt -mode copy -n- -ocr t -odpi 100 -as go.pdf |
|
06-02-2017, 08:31 PM | #3 |
Guru
Posts: 929
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
I usually use Elucidate. It's a Mac front-end for Tesseract.
However, Tesseract sometimes drops the first letters from words, or reads things in the wrong order. I can't select an entry from a table of contents, to check it in my translation software, because it selects the next page, and inserts that into the entry... |
06-09-2017, 04:27 PM | #4 | |
Junior Member
Posts: 3
Karma: 31958
Join Date: Apr 2017
Device: Kindle PW3
|
Quote:
|
|
06-10-2017, 04:01 PM | #5 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
|
06-11-2017, 12:47 AM | #6 |
Fuzzball, the purple cat
Posts: 1,286
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Or you can use the MS Windows GUI:
1. Under conversion mode, select "copy" 2. Check the autostraighten box 3. Check the OCR box 4. Uncheck the native output box 5. Set the output DPI as desired if you do all this, the auto-generated command-line options should mostly look like what I put in my previous post. |
06-16-2017, 04:29 AM | #7 |
Junior Member
Posts: 3
Karma: 31958
Join Date: Apr 2017
Device: Kindle PW3
|
Thank you everyone very much for your help and your time I really appreciate it!
|
05-01-2023, 05:52 PM | #8 | |
Junior Member
Posts: 1
Karma: 10
Join Date: May 2023
Device: Kindle Paperwhite 11
|
Thank you willus
Quote:
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Convert epub to pdf, with notes with main text in the pdf? | 8140david | ePub | 1 | 06-18-2015 02:13 PM |
Convert epub to pdf, with notes with main text in the pdf? | 8140david | Conversion | 1 | 06-18-2015 12:02 PM |
PDF ebook convert to Kindle format - with a lot empty lines | subarux | Calibre | 4 | 12-28-2010 10:53 PM |
Convert PDF To Sony eBook Format? | Sjwdavies | Sony Reader | 12 | 12-13-2009 04:15 AM |