04-24-2023, 06:22 AM | #1 |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
Looking for K2pdfopt Help
Hello,
I've got a scan of an old book in PDF that I'd like to read on my Kindle (5th Gen.). For whatever reason, once on device pages appear blurry or blank (based on a post by Willus in another thread I suspect it's because the pdf may be composed of JPX images?). When I run it through K2pdfopt I can view the pages, the only problem is it seems no matter what setting adjustments I make to the conversion, I'm still getting some pages that are cut up in random ways (see attached sample, converted with setting bp 3, -1). Is there a way to adjust the settings in K2pdfopt so that it simply converts the pdf to display as it is successfully doing, but without making any other crop or margin type adjustments at all? thanks so much! |
04-25-2023, 07:42 AM | #2 |
Connoisseur
Posts: 63
Karma: 302424
Join Date: Aug 2019
Location: Serbia, former Yugoslavia
Device: Pocketbook InkPad 3
|
Whics conversion parameters did you applied?
Last edited by VladimirS; 04-25-2023 at 07:45 AM. |
04-25-2023, 07:48 AM | #3 |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
|
04-26-2023, 05:19 AM | #4 |
Connoisseur
Posts: 63
Karma: 302424
Join Date: Aug 2019
Location: Serbia, former Yugoslavia
Device: Pocketbook InkPad 3
|
You can allways make exact copy of original book using :
k2pdfopt -mode copy filename.pdf. This will make exact copy, without any modifications, without any cropping or similar adjustments. |
04-26-2023, 07:27 AM | #5 | |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
Quote:
Thanks so much for your help!! |
|
04-27-2023, 05:42 AM | #6 |
Connoisseur
Posts: 63
Karma: 302424
Join Date: Aug 2019
Location: Serbia, former Yugoslavia
Device: Pocketbook InkPad 3
|
No problem, we are here to help each other.
If you do not want to change contrast, and add OCR, you can run K2pdfopt as follows k2pdfopt -mode copy -cmax 1.0 -g 1.0 -ocr filename.pdf |
04-28-2023, 07:54 AM | #7 | |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
Quote:
The only issue I'm having is with the OCR. It almost seems as if only every other page is navigable by cursor on the kindle (the cursor get's stuck on the outside of the page—in spite of the text being highlightable on the desktop). Any thoughts on how to resolve that? I did notice their are several OCR options, I've been doing mupdf with default settings. Cheers mate Steven |
|
04-29-2023, 06:50 AM | #8 |
Connoisseur
Posts: 63
Karma: 302424
Join Date: Aug 2019
Location: Serbia, former Yugoslavia
Device: Pocketbook InkPad 3
|
OCR is better as days go by, but it has it's limits.
Good OCR job depends on original photo (individual pdf page). Page needs to be without distorsions, flat, with good contrast between text and background. Scanning of books and artefacts of scanning makes OCR job more difficult. Many small imperfections, shades of background ... If I find some possible solution, I will notice you. Cheers |
04-29-2023, 10:45 PM | #9 |
Fuzzball, the purple cat
Posts: 1,283
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Hi all--sorry to have missed this thread. It's been a busy week. Thank you VladimirS for your help.
A few things: 1. To force no contrast change, actually -cmax -1 is better than -cmax 1, but it hardly matters. 2. If you want an exact replica, also turn off sharpening: -s- 3. Is this being run on Linux? Mac? What version? For the latest versions of k2pdfopt, you can actually just directly enter the command-line arguments into the "Enter option above" prompt. It should work. 4. Are you doing tesseract OCR? You might try different detection options and see if one works better than another. E.g. -ocrd p will entirely use Tesseract's own algorithm for finding text on the page, whereas -ocrd l will have k2pdfopt submit the OCR graphics line by line. If you want to PM me the PDF source you are trying to convert, I'd be happy to recommend options. |
05-07-2023, 03:22 PM | #10 | |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
Quote:
|
|
05-07-2023, 03:27 PM | #11 | |
Junior Member
Posts: 6
Karma: 55624
Join Date: Apr 2023
Device: kindle 5th gen
|
Quote:
It seems I was getting the best results with mupdf OCR. Will PM you the specific pdf I'm seeking to convert in this particular instance. Thanks again! |
|
05-17-2023, 01:50 AM | #12 |
Fuzzball, the purple cat
Posts: 1,283
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
I found these settings worked pretty well:
Code:
k2pdfopt -p 50-52 -mode copy -c- -m .25,.42,.25,1 -t -om 0.25 -as -ocr t input.pdf -o output.pdf The -mode copy defaults to copying the source page size The -m arguments ignore the left 0.25 inches, the top 0.42 inches, the right 0.25 inches, and the bottom 1 inch of each source page The -t trims to the text The -om adds a small blank border to the output edges (0.25 in) The -as auto-straightens (de-skews) each page The -ocr t uses Tesseract OCR |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
KOReader / k2pdfopt integration | Thelele | KOReader | 7 | 11-22-2021 03:41 PM |
k2pdfopt for yellow/brown pages | mike.foster | 9 | 01-19-2018 06:50 PM | |
k2pdfopt segmentation fault | mike.foster | 3 | 11-23-2015 11:39 PM | |
Touch pdf and k2pdfopt on kobo | metita | Kobo Reader | 7 | 08-23-2015 06:14 AM |
Problem with reconverting k2PDFOpt to EBUB | ittiandro | Conversion | 7 | 08-19-2014 07:43 PM |