10-14-2012, 10:27 AM | #196 | |
Fuzzball, the purple cat
Posts: 1,283
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
10-14-2012, 01:19 PM | #197 | |
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
|
Quote:
So it is 2 or 3 step process for PDF image. 1. Quick OCR-ing by Abyy, Acrobat etc. because there is usually no need for a great OCR behind the image. 2. Cropping roughly by Briss, eliminating headers/footers if needed (soPdf removes headers/footers like page numbers automatically). 3. Cropping in soPdf or k2pdfopt. Often k2pdfopt should be enough as standalone (i.e. 1 step process) though, even for pure image (non OCR-ed). With soPdf OCR layer stays there after cropping and PDF is about the same size i.e no rasterization involved that makes PDF bigger as with k2pdfopt. Example: 1st picture is original, 8 pages of scanned pdf OCR-ed. 2nd picture is that original croped by Briss (just roughly i.e. not getting very close to the text proper but headers cropped) 3d picture is original cropped by briss and then cropped additionally in soPdf (to fit hight). 4th picture is original cropped in soPdf directly. 1 2 3 4 -click on a picture to enlarge view As we can see soPdf didn't cut those two left margins on two pages (4th picture) when directly applied, whereas after cropping in Briss soPdf cropped those two margins correctly and we eliminated headers/footers by Briss also. Briss and soPdf or k2pdfopt are complementary because usually there are pages that stick out in Briss (inch or half of an inch from stacked majority on odd or even pages) and we can freely include them all for cropping if we are to use soPdf or k2pdfopt after Briss for very precise cropping. Last edited by markom; 09-05-2014 at 09:42 PM. |
|
Advert | |
|
11-20-2012, 04:54 PM | #198 | |
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
|
Quote:
http://www.verydoc.com/pdf-margin-crop.html It is not freeware but we can have about 40 trial croppings. Just enter some margin value like 5X5x5x5 points or 10x10x10x10 and it will crop text based pdf nicely. For pdf scan (with or without ocr layer) it is again (as in the case of soPdf) advisable to crop pdf roughly beforehand by Pdf-Scissors or Briss and then apply PDF Margin Crop, results were always good for me that way. I also used to crop margins of text based pdfs by printing them in virtual printer, in Adobe Reader, Acrobat etc. We should first check exact dimensions of text (usually under comments/measure/distance) and then print with auto-center box checked and corresponding Width x Hight. Last edited by markom; 11-20-2012 at 06:36 PM. |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Yet another PDF to LRF converter | cacapee | LRF | 583 | 11-28-2011 06:50 AM |
comiclrf - Comics(CBZ) to LRF converter | FangornUK | LRF | 274 | 06-16-2010 02:24 PM |
Quick/easy LIT to LRF converter? | OUTATIME | Sony Reader Dev Corner | 10 | 02-29-2008 09:44 AM |
Anyone else want chm to lrf converter? | buster | Sony Reader | 10 | 02-09-2008 05:07 PM |
PRS-500 Linux based HTML to LRF converter? | Thiana | Sony Reader Dev Corner | 3 | 04-08-2007 02:34 AM |