Old 10-14-2012, 10:27 AM   #196
willus
Fuzzball, the purple cat
Posts: 1,283
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by ectoplasm View Post
This is actually pretty sweet for automatically cropping text based PDF page margins. This is the first tool I found that does this automatically. If there are others, please comment. I'm not interested in the programs where you have to select a region by hand.
Try the PDF forum thread listing. The top half dozen threads, the ones marked with a "sticky" icon, all discuss PDF tools for mobile reading, and almost all of them do some form of auto-cropping.
Old 10-14-2012, 01:19 PM   #197
markom
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
Quote:
Originally Posted by ectoplasm View Post
This is actually pretty sweet for automatically cropping text based PDF page margins. This is the first tool I found that does this automatically. If there are others, please comment. I'm not interested in the programs where you have to select a region by hand.
But if the PDF is a scanned image with a text layer behind it, we should be very much interested in those programs, because it often pays to crop such a PDF first in Briss, PdfScissors, A-PDF Page Crop, etc., and only then use soPdf or k2pdfopt, for a much better result.

So for an image PDF it is a two- or three-step process:

1. Quick OCR with ABBYY, Acrobat, etc. (there is usually no need for a great OCR layer behind the image).
2. Rough cropping in Briss, eliminating headers/footers if needed (soPdf removes headers/footers such as page numbers automatically).
3. Final cropping in soPdf or k2pdfopt.

Often, though, k2pdfopt alone is enough (i.e. a one-step process), even for a pure image PDF (not OCR-ed).
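
To make the idea of auto-cropping a pure image page concrete, here is a minimal sketch of the general technique: find the smallest box that contains every non-white pixel. This is not k2pdfopt's or soPdf's actual algorithm. The function name `content_bbox`, the `white_threshold` parameter and the tiny sample page are all hypothetical, chosen only for illustration.

```python
from typing import List, Optional, Tuple

# (left, top, right, bottom) in pixels; right and bottom are exclusive.
Box = Tuple[int, int, int, int]


def content_bbox(page: List[List[int]], white_threshold: int = 200) -> Optional[Box]:
    """Return the smallest box containing every pixel darker than the threshold.

    `page` is a grayscale raster: a list of rows, each a list of 0..255 values.
    The threshold of 200 is an assumed value, not taken from any real tool.
    """
    if not page or not page[0]:
        return None
    # Rows and columns that contain at least one "ink" pixel.
    rows = [y for y, row in enumerate(page) if any(v < white_threshold for v in row)]
    cols = [x for x in range(len(page[0])) if any(row[x] < white_threshold for row in page)]
    if not rows or not cols:
        return None  # blank page: nothing to crop to
    return (cols[0], rows[0], cols[-1] + 1, rows[-1] + 1)


# A tiny 6x5 "page": 255 is white paper, 0 is ink.
page = [
    [255, 255, 255, 255, 255, 255],
    [255, 255,   0,   0, 255, 255],
    [255, 255,   0, 255, 255, 255],
    [255, 255, 255,   0, 255, 255],
    [255, 255, 255, 255, 255, 255],
]
print(content_bbox(page))
```

A sketch like this also shows why a rough manual crop helps on scans: a dark scanner edge or a header counts as "ink", so the detected box stretches out to include it.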

With soPdf the OCR layer stays in place after cropping and the PDF stays about the same size, i.e. there is no rasterization, which is what makes the PDF bigger with k2pdfopt.

Example:

1st picture is the original: 8 pages of a scanned PDF, OCR-ed.
2nd picture is the original cropped in Briss (just roughly, i.e. not getting very close to the text itself, but with the headers cropped).
3rd picture is the original cropped in Briss and then cropped further in soPdf (to fit height).
4th picture is the original cropped in soPdf directly.

(Pictures 1 to 4 were attached to the original post.)

As we can see, when applied directly soPdf did not trim the left margins on two pages (4th picture), whereas after the rough crop in Briss it cropped those two margins correctly, and Briss also eliminated the headers/footers.

Briss and soPdf (or k2pdfopt) are complementary. In Briss there are usually some pages that stick out (by an inch or half an inch from the stacked majority of odd or even pages), and we can freely include them all in the crop region as long as soPdf or k2pdfopt does the very precise cropping afterwards.
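
The reasoning in that last paragraph can be sketched with plain rectangles. This is only an illustration under assumptions: the text boxes are made-up numbers in points, the helper names (`union`, `rough_crop`, `leftover_margin`) are hypothetical, and neither Briss nor soPdf is actually called.

```python
from typing import Dict, List, Tuple

# (left, bottom, right, top) in points, PDF-style coordinates.
Box = Tuple[float, float, float, float]


def union(boxes: List[Box]) -> Box:
    """Smallest box that contains every box in the list."""
    return (
        min(b[0] for b in boxes),
        min(b[1] for b in boxes),
        max(b[2] for b in boxes),
        max(b[3] for b in boxes),
    )


def rough_crop(text_boxes: List[Box]) -> Dict[str, Box]:
    """Step 1: one generous crop box for odd pages and one for even pages."""
    groups = (("odd", text_boxes[0::2]), ("even", text_boxes[1::2]))
    return {name: union(group) for name, group in groups if group}


def leftover_margin(rough: Box, text: Box) -> Box:
    """White space left around one page's text after the rough crop.

    This is what the precise second step (soPdf or k2pdfopt) still has to trim.
    """
    return (text[0] - rough[0], text[1] - rough[1], rough[2] - text[2], rough[3] - text[3])


# Made-up text boxes for pages 1..4; page 3 sticks out half an inch to the left.
text_boxes = [
    (72, 72, 540, 720),
    (60, 72, 528, 720),
    (36, 72, 540, 720),
    (60, 72, 528, 720),
]
rough = rough_crop(text_boxes)
print(rough["odd"])
print(leftover_margin(rough["odd"], text_boxes[0]))
```

Including the outlier page widens the rough box for all odd pages, so page 1 keeps an extra half inch on the left. That is harmless only because the second tool then crops every page individually.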

Last edited by markom; 09-05-2014 at 09:42 PM.
Old 11-20-2012, 04:54 PM   #198
markom
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
Quote:
Originally Posted by ectoplasm View Post
This is actually pretty sweet for automatically cropping text based PDF page margins. This is the first tool I found that does this automatically. If there are others, please comment. I'm not interested in the programs where you have to select a region by hand.
There is also VeryDOC PDF-Margin-Crop, which came out a few years ago.

http://www.verydoc.com/pdf-margin-crop.html

It is not freeware, but the trial allows about 40 croppings.

Just enter a margin value such as 5x5x5x5 or 10x10x10x10 points and it will crop a text-based PDF nicely.
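
For anyone wondering what a value like 5x5x5x5 amounts to, here is a small sketch of the arithmetic: take the bounding box of the text and push each side out by the given number of points. This is not VeryDOC's code. The order of the four numbers (assumed here to be left, top, right, bottom), the helper names and the sample text box are all assumptions made for illustration.

```python
from typing import Tuple

# (left, bottom, right, top) in points, PDF-style coordinates.
Box = Tuple[float, float, float, float]
Margins = Tuple[float, float, float, float]


def parse_margins(spec: str) -> Margins:
    """Parse a spec like '5x5x5x5' (upper or lower case 'x') into four numbers.

    ASSUMPTION: the order is left, top, right, bottom. The original post
    does not say which order the real tool uses.
    """
    parts = spec.lower().split("x")
    if len(parts) != 4:
        raise ValueError(f"expected four values separated by 'x', got {spec!r}")
    left, top, right, bottom = (float(p) for p in parts)
    return (left, top, right, bottom)


def crop_box(text_box: Box, margins: Margins) -> Box:
    """Grow the text bounding box outwards by the requested margins."""
    left, top, right, bottom = margins
    x0, y0, x1, y1 = text_box
    return (x0 - left, y0 - bottom, x1 + right, y1 + top)


# Made-up text bounding box on a page, in points.
text_box = (72.0, 90.0, 523.0, 752.0)
print(crop_box(text_box, parse_margins("5X5x5x5")))
```

With equal margins on all four sides, as in both examples from the post, the assumed order makes no difference to the result.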

For a scanned PDF (with or without an OCR layer) it is again advisable, as with soPdf, to crop it roughly beforehand in PdfScissors or Briss and then apply PDF Margin Crop. That way the results were always good for me.

I also used to crop the margins of text-based PDFs by printing them to a virtual printer from Adobe Reader, Acrobat, etc.

First check the exact dimensions of the text (usually under Comments / Measure / Distance), then print with the auto-center box checked and the corresponding Width x Height.
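
Virtual printers do not all ask for the page size in the same unit, so a small conversion helper can save some fiddling. This is only a sketch: the function name `paper_size` and the sample measurement of 4.5 x 7.25 inches are hypothetical. The only facts relied on are that 1 inch is 72 PDF points and 25.4 mm.

```python
from typing import Dict, Tuple

POINTS_PER_INCH = 72.0
MM_PER_INCH = 25.4


def paper_size(width_in: float, height_in: float) -> Dict[str, Tuple[float, float]]:
    """Convert a measured text block (in inches) to points and millimetres."""
    return {
        "points": (width_in * POINTS_PER_INCH, height_in * POINTS_PER_INCH),
        "mm": (round(width_in * MM_PER_INCH, 2), round(height_in * MM_PER_INCH, 2)),
    }


# Hypothetical measurement of the text block taken with the measuring tool.
size = paper_size(4.5, 7.25)
print(size["points"], size["mm"])
```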

Last edited by markom; 11-20-2012 at 06:36 PM.