07-21-2009, 05:16 AM | #1 |
Zealot
Posts: 116
Karma: 463
Join Date: Feb 2007
Location: Newcastle, UK
Device: Apple iPad
|
Converting PDF - Removing text at top of pages
I have tried a few searches, but have not found exactly what I am looking for, so sorry if this has been answered before.
I have a number of PDF files that I would like to convert to LRF. The problem with them is that the PDF has the book title and author and page number at the top of each page, just like the original printed book, and when I use calibre to convert to LRF, these bits are naturaly present in the middle of the text betweem pages. How can I use Calibre to remove this info when converting. Am I missing something simple? |
07-21-2009, 05:28 AM | #2 |
Zealot
Posts: 116
Karma: 463
Join Date: Feb 2007
Location: Newcastle, UK
Device: Apple iPad
|
I have successfully used Adobe Acrobat to crop the pdf pages to remove the header, save and then import in calibre and that seems to work perfectly.
Still wondering if there is a way to have calibre to do it automaticaly. |
Advert | |
|
07-21-2009, 05:42 AM | #3 |
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
|
The latest Calibre 0.6.0 beta releases now have an option to remove headers and footers under the Structure Detection part of the conversion parameters with the ability to specify a Regular Expression to identify the header/footer text. This might do what you want? The default RE provided is quite complex and will probably need amending for your particular documents, but it could still be worth looking at.
|
07-21-2009, 05:47 AM | #4 |
Zealot
Posts: 116
Karma: 463
Join Date: Feb 2007
Location: Newcastle, UK
Device: Apple iPad
|
Thanks for that, I have been toying with getting Beta 6, but I wasnt sure if I should wait till its officialy released.
|
07-21-2009, 07:00 AM | #5 |
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
|
It is meant to be close to offical release (in the next week or so), so now is probably a good time to find if there are any issues that affect you.
|
Advert | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Bulk Removing of Pages | jhempel24 | Sigil | 4 | 08-12-2010 01:28 PM |
PDF Conversion - Removing Header / Footer Text | heb | Sony Reader | 9 | 07-11-2010 11:02 PM |
Converting pdf to text with Adobe Digital | kezzie | 3 | 02-28-2010 04:14 AM | |
can Calibre split text from multiple pdf pages? | pjfan281 | Calibre | 4 | 07-25-2008 12:08 AM |
Converting text to PDF with a2ps | chrylis | iRex | 7 | 11-23-2007 11:46 AM |