01-10-2011, 11:47 AM | #1 |
Connoisseur
Posts: 94
Karma: 110
Join Date: Sep 2010
Device: Kindle Fire HD
|
PDF pages in a box
I have a PDF book I want to convert to text. But each page of the book is in a text box so it will not convert. How do I get round this.
Thank you |
01-10-2011, 12:41 PM | #2 |
Sigil & calibre developer
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
First you need to verify the text is not a set of images. Open the pdf using something like acrobat and see if you can copy the text. If you can you can use a program to crop the pages. I don’t know what you would use on Windows.
Your said you wanted to convert to txt. Acrobat can save as text. Give that a try too. |
Advert | |
|
01-10-2011, 02:15 PM | #3 |
Connoisseur
Posts: 94
Karma: 110
Join Date: Sep 2010
Device: Kindle Fire HD
|
Thank u for the reply.
Yes it looks like a set of images. I saved the PDF in Adobe as a text file, but the txt file comes up blank in Notepad or Wordpad. I tried "Select All" in Adobe, but it does not select the whole file, it only selects one page where the cursor and if I copy and paste it in MS Word it comes up in same text box!! I am looking to get rid of the boxes and want only the content in them to format as I want. Last edited by Mamaijee; 01-10-2011 at 02:17 PM. |
01-10-2011, 02:31 PM | #4 | |
Well trained by Cats
Posts: 30,107
Karma: 57259780
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
If the images are High quality, you might be able to OCR each and every one |
|
01-10-2011, 10:37 PM | #5 |
US Navy, Retired
Posts: 9,865
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
As you have already stated it is not a text box, but a image you're copying into MS Word. As a image calibre can't help you but as mentioned you may be able to run it through OCR software to convert it to text.
|
Advert | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Classic Split PDF pages into smaller pages (images into tiles) | Astro | Barnes & Noble NOOK | 4 | 06-12-2020 10:56 AM |
blank pages on a pdf book | afsandiego | Sony Reader | 6 | 12-19-2015 05:52 AM |
PDF to Epub (problem with pages) | violentlyserene | Calibre | 1 | 08-22-2010 10:38 AM |
Split pdf pages down the middle | Blue_Alien | Calibre | 3 | 08-15-2010 11:12 PM |
Calibre changes the .PDF pages size | beniof | Calibre | 0 | 07-09-2010 06:41 AM |