07-11-2020, 08:14 PM | #16 |
Guru
Posts: 929
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
Yes. In my experience the biggest problems are:
1. Either omitting important graphic tables, or retaining unimportant page frames, page backgrounds, and other cruft. 2. Mis-ocring numbers. Since it's especially hard to spot that these are off, and they throw resulting figures off. 3. Screwing up text tables. 4. Columns, column, columns. Last edited by MarjaE; 07-11-2020 at 08:17 PM. |
07-11-2020, 08:46 PM | #17 | |
null operator (he/him)
Posts: 20,912
Karma: 27620686
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
To convert a PDF to something editable the first tool I try is Word itself (2016 or later), it can't handle very large PDFs, but the results can be as good or better than the FineReader scanning etc route. I've wondered if MS and Adobe use a common code base for the PDF related improvements they've made to Acrobat and Word. There was speculation a few months ago that MS would acquire Adobe. BR |
|
Advert | |
|
07-11-2020, 09:25 PM | #18 | |
Bookmaker & Cat Slave
Posts: 11,473
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
|
Quote:
Hitch |
|
07-12-2020, 04:31 AM | #19 |
Evangelist
Posts: 440
Karma: 77256
Join Date: Sep 2011
Device: none
|
If I need OCR, I'll generally use FineReader.
Acrobat seems to be in many cases, of what I've tried so far best overall at exporting commercial vector PDFs. Unfortunately there aren't many options. Sometimes HTML export is better than Word, easier to correct, though the HTML isn't great. It can style paragraphs as italic, with a span for normal, a bit of work to correct. Or it can create lists, such as currently, a line break that might start with "1." or some initial on the next line, same paragraph, it'll break and create a list; a lot of work to correct. Maybe I will again try Nitro, Phantom and whatever other PDF reader I can find. Word might be an option but it's a large PDF I'd have to split. Afraid of multiple CSS style definitions with the same name but different definition per split that would cause other pains to fix. |
07-12-2020, 05:58 AM | #20 |
Resident Curmudgeon
Posts: 75,813
Karma: 134321338
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Once you've converted the PDF to anything else, you'll need to A/B compare the PDF to the output format. That means every letter, every space, every punctuation, etc. Because if you don't, you WILL have errors.
|
Advert | |
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Convert an epub to a pdf from another pdf sample file | SvenSND | Conversion | 3 | 09-02-2016 04:29 PM |
Convert epub to pdf, with notes with main text in the pdf? | 8140david | ePub | 1 | 06-18-2015 01:13 PM |
Convert epub to pdf, with notes with main text in the pdf? | 8140david | Conversion | 1 | 06-18-2015 11:02 AM |