03-05-2024, 09:36 PM | #1 |
Groupie
Posts: 181
Karma: 304158
Join Date: Jan 2016
Device: none
|
[SOLVED] Fixing broken PDFs?
Hello,
Calling PDF experts out there. Are there tools to fix broken PDFs that give out the following errors? Those are very old files whose source files are unavailable. Thank you. Code:
C:\>mutool.exe info BAD_01N09.pdf warning: trying to repair broken xref warning: repairing PDF document warning: object missing 'endobj' token warning: cannot load object (668 0 R) into cache PDF-1.1 Info object (1 0 R): <</Creator<BEC382F84A1000761A78DBFD60400FD0E5BB313A>/CreationDate<B2C3F680207E74046B0FBCF8611A0BDE>/Producer<B79AB5D67B2831150973CB9A23474B8CB9EB6E2246AA04F511CE2DCDA47E4ED5AC96>/Title<B398B4C0390C067616>/Author<BA9CA6CB77202B527976FEBE3E4D5688BFAE2F2056D24AF050>/Keywords()/Subject<A096ABCC742C65047913B4ED796F4F9BA2A77C3D4FA212BA>/ModDate<B2C3F680207E74046B0FBCFA63170CDC>>> Pages: 0 Retrieving info from pages 0-1... warning: ... repeated 4 times... warning: Page tree load failed. Falling back to slow lookup warning: cannot load object (668 0 R) into cache error: malformed page tree C:\>mutool.exe info BAD_02N01.pdf warning: trying to repair broken xref warning: repairing PDF document warning: object missing 'endobj' token warning: cannot load object (1027 0 R) into cache PDF-1.1 Info object (1 0 R): <</Creator<CD286D1677E44895E9799DED0F80E91CEC9C5527>/CreationDate<C128196E1D8A3DEF9A03FAEB0CD9EA19>/Producer<C4715A3846DC79F6FA728D8A4F87AD48B0CC0A3FCC9A07CA70638EFE5579B29A69D4>/Title<C0735B2E04F84E95E5>/Author<C97749254AD463B18A77B8AE528DB04CB6894B3DDCE249CF31>/Keywords()/Subject<D37D442249D82DE48A15FAFD15AFAC4AB79F4C31CD921E9A36>/ModDate<C128196E1D8A3CE7980FFBE40FD8E819>>> Pages: 0 Retrieving info from pages 0-1... warning: ... repeated 4 times... warning: Page tree load failed. Falling back to slow lookup warning: cannot load object (1027 0 R) into cache error: malformed page tree C:\>mutool.exe info BAD_02N02.pdf warning: trying to repair broken xref warning: repairing PDF document warning: object missing 'endobj' token warning: cannot load object (840 0 R) into cache PDF-1.1 Info object (1 0 R): <</Creator<E1A7BED95A83AAEF456017C912737171AEC7AE8A>/CreationDate<EDA7CAA130EDDE9D341770CC1024707B>/Producer<E8FE89F76BBB9B8C566B07AE52743526F297F19236C8D01D47395FFC0364502DD062>/Title<ECFC88E1299FACEF49>/Author<E5F89AEA67B381CB266E328A4F7E2822F4D2B09026B09E1806>/Keywords()/Subject<FFF297ED64BFCF9E260C73D9084E2433F4D2AEDE638BD04A11724982>/ModDate<EDA7CAA130EDDE9D341671C0132C757A>>> Pages: 0 Retrieving info from pages 0-1... warning: ... repeated 4 times... warning: Page tree load failed. Falling back to slow lookup warning: cannot load object (840 0 R) into cache error: malformed page tree C:\>mutool.exe info BAD_02N04.pdf warning: trying to repair broken xref warning: repairing PDF document warning: object missing 'endobj' token warning: cannot load object (730 0 R) into cache PDF-1.1 Info object (1 0 R): <</Creator<7DD3FFC84A3C928820410013DB5DE5CA3A91B244>/CreationDate<71D38BB02052E6FA51364716D806E1CF>/Producer<748AC8E67B04A3EB334A30749B5AA19B66C1ED5C47A11721B7B2CFECDFD877364C3E>/Title<7088C9F0392094882C>/Author<798CDBFB770CB9AC434F05508650BC9F6084AC5E57D95924F6>/Keywords()/Subject<6386D6FC7400F7F9432D4203C17DBA88718CBD1705B0067EE1F6C6>/ModDate<71D38BB02052E6FA5137461ADD01E4C7>>> Pages: 0 Retrieving info from pages 0-1... warning: ... repeated 4 times... warning: Page tree load failed. Falling back to slow lookup warning: cannot load object (730 0 R) into cache error: malformed page tree C:\>mutool.exe info BAD_02N05.pdf warning: trying to repair broken xref warning: repairing PDF document warning: object missing 'endobj' token PDF-1.1 Info object (1 0 R): <</Creator(H:EASYECCOv02n05a.pm6)/CreationDate(D:19971128153755)/Producer(Acrobat PDFWriter 2.01 for Windows)/Title(H:\\EASYECCO\\v02n05a.pm6)/Author(Stu Bloom)/Keywords()/Subject()/ModDate(D:19971129094835)>> Pages: 16 Retrieving info from pages 1-16... warning: cannot load object (503 0 R) into cache warning: Page tree load failed. Falling back to slow lookup warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (503 0 R) into cache warning: ... repeated 5 times... warning: non-page object in page tree () warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 2 times... warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (678 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (678 0 R) into cache warning: ... repeated 2 times... warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (678 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (503 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (502 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () warning: cannot load object (678 0 R) into cache warning: ... repeated 3 times... warning: non-page object in page tree () error: cannot find page 4 in page tree Edit: While I'm at it… With at least one file, text I copied from it turns into a series of question marks when pasted elsewhere. Could it due to fonts? Here's the fonts it used: Code:
Fonts: Helvetica (Type1) MSTT31c4a3 (Type1; embedded) MSTT31c4af (Type1; embedded) MSTT31c4ba (Type1; embedded) MSTT31c4c7 (Type1; embedded) MSTT31c4d4 (Type1; embedded) MSTT31c4e0 (Type1; embedded) MSTT31c4eb (Type1; embedded) MSTT31c4f7 (Type1; embedded) MSTT31c51e (Type1; embedded) MSTT31c467 (Type1; embedded) MSTT31c497 (Type1; embedded) MSTT31c504 (Type1; embedded) MSTT31c511 (Type1; embedded) Times-Roman (Type1) Last edited by Shohreh; 03-07-2024 at 12:27 PM. |
03-06-2024, 09:25 AM | #2 |
the rook, bossing Never.
Posts: 12,334
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
OCR is sometimes the only solution.
Loads of PDFs simply don't have sane text to copy. If I can't copy/paste from Okular or similar (Sumatra on Windows) into LO Writer, or Word, then it's time for OCR. |
03-06-2024, 09:42 AM | #3 |
Groupie
Posts: 181
Karma: 304158
Join Date: Jan 2016
Device: none
|
I opened the file in Abbyy.
Oddly, it says "The content in this PDF file is protected with a password." File > Save As > Searchable PDF Document: "The document is already searchable and does not need to be converted." |
03-06-2024, 11:13 AM | #4 |
the rook, bossing Never.
Posts: 12,334
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Make a copy with Ghostview, or Print to PDF.
|
03-06-2024, 11:24 AM | #5 |
Groupie
Posts: 181
Karma: 304158
Join Date: Jan 2016
Device: none
|
Would you have the Ghostview command handy?
I ran the installer gs10021w64.exe, but no trace of gs.exe in "c:\Program Files\gs\gs10.02.1\bin\", and the following fails: Code:
"c:\Program Files\gs\gs10.02.1\bin\gswin64c.exe" -o EasyEccoV01N04.IMAGE.pdf -sDEVICE=pdfimage32 -dPDFSETTINGS=/screen EasyEccoV01N04.pdf '-o' is not recognized as an internal or external command, operable program or batch file. "c:\Program Files\gs\gs10.02.1\bin\gswin64c.exe" -sOutputFile EasyEccoV01N04.IMAGE.pdf -sDEVICE=pdfimage32 -dPDFSETTINGS=/screen EasyEccoV01N04.pdf Last OS error: No such file or directoryGPL Ghostscript 10.02.1: Unrecoverable error, exit code 1 Last edited by Shohreh; 03-06-2024 at 12:05 PM. |
03-07-2024, 12:26 PM | #6 |
Groupie
Posts: 181
Karma: 304158
Join Date: Jan 2016
Device: none
|
A user could provide a different set of files, which open fine. Fixed.
Thank you. |
03-07-2024, 04:24 PM | #7 |
the rook, bossing Never.
Posts: 12,334
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
I've not used gs (ghostscript) or gv (ghostview) for nearly a decade, so no.
gs (ghostscript) = console utility gv (ghostview) = GUI that works as a viewer and controls ghostscript, so you don't bother with a command line and arcane arguments. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Aura H2O Screen somehow got broken? Need guidance fixing it! | Karmylla | Kobo Reader | 58 | 03-17-2020 04:17 PM |
Arc HELP! kobo broken - no fixing in Australia!! | sidsthename | Kobo Tablets | 0 | 07-11-2015 06:59 AM |
Fixing slow PDFs | PF4Mobile | Onyx Boox | 3 | 09-25-2011 11:39 AM |
Fixing broken sentences. | Vanguard3000 | Sigil | 18 | 01-23-2011 01:45 PM |
fixing broken button (guide) | ashadocat | Sony Reader Dev Corner | 0 | 10-01-2009 02:52 AM |