05-22-2024, 04:18 PM | #1 |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
epub to PDF - specific letters are transparent and MISSING
Recently, when exporting from epub to pdf, my pdf exports have been crippled by specific letters of the alphabet, e.g. d and n, being transparent and not visible.
They are there, because when I select various paragraphs of pdf text and paste them into a text editor, the text is all there. I have tried different pdf viewers and none of them show the transparent letters. I've also tried viewing the pdf in "night mode" and the letters are still not there. Does anyone know what the fix is? |
05-22-2024, 05:12 PM | #2 |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
I get the same problem when using the ebook-convert utility
Here is my one liner script:
ebook-convert \
book_name.epub \
book_name.pdf \
--embed-all-fonts \
--paper-size=a4 \
--pdf-add-toc \
--pdf-page-numbers \
--pdf-sans-family \
--pretty-print
It used to work. I converted lots of epub books to pdf with this line in the past, but not any more. It has the same issue as described in the OP above: random, but specific, letters are transparent, making the pdf export unusable. This problem has existed for about 5 months (?). It's not the epub files, because pdf2go can convert them all successfully. But pdf2go does not produce a TOC and does not number the pdf pages, so it is pretty useless. Last edited by Jam; 05-22-2024 at 05:17 PM. |
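For readers who want to reuse the command, here is a minimal sketch of the same one-liner wrapped in a shell function. All options are the ones from the post. The only change is that `--pdf-sans-family` is given an explicit value, since as far as I know that option expects a font family name. "DejaVu Sans" is just a placeholder assumption, and `convert_book` and `DRY_RUN` are names made up for this sketch. This is not offered as a fix for the transparent letters, only as a tidier way to run the conversion.

```shell
# Sketch: the ebook-convert one-liner from the post as a reusable function.
# ASSUMPTION: "DejaVu Sans" is a placeholder; pass a font family that is
# actually installed system-wide as the third argument.
# ASSUMPTION: DRY_RUN is a hypothetical switch for this sketch only.
convert_book() {
    cb_in="$1"
    cb_out="$2"
    cb_sans="${3:-DejaVu Sans}"

    # Build the command line as positional parameters so that values
    # containing spaces stay intact.
    set -- ebook-convert "$cb_in" "$cb_out" \
        --embed-all-fonts \
        --paper-size=a4 \
        --pdf-add-toc \
        --pdf-page-numbers \
        "--pdf-sans-family=$cb_sans" \
        --pretty-print

    if [ "${DRY_RUN:-0}" = 1 ]; then
        # Print one argument per line instead of running calibre.
        printf '%s\n' "$@"
    else
        "$@"
    fi
}

# Usage (needs calibre's ebook-convert on PATH):
#   convert_book book_name.epub book_name.pdf "DejaVu Sans"
```

Setting `DRY_RUN=1` prints the arguments without calling calibre, which makes it easy to check what would be run before converting a batch of books.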
05-22-2024, 07:08 PM | #3 |
the rook, bossing Never.
Posts: 12,368
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Use Calibre to export the epub to docx. Edit in LO Writer using odt format. Have styles, TOC/Index etc as desired. Export a PDF.
|
05-22-2024, 08:28 PM | #4 | |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Quote:
As someone who has done quite a lot of bash scripting, I know that errors are easily introduced into software. I wonder if the software was refactored in some upgrade and things are still being ironed out. Epub files that used to convert no longer convert properly, and suffer from the transparent letters problem when being converted to pdf. I wonder why some letters are transparent but not all. This seems very odd. When I buy an epub book, this was a very handy feature for printing. Probably one of the most important features of Calibre. Does anyone know which is the last version that does not have this issue? I will then downgrade. |
|
05-22-2024, 11:11 PM | #5 |
creator of calibre
Posts: 44,559
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
05-23-2024, 07:10 AM | #6 | |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Quote:
I understand that coding is complex and involves many, many tasks. When I get a moment I will provide more detail and information on the problem I am seeing. I am running up to date Arch Linux. Perhaps I have a dependency missing. I get the feeling it is some kind of PDF layering issue. My temporary fix is to take the pdf file produced by the ebook-convert command above (see previous comment) and post process it as follows:
inkscape in_file.pdf --pages=all --export-filename=out_file.pdf
Inkscape gets rid of the text transparency problem, but removes the useful pdf document index and the clickable in-document links from the output pdf. It lets me print the books for now. Last edited by Jam; 05-23-2024 at 07:13 AM. |
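The Inkscape workaround above can be wrapped in a small function when several PDFs need the same treatment. This is only a sketch: the inkscape options are the ones quoted in the post, while the `_fixed` output suffix and the `fix_pdf` and `DRY_RUN` names are made up for illustration.

```shell
# Sketch: run one PDF through Inkscape, as in the workaround above.
# ASSUMPTION: the "_fixed" suffix is a hypothetical naming scheme.
# ASSUMPTION: DRY_RUN is a hypothetical switch for this sketch only.
fix_pdf() {
    fp_in="$1"
    # Strip a trailing ".pdf" and append the suffix.
    fp_out="${fp_in%.pdf}_fixed.pdf"

    if [ "${DRY_RUN:-0}" = 1 ]; then
        # Show the command instead of running Inkscape.
        printf 'inkscape %s --pages=all --export-filename=%s\n' "$fp_in" "$fp_out"
    else
        inkscape "$fp_in" --pages=all --export-filename="$fp_out"
    fi
}

# Usage (needs inkscape on PATH):
#   for f in *.pdf; do fix_pdf "$f"; done
```

As noted in the post, the output loses the document index and the clickable links, so this is only useful for getting a printable copy.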
|
05-23-2024, 07:20 AM | #7 |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Before post processing, this is how the doc looks.
https://drive.google.com/file/d/1AdH...oCPssIpYa/edit Note that letters v and h and other letters are missing, which makes the doc unreadable. The text is all there, which is confirmed by selecting and copying sections of text from the pdf and pasting them into a text editor such as Vim. It's just a transparency / visibility issue. |
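The copy and paste check can also be done from the command line. The sketch below assumes `pdftotext` from poppler-utils is installed. `count_letter` is a helper name made up for this example. A non-zero count only shows that the letter is present in the PDF's text layer. It says nothing about whether the glyph is actually drawn.

```shell
# Sketch: count how often a single letter occurs in text read from stdin.
# count_letter is a hypothetical helper name for this example.
count_letter() {
    # Delete every character except the given one, then count what is left.
    tr -cd "$1" | wc -c | tr -d ' '
}

# Usage (ASSUMPTION: pdftotext from poppler-utils is installed;
# "-" makes it write the extracted text to stdout):
#   pdftotext book_name.pdf - | count_letter v
```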
05-23-2024, 08:17 AM | #8 |
the rook, bossing Never.
Posts: 12,368
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Elephant in the room.
I stopped printing on paper about 2013. I "print" to epub and proof / annotate on eink. I sometimes have to check PDFs or read PDFs, so I have a choice of TCL Nxtpaper 11 (matt and almost like colour print paper or eink), a laptop with a matt add-on 2560x1440 16″ screen, or a matt 23″ 4K screen on the workstation. That has saved about 250,000 A4 sheets of paper, plus the associated ink/toner, environmental cost and time, in the last 10 years. Sometimes the 8″ Sage eink is big enough for a PDF. For an entire book or something that needs to be on paper for someone else I'd use Print24 or Lulu Print On Demand. Or loan them an ereader. It's cheaper to buy and give someone an ereader than print 15 novels. Normally I only generate PDF for POD / Publishing, hence the preferred route of LO Writer. I have used print from Calibre viewer occasionally to create a PDF, though not to have it on paper. If it's just a paper copy you need you can try printing from the Calibre viewer. I also sometimes use The GIMP to process poor PDFs (only available as PDF) or ImageMagick or K2PDFOPT https://www.mobileread.com/forums/sh...d.php?t=144711 I do use Inkscape, but not for PDFs. Last edited by Quoth; 05-23-2024 at 08:23 AM. |
05-23-2024, 08:52 AM | #9 |
creator of calibre
Posts: 44,559
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I suggest you uninstall the arch calibre package and install the official binaries from https://calibre-ebook.com/download_linux
|
05-23-2024, 10:06 AM | #10 | |
the rook, bossing Never.
Posts: 12,368
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
|
|
05-23-2024, 11:26 AM | #11 | |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Quote:
|
|
05-23-2024, 12:54 PM | #12 | |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Quote:
|
|
05-31-2024, 08:48 AM | #13 |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
I had a detailed discussion on the Arch Linux bug reporting site about this issue.
One of the packagers and I did some pdf font export testing: he ran the tests and I replicated his results. It seems the issue with text transparency in epub to pdf exports may be related to the font settings not being specified explicitly in the pdf export settings. The Arch packager suggested this is an upstream issue in the original Calibre package which should be looked into. While I am now using a Calibre install direct from the Calibre web site, which does not seem to have this issue, it might still be worth looking into. The discussion on the Arch bug reporting site is here: https://gitlab.archlinux.org/archlin...bre/-/issues/4 Calibre is a great package, thank you for it. |
05-31-2024, 10:40 AM | #14 |
creator of calibre
Posts: 44,559
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
This has nothing to do with fonts. The PDF is generated by Qt WebEngine, then it's subset by fonttools and post-processed using PoDoFo. Most likely the arch package of one of those is broken in some way or the other. I am afraid I lost my patience with debugging distro packages many, many years ago. Just use the calibre official binaries if you want a bug free experience.
|
05-31-2024, 02:31 PM | #15 | |
Junior Member
Posts: 9
Karma: 10
Join Date: May 2024
Device: linux
|
Quote:
I will stick with your binary going forward. |
|
Tags |
conversion, epub, fault, pdf |
|