08-30-2023, 05:45 PM | #1 |
Junior Member
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
Random characters on one xhtml file after importing.
Hello.
I'm having a problem with one .xhtml file, a single chapter of a book. When I import the book into Calibre, that chapter shows as a jumbled mess of random symbols, while in Adobe Digital Editions it looks fine before importing. The chapter is a long list of names mentioned throughout the book, ordered alphabetically, each with the page numbers where it is mentioned. I just want to keep the names and ditch the numbers. How can I make Calibre read this long list properly? I tried correcting the HTML headers and setting the whole page to UTF-8, thinking it was an encoding issue, but neither worked. If you could point me in the right direction I'd appreciate it. Thanks in advance! |
08-30-2023, 06:34 PM | #2 |
null operator (he/him)
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Moderator Notice
Moved to Editor subforum. BR |
08-30-2023, 06:45 PM | #3 |
Well trained by Cats
Posts: 30,454
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
What is that book? (Is it even a book?)
It could be coded with the wrong charset, encrypted, or a damaged archive, or someone changed the extension to trick Calibre into loading it. Note: Calibre cannot edit or convert DOC format. It can file it and send it (as a DOC). |
08-30-2023, 07:04 PM | #4 | |
null operator (he/him)
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
PS: That's as much as I know about it; no idea why it was added to the specs… if one can embed a PDF, why not an AutoCAD project - a BoM on yer Kindle. BR Last edited by BetterRed; 08-30-2023 at 07:33 PM. |
|
08-30-2023, 11:12 PM | #5 |
creator of calibre
Posts: 44,565
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Could be any number of things; impossible to say without access to the book file.
|
08-31-2023, 06:43 AM | #6 |
the rook, bossing Never.
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Copy the original epub into a new folder and rename the copy so it ends in .zip.
Unpack it. Rename the weird .xhtml file to .pdf and, if that doesn't open, try .png, etc. It can't really be an xhtml file unless it's encrypted. |
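As an alternative to the rename-and-try approach above, the first bytes of the suspect file can be compared against well-known file signatures. This is only a rough sketch: the names `SIGNATURES`, `sniff` and `sniff_member` are made up for illustration, and the signature table covers just a handful of formats. An "unknown" result is consistent with encryption or corruption but does not prove either.

```python
import zipfile

# Leading bytes ("magic numbers") of a few common formats.
SIGNATURES = {
    b"%PDF": "pdf",
    b"\x89PNG\r\n\x1a\n": "png",
    b"\xff\xd8\xff": "jpeg",
    b"GIF8": "gif",
    b"PK\x03\x04": "zip",
}


def sniff(data: bytes) -> str:
    """Guess a format from the first bytes of a file."""
    for magic, name in SIGNATURES.items():
        if data.startswith(magic):
            return name
    # Skip a UTF-8 byte order mark and leading whitespace before checking for markup.
    head = data[:512].lstrip(b"\xef\xbb\xbf \t\r\n")
    if head.startswith(b"<"):
        return "markup"  # plausibly real XML/XHTML
    return "unknown"     # binary noise: possibly encrypted or damaged


def sniff_member(epub_path, member: str) -> str:
    """Read one file straight out of the EPUB (a zip archive) and sniff it."""
    with zipfile.ZipFile(epub_path) as zf:
        with zf.open(member) as fh:
            return sniff(fh.read(512))
```

For example, `sniff_member("copy-of-book.epub", "OEBPS/names.xhtml")` would report what the chapter file looks like without unpacking anything; both the file name and the internal path here are placeholders.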
08-31-2023, 05:55 PM | #7 | |||
Junior Member
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
Quote:
Quote:
Quote:
The book is copy-protected, but it lets you highlight portions of the text, and those highlights can be exported, so I guess I found a workaround for what I wanted to do (?). Last edited by BookieBlue; 08-31-2023 at 05:57 PM. Reason: Added more information. |
|||
08-31-2023, 06:23 PM | #8 |
the rook, bossing Never.
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
So it's likely an encrypted xhtml file.
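One way to check that guess: an EPUB that encrypts individual resources normally lists them in `META-INF/encryption.xml`, using `CipherReference` elements from the W3C XML Encryption namespace. The sketch below just prints those entries. The function names are invented for illustration. Font obfuscation uses the same file, and a stale entry may be left behind after decryption, so a listing is a hint, not proof that a file is still encrypted.

```python
import zipfile
import xml.etree.ElementTree as ET

# W3C XML Encryption namespace used inside META-INF/encryption.xml.
XMLENC_NS = "{http://www.w3.org/2001/04/xmlenc#}"


def encrypted_members(encryption_xml: bytes) -> list[str]:
    """Return the resource paths named by CipherReference elements."""
    root = ET.fromstring(encryption_xml)
    return [ref.get("URI", "") for ref in root.iter(XMLENC_NS + "CipherReference")]


def list_encrypted(epub_path) -> list[str]:
    """List resources an EPUB declares as encrypted (empty if none declared)."""
    with zipfile.ZipFile(epub_path) as zf:
        if "META-INF/encryption.xml" not in zf.namelist():
            return []
        return encrypted_members(zf.read("META-INF/encryption.xml"))
```

If the odd chapter is the only .xhtml file in that list, that would fit the theory that it alone was left encrypted.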
|
08-31-2023, 08:14 PM | #9 | |
null operator (he/him)
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
Seems odd that a list of characters would get 'special' treatment. Are there any other lists, e.g. locations? BR |
|
09-01-2023, 12:36 AM | #10 | |
Junior Member
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
Most probably, but I'm still puzzled as to how the latest deDRM plugin unlocked the whole book except that one file.
Quote:
The .xhtml file should only be a long list of full names, ordered alphabetically, with the page numbers they are mentioned on. Last edited by BookieBlue; 09-01-2023 at 12:38 AM. Reason: Clarified there are no other lists in the file |
|
09-01-2023, 08:06 AM | #11 |
the rook, bossing Never.
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Take screenshots of the page(s) in ADE, then type the text up or OCR it and edit the result.
Switch off "ClearType" or any similar text enhancement first, as it colours the edges of the text blue and red.
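Once the list exists as plain text (typed or OCR'd), dropping the page numbers could be done with a small script instead of by hand. This is a minimal sketch that assumes one entry per line, with the page references at the end of the line and separated by commas or ranges. `PAGE_REFS` and `strip_page_numbers` are illustrative names, and a name that itself ends in a number would be clipped.

```python
import re

# Trailing page references such as "12", "12-15" or "12, 40, 87-90",
# including any separator (space, comma, dot leaders) in front of them.
PAGE_REFS = re.compile(
    r"[\s,.:;]*\d+(?:\s*[-\u2013]\s*\d+)?"      # first page or page range
    r"(?:\s*,\s*\d+(?:\s*[-\u2013]\s*\d+)?)*"   # further comma-separated pages
    r"\s*$"
)


def strip_page_numbers(line: str) -> str:
    """Remove page references from the end of one index line."""
    return PAGE_REFS.sub("", line).strip()


def names_only(text: str) -> list[str]:
    """Apply strip_page_numbers to every non-empty line of the list."""
    return [strip_page_numbers(ln) for ln in text.splitlines() if ln.strip()]
```

Lines without trailing numbers pass through unchanged, so the result still needs a quick proofread for OCR mistakes.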
|