![]() |
#1 |
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
![]()
Hello.
I'm having some problems concerning one .xhtml file of just one chapter in a book. When importing it to Calibre it shows as a jumbled mess of random symbols: ![]() While on Adobe Digital Editions it looks fine before importing it to Calibre: ![]() The chapter is a long list of names that are mentioned throughout the book, ordered alphabetically and with the page number where they're mentioned in. I just want to keep the names and ditch the numbers. How can make it so Calibre reads this long list properly? I tried correcting html headers and setting the whole page to utf-8 thinking it was en enconding issue but none worked. If you'd point me in the right direction I'd appreciate it. Thanks in advance! |
![]() |
![]() |
![]() |
#2 |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 20,787
Karma: 27405072
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Moderator Notice
Moved to Editor subforum. BR |
![]() |
![]() |
![]() |
#3 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,169
Karma: 57532200
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
That book is ?? (is it even a book).
Coded wrong Charset, encrypted, damaged archive, someone changed the extension to trick Calibre into loading Note Calibre can not Edit or convert DOC format. It can file it and send it (as a DOC) |
![]() |
![]() |
![]() |
#4 | |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 20,787
Karma: 27405072
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
PS: that's as much as I know about it, no idea of why it was added to the specs… if one can embed a PDF why not an Autocad project - a BoM on yer Kindle ![]() BR Last edited by BetterRed; 08-30-2023 at 06:33 PM. |
|
![]() |
![]() |
![]() |
#5 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 44,179
Karma: 23000000
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Could be any number of things impossible to say without access to the book file.
|
![]() |
![]() |
![]() |
#6 |
the rook, bossing Never.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,084
Karma: 89198465
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Rename a copy of original epub to end in .zip in a new folder.
Unpack it. Rename the weird .xhtml file to .pdf and if that doesn't open try .png, etc. It can't be really an xhtml file unless it's encrypted. |
![]() |
![]() |
![]() |
#7 | |||
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
![]() Quote:
Quote:
Quote:
The book is protected against copy but it lets you highlight portions of the text and those can be exported so I found a workaround to what I wanted to do I guess (?). Last edited by BookieBlue; 08-31-2023 at 04:57 PM. Reason: Added more information. |
|||
![]() |
![]() |
![]() |
#8 |
the rook, bossing Never.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,084
Karma: 89198465
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
So it's likely an encrypted xhtml file.
![]() |
![]() |
![]() |
![]() |
#9 | |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 20,787
Karma: 27405072
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
Seems odd that a list of characters would get 'special' treatment, are there any other lists, e.g. locations. BR |
|
![]() |
![]() |
![]() |
#10 | |
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
|
![]()
Most probably, but I'm still puzzled on how the latest deDRM plugin unlocked the whole book except that one.
Quote:
The .xhtml file should only be a long list of full names ordered alphabetically and page numbers they are mentioned in. Last edited by BookieBlue; 08-31-2023 at 11:38 PM. Reason: Clarified there are no other lists in the file |
|
![]() |
![]() |
![]() |
#11 |
the rook, bossing Never.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,084
Karma: 89198465
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Screen shots of the page(s) in ADE. Then type up or OCR and edit...
Switch off "Cleartype" or any similar text enhancement as it colours the edges blue and red. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Random Metadata when Importing???? | Woodssi | Library Management | 4 | 12-22-2014 10:54 AM |
XHTML file limit? | BobK99 | Sigil | 4 | 03-08-2013 05:38 AM |
ncx file to html/xhtml file | javochase | Conversion | 1 | 06-23-2011 06:57 PM |
xhtml file name change | bobcdy | Sigil | 11 | 10-23-2010 12:05 AM |
Importing "big" XHTML files in Sigil | paulpeer | Sigil | 8 | 03-19-2010 05:00 AM |