Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Editor

Notices

Reply
 
Thread Tools Search this Thread
Old 08-30-2023, 05:45 PM   #1
BookieBlue
Junior Member
BookieBlue began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
Question Random characters on one xhtml file after importing.

Hello.

I'm having some problems concerning one .xhtml file of just one chapter in a book. When importing it to Calibre it shows as a jumbled mess of random symbols:



While on Adobe Digital Editions it looks fine before importing it to Calibre:



The chapter is a long list of names that are mentioned throughout the book, ordered alphabetically and with the page number where they're mentioned in. I just want to keep the names and ditch the numbers.

How can make it so Calibre reads this long list properly?

I tried correcting html headers and setting the whole page to utf-8 thinking it was en enconding issue but none worked.

If you'd point me in the right direction I'd appreciate it.

Thanks in advance!
Attached Thumbnails
Click image for larger version

Name:	Cali-01.png
Views:	112
Size:	176.0 KB
ID:	203455   Click image for larger version

Name:	Cali-02.png
Views:	104
Size:	21.6 KB
ID:	203456  
BookieBlue is offline   Reply With Quote
Old 08-30-2023, 06:34 PM   #2
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Moderator Notice

Moved to Editor subforum.

BR
BetterRed is offline   Reply With Quote
Advert
Old 08-30-2023, 06:45 PM   #3
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 30,454
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
That book is ?? (is it even a book).
Coded wrong Charset, encrypted, damaged archive, someone changed the extension to trick Calibre into loading
Note Calibre can not Edit or convert DOC format. It can file it and send it (as a DOC)
theducks is offline   Reply With Quote
Old 08-30-2023, 07:04 PM   #4
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by theducks View Post
That book is ?? (is it even a book).
Coded wrong Charset, encrypted, damaged archive, someone changed the extension to trick Calibre into loading
Note Calibre can not Edit or convert DOC format. It can file it and send it (as a DOC)
Perhaps it's an embedded PDF, there's been mention of that in the Sigil forum in recent days… Kevin added the wherewithal to support it in the latest release.

PS: that's as much as I know about it, no idea of why it was added to the specs… if one can embed a PDF why not an Autocad project - a BoM on yer Kindle

BR

Last edited by BetterRed; 08-30-2023 at 07:33 PM.
BetterRed is offline   Reply With Quote
Old 08-30-2023, 11:12 PM   #5
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 44,565
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Could be any number of things impossible to say without access to the book file.
kovidgoyal is offline   Reply With Quote
Advert
Old 08-31-2023, 06:43 AM   #6
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
Rename a copy of original epub to end in .zip in a new folder.
Unpack it.
Rename the weird .xhtml file to .pdf and if that doesn't open try .png, etc. It can't be really an xhtml file unless it's encrypted.
Quoth is offline   Reply With Quote
Old 08-31-2023, 05:55 PM   #7
BookieBlue
Junior Member
BookieBlue began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
Question

Quote:
Originally Posted by kovidgoyal View Post
Could be any number of things impossible to say without access to the book file.
I can't do it as the book is DRM'ed, and thus forbidden per site's rules.

Quote:
Originally Posted by Quoth View Post
Rename a copy of original epub to end in .zip in a new folder.
Unpack it.
Rename the weird .xhtml file to .pdf and if that doesn't open try .png, etc. It can't be really an xhtml file unless it's encrypted.
Quote:
Originally Posted by BetterRed View Post
Perhaps it's an embedded PDF, there's been mention of that in the Sigil forum in recent days… Kevin added the wherewithal to support it in the latest release.
Copied the original ePub, zipped and extracted the whole content. Then I tried changing the extension of the file to both PDF and PNG and none opened, just a white page in preview and errors when opening with a dedicated PDF app. Doing the same to the copy I imported to Calibre I'm able to read all the other chapters that are in .xhtml form inside the zip, but not the one I'm having decoding issues (even when changing it to PDF).

The book is protected against copy but it lets you highlight portions of the text and those can be exported so I found a workaround to what I wanted to do I guess (?).

Last edited by BookieBlue; 08-31-2023 at 05:57 PM. Reason: Added more information.
BookieBlue is offline   Reply With Quote
Old 08-31-2023, 06:23 PM   #8
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
So it's likely an encrypted xhtml file.
Quoth is offline   Reply With Quote
Old 08-31-2023, 08:14 PM   #9
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,008
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by BookieBlue View Post
I can't do it as the book is DRM'ed, and thus forbidden per site's rules.
You can create an obfuscated version via the ScrambleEbook plugin and post that.

Seems odd that a list of characters would get 'special' treatment, are there any other lists, e.g. locations.

BR
BetterRed is offline   Reply With Quote
Old 09-01-2023, 12:36 AM   #10
BookieBlue
Junior Member
BookieBlue began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Aug 2023
Device: none
Unhappy

Quote:
Originally Posted by Quoth View Post
So it's likely an encrypted xhtml file.
Most probably, but I'm still puzzled on how the latest deDRM plugin unlocked the whole book except that one.

Quote:
Originally Posted by BetterRed View Post
You can create an obfuscated version via the ScrambleEbook plugin and post that.

Seems odd that a list of characters would get 'special' treatment, are there any other lists, e.g. locations.
Installed the plugin and when pressing the Scramble now action the window closes itself.

The .xhtml file should only be a long list of full names ordered alphabetically and page numbers they are mentioned in.

Last edited by BookieBlue; 09-01-2023 at 12:38 AM. Reason: Clarified there are no other lists in the file
BookieBlue is offline   Reply With Quote
Old 09-01-2023, 08:06 AM   #11
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 12,379
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
Screen shots of the page(s) in ADE. Then type up or OCR and edit...
Switch off "Cleartype" or any similar text enhancement as it colours the edges blue and red.
Quoth is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Random Metadata when Importing???? Woodssi Library Management 4 12-22-2014 11:54 AM
XHTML file limit? BobK99 Sigil 4 03-08-2013 06:38 AM
ncx file to html/xhtml file javochase Conversion 1 06-23-2011 07:57 PM
xhtml file name change bobcdy Sigil 11 10-23-2010 01:05 AM
Importing "big" XHTML files in Sigil paulpeer Sigil 8 03-19-2010 06:00 AM


All times are GMT -4. The time now is 05:49 PM.


MobileRead.com is a privately owned, operated and funded community.