05-28-2013, 07:51 PM | #1 |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
How to correct odd, unwanted characters in an epub?
I've browsed the Sigil forum and Conversion sub-forum looking for a solution without success. I have an epub loaded with the following characters and cannot figure out how to correct the entire book. In conversions I have tried, individually: 'Unsmarten punctuation,' 'Transliterate unicode characters to ASCII,' and also selected utf-8 in 'input character encoding.' Nothing seems to work. Here are the unwanted characters:
â (throughout the text, possibly replacing single quotation marks)  (between chapter breaks) and then this - which I think is meant to be the word protégé with accents aigu: protégé I cannot do a simple search and replace in Sigil since the characters may represent more than one character and I have no idea what they might be. Does anyone have a solution or suggestion how to correct this? Is there a conversion setting I am missing? |
05-28-2013, 08:02 PM | #2 |
Resident Curmudgeon
Posts: 77,622
Karma: 140804106
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Those are not unwanted characters. Those are actually the characters you do want. They are displaying incorrectly because of the wrong encoding being used.
In the header right after the title (if needed), I put the following... Code:
<title>Cat & Mouse</title> <meta content="http://www.w3.org/1999/xhtml; charset=utf-8" http-equiv="Content-Type"/> Last edited by JSWolf; 05-28-2013 at 08:07 PM. |
Advert | |
|
05-28-2013, 08:15 PM | #3 |
Grand Sorcerer
Posts: 6,236
Karma: 16537474
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
Using the Modify Epub plugin, with the option Encode HTML in UTF-8 checked usually works for me.
|
05-28-2013, 08:18 PM | #4 |
Resident Curmudgeon
Posts: 77,622
Karma: 140804106
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
In most cases, it's not an encoding issue of the actual file. It's an encoding issue in the header of the XML. That is why I have to add in the line I specified so the software reading the XML gets it right. ADE doesn't have an issue because uses UTF-8 regardless of what the header specifies.
|
05-28-2013, 08:21 PM | #5 | |
Grand Sorcerer
Posts: 6,236
Karma: 16537474
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
Quote:
|
|
Advert | |
|
05-28-2013, 08:44 PM | #6 | |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
Quote:
I'll try what Jon suggested and hope it works. Edit: I applied the edit to a chapter and there is no change. Last edited by Skydog; 05-28-2013 at 08:52 PM. |
|
05-28-2013, 08:49 PM | #7 |
Grand Sorcerer
Posts: 6,236
Karma: 16537474
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
@Skydog,
If you've converted it a few times it's possible the encoding is well and truly messed up by now. If all else fails, try going back to your clean original source before trying to correct. |
05-28-2013, 08:54 PM | #8 | |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
Quote:
In any case, I still have the problem. |
|
05-28-2013, 08:54 PM | #9 |
350 Hoarder
Posts: 3,574
Karma: 8281267
Join Date: Dec 2010
Location: Midwest USA
Device: Sony PRS-350, Kobo Glo & Glo HD, PW2
|
When that's happened to me a few times, a simple epub-to-epub conversion in Calibre fixed all those odd symbols. I didn't check anything special, just my usual of "remove blank lines" and it worked every time for me.
|
05-28-2013, 09:04 PM | #10 |
Resident Curmudgeon
Posts: 77,622
Karma: 140804106
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
ePub > ePub to fix the encoding is not a good idea. In fact, ePub > ePub is not a good idea if things can be fixed any other way.
|
05-28-2013, 09:05 PM | #11 |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
Third time is the charm! I once again copied the original and did nothing but apply the modify epub (utf-8 encoding), as jackie_w suggested. It worked!! Thank you, jackie_w. In this case, it accomplished the shortcut to Jon's suggestion.
@Ripplinger - I originally performed an epub-epub conversion as I mentioned above which did not work for some reason. The utf-8 encoding was indeed the issue -- I just wasn't initially able to get it to stick for some reason. |
05-28-2013, 09:06 PM | #12 |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
Is there a better way to accomplish widows: 0; orphans: 0; ?? I sure wish some talented person here would include it in the Modify Epub plugin.
|
05-28-2013, 09:36 PM | #13 |
Resident Curmudgeon
Posts: 77,622
Karma: 140804106
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Use Tweak eBook (in Calibre unless you are already editing in Sigil) and you will then be able to edit the CSS to add in widows and orphans of 0 to the body style.
|
05-28-2013, 09:39 PM | #14 |
Wizard
Posts: 2,286
Karma: 7409537
Join Date: Mar 2009
Location: Circling Earth @ Mach .83
Device: Elipsa 2E, Sage, Libra Colour, Libra 2, Clara 2E, Oasis3, Voyage
|
|
05-28-2013, 09:43 PM | #15 |
Resident Curmudgeon
Posts: 77,622
Karma: 140804106
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
koboish: Script that convert your epub to a kepub.epub with the correct bookcover !! | the_m | Kobo Reader | 4 | 01-24-2013 11:01 PM |
How do I correct varying section breaks (epub to epub) | library addict | Calibre | 0 | 02-21-2012 01:56 PM |
Odd letters/characters | MSWallack | Conversion | 7 | 12-30-2011 11:25 AM |
Odd Characters When Sending .mobi to Kindle 3 | mrh882 | Calibre | 3 | 07-27-2011 07:39 PM |
Unwanted information in Epub files | renesboy | Kobo Reader | 3 | 08-03-2010 03:01 PM |