03-31-2008, 06:58 PM | #1 |
zeldinha zippy zeldissima
Posts: 27,827
Karma: 921169
Join Date: Dec 2007
Location: Paris, France
Device: eb1150 & is that a nook in her pocket, or she just happy to see you?
|
what the ’ ??? (mobi conversion woes)
something very strange...
i tried to convert this book to imp, via html, using Nick's conversion tool (version 9.1). but i got some very strange text as a result... ’ instead of curly apostrophe, “ for a double curly left quotation mark, and � for a double curly right quotation mark... in the html source, the codes are respectively : ’ & acirc ; & #128 ; & #153 ; (i added the spaces) “ & acirc ; & #128 ; & #156 ; � & acirc ; & #128 ; & #157 ; in the mobi version, the characters display correctly (apostrophe, open quotation mark, close quotation mark). anybody know why this would happen ? there must be something strange in the mobi code, but i've never seen this result before... and is there any easy way to fix it (besides search and replace, i mean, because for the moment i have identified 3 charactes, but maybe there are a lot more in the rest of the book...) ? [EDIT yes there are more, a LOT more. it looks like every single special punctuation mark is incorrectly coded. i hope somebody knows an easy way to fix this... or this conversion is going to be a lot more work than i expected...] Last edited by zelda_pinwheel; 03-31-2008 at 07:00 PM. Reason: read a little further |
03-31-2008, 07:28 PM | #2 |
Grand Sorcerer
Posts: 7,452
Karma: 7185064
Join Date: Oct 2007
Location: Linköpng, Sweden
Device: Kindle Voyage, Nexus 5, Kindle PW
|
I would guess that UTF-8 is used in the MobiPocket file. Try mobi2html from MobiPerl and see if the resulting html file displays correctly in a browser. If that works then it must be the translation to imp that is wrong (and not my problem )
|
Advert | |
|
03-31-2008, 07:32 PM | #3 |
zeldinha zippy zeldissima
Posts: 27,827
Karma: 921169
Join Date: Dec 2007
Location: Paris, France
Device: eb1150 & is that a nook in her pocket, or she just happy to see you?
|
i will try mobi2html if you want (if i can figure it out...) but i think the problem is BEFORE the conversion to imp, because the html file which is left behind (it's the source for the imp file) contains these errors.
so, either the problem is in the original mobi code (seems likely), or in the conversion from mobi to html (never had this problem before). [EDIT : i was looking at the html in a browser already. i have attached the page for you.] Last edited by zelda_pinwheel; 03-31-2008 at 07:34 PM. |
03-31-2008, 07:56 PM | #4 | |
Grand Sorcerer
Posts: 7,452
Karma: 7185064
Join Date: Oct 2007
Location: Linköpng, Sweden
Device: Kindle Voyage, Nexus 5, Kindle PW
|
Quote:
|
|
03-31-2008, 08:04 PM | #5 |
zeldinha zippy zeldissima
Posts: 27,827
Karma: 921169
Join Date: Dec 2007
Location: Paris, France
Device: eb1150 & is that a nook in her pocket, or she just happy to see you?
|
heh, right, we are in the same time zone...
well, the mobipocket file is here, if you want to look at it tomorrow. in the meantime, good night ! |
Advert | |
|
03-31-2008, 08:19 PM | #6 |
Resident Curmudgeon
Posts: 76,049
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
The problem is with mobi2imp. The HTML direct from mobi2html has the quotes just fine. I tried it with 9.2 and it had the same problem you did describe.
|
03-31-2008, 08:23 PM | #7 | ||
Grand Sorcerer
Posts: 7,452
Karma: 7185064
Join Date: Oct 2007
Location: Linköpng, Sweden
Device: Kindle Voyage, Nexus 5, Kindle PW
|
Quote:
mobi2html did work properly. It said: Quote:
|
||
03-31-2008, 08:47 PM | #8 | |
Retired & reading more!
Posts: 2,764
Karma: 1884247
Join Date: Sep 2006
Location: North Alabama, USA
Device: Kindle 1, iPad Air 2, iPhone 6S+, Kobo Aura One
|
Quote:
mobi2html yields the same erroneous HTML markings. Book Designer will not "unpack" the .prc file. ABC Palm Converter just yields total garbage. Sorry. Not a lot of help. Tompe seems to have answered. Maybe I need to get an updated version of Mobiperl. Last edited by slayda; 03-31-2008 at 08:50 PM. |
|
03-31-2008, 09:05 PM | #9 |
Resident Curmudgeon
Posts: 76,049
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
0.0.37 expands the PRC correctly so this error is not the fault of mobi2html. The fault lies in mobi2imp not handling UTF-8 correctly.
|
03-31-2008, 09:37 PM | #10 | |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Quote:
I responded here with the exploration of this issue and that to fix the above strange behaviour, I best be getting the UTF-8 improvements implemented in Mobi2IMP! |
|
04-01-2008, 07:42 AM | #11 |
zeldinha zippy zeldissima
Posts: 27,827
Karma: 921169
Join Date: Dec 2007
Location: Paris, France
Device: eb1150 & is that a nook in her pocket, or she just happy to see you?
|
amazing, i go to sleep, and when i wake up everything is fixed and the problem explained !!
brilliant !! thanks everybody for your help ! i'm glad that the problem was so easily fixed, and i'm particularly glad there was an explanation for it so i understand (i hate those mysterious bugs that make no sense and you can't figure out where they come from, and even when you fix them you don't know why or how...). have a drink on me, just tell the barman to put it on my tab |
04-02-2008, 12:22 AM | #12 |
Fanatic
Posts: 549
Karma: 2928497
Join Date: Mar 2008
Device: Clara 2E & Sage
|
This issue of quotes, mdashes and such turning into strange characters is why I use the numeric representations in my HTML markup. For example, for a left-double-curly-quote, & # 8220 ; and & # 8221 ; for the right. This is the only way to guarantee that non-ASCII characters will display properly on different systems.
This is especially true with XHTML, as the only such characters defined by name are & lt ;, & gt ; & amp ; (I think there are a few more, I just don't remember them right now). Not even the & nbsp ; is defined for XHTML. Note that I had to insert spaces on each of those tags. The BBS software shows the character and not the tag that I entered, even if I wrap them in CODE tags. |
04-02-2008, 01:09 AM | #13 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
04-02-2008, 01:19 AM | #14 | |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Quote:
Ever use 300 baud (I did)? |
|
04-02-2008, 02:27 AM | #15 | |
creator of calibre
Posts: 44,409
Karma: 23977332
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Quote:
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Conversion error [PDF >> MOBI] | OMEN | Calibre | 3 | 09-27-2010 12:02 PM |
PDF to Mobi Conversion | rayh | Calibre | 2 | 09-24-2010 02:33 AM |
Epub to Mobi conversion | MichaelGray | Calibre | 2 | 08-12-2010 01:08 PM |
conversion to Mobi - Colors lost | ichbindasauge | Calibre | 2 | 11-06-2009 11:20 AM |
Conversion from Mobi to LRF error | jessie102 | Calibre | 2 | 08-16-2008 12:00 PM |