Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 06-02-2016, 09:51 PM   #1
jcsalomon
Zealot
jcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheesejcsalomon can extract oil from cheese
 
jcsalomon's Avatar
 
Posts: 100
Karma: 1204
Join Date: Jun 2012
Device: Bookari (née Mantano Reader) on Android; Kindle Fire HD
Angry xml:lang oddities

The book I’m editing has some foreign-language words, and (mostly so I wouldn’t “fix” these in spell-checking) I applied the XML language tag to them:
Code:
<i xml:lang="hu">Én nem beszélek magyarul.</i>
worked ok, but I tried this with some transliterated Korean:
Code:
<i xml:lang="ko-Latn">dongsaeng</i>
and that behaved oddly: The desktop version of ADE (version 2) showed the transliterated Korean in an upright font; and Mantano/Bookari displayed them in a slanted roman, rather than an italic, font.

I’ve reported the bug to Mantano, but I’m removing this attempt at semantic coding anyway, since ADE-based readers are just about guaranteed to to something wrong with this. I figured I’d post my results here so anyone trying to research this matter will find this and know not to bother.

(See What is the correct way to use the lang attribute with phonetic pronunciations? on Stack Overflow for an indication that—theoretically, at least—what I tried should have been the Right Thing to Do.™)
jcsalomon is offline   Reply With Quote
Old 06-06-2016, 05:28 PM   #2
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by jcsalomon View Post
[...] but I’m removing this attempt at semantic coding anyway, since ADE-based readers are just about guaranteed to to something wrong with this. I figured I’d post my results here so anyone trying to research this matter will find this and know not to bother.
Do you have any more example code + screenshots of this in action? Maybe attach a sample EPUB showing off the bugs?

I am going to take a giant stab in the dark and suspect this type of bug might only occur for languages that handle emphasis with punctuation marks instead of italic text:

https://en.wikipedia.org/wiki/Emphas...ctuation_marks

Again, another stab in the dark... I assume most of these devices would handle the primary tags ("en", "fr", [...]), and MAYBE the major variants ("en-US", "en-UK", [...]), but not properly handle more obscure variants (like "ko-Latn"), especially where the non-Latin primary language overlaps with some sort of Latin variant (such as you marking up the transliterations).

I bet ADE just sees the "ko" primary language, and automatically treats the entire thing as Korean (because it doesn't understand/distinguish "-Latn").

On a related note, do you suspect this type of bug might also effect some RTL or vertical languages/variants? I don't work with any sorts of those texts to know.
Tex2002ans is offline   Reply With Quote
Advert
Reply

Tags
language metadata, xml


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
xml:lang empty (pdf to epub) fxp33 Conversion 3 05-07-2015 11:40 PM
Catalog oddities tamhas Library Management 7 07-25-2014 10:55 AM
After merging all the .xml files, how do you divide it back into .xml files? automa Sigil 10 08-13-2013 07:43 AM
Anachronism or other oddities Hellmark General Discussions 34 05-03-2011 01:28 PM


All times are GMT -4. The time now is 02:50 PM.


MobileRead.com is a privately owned, operated and funded community.