06-19-2024, 03:56 PM | #46 |
Resident Curmudgeon
Posts: 76,402
Karma: 136466962
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
08-12-2024, 04:07 PM | #47 |
Member
Posts: 14
Karma: 796
Join Date: Mar 2021
Device: Kobo Aura One
|
|
Advert | |
|
08-15-2024, 03:00 AM | #48 |
Guru
Posts: 739
Karma: 7025494
Join Date: Aug 2017
Location: Italy
Device: Kindle Paperwhite, Kobo Elipsa, Pocketbook Inkpad 4, Inkpad Color
|
The first thing to check when we see an incorrect hyphenation is to check if the language in the ebook is set correctly. Calibre tells you what the language of the file is. Sometimes I have found books written in Italian with English set as the language. I'll let you imagine how they were hyphenated.
You can also find the language by opening the file in the calibre editor or with Sigil and in the content.opf file you can find <dc:language>it</dc:language> or <dc:language>it-IT</dc:language>. |
08-15-2024, 07:41 AM | #49 | |
the rook, bossing Never.
Posts: 12,352
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
Parts of a book file can be in a different language. These should be enclosed in a div p or span, depending if several paragraphs, one paragraph or part of a paragraph. So if a character speaks in in Irish in an English book, just the dialogue might be tagged "<span lang="ga">some Irish</span>" she said. This is important not just for hyphenation but also for TTS. Sadly I've seen ebooks where they didn't bother. A spelling check in Calibre in language of ebook should reveal alternate language parts without tangs. |
|
08-15-2024, 04:32 PM | #50 |
Guru
Posts: 739
Karma: 7025494
Join Date: Aug 2017
Location: Italy
Device: Kindle Paperwhite, Kobo Elipsa, Pocketbook Inkpad 4, Inkpad Color
|
It is right. Of course the books with strange hyphenation I have seen did not have <html lang="it"> at the beginning of each file. Sometimes there was nothing or sometimes there was <html lang="en">.
|
Advert | |
|
08-15-2024, 05:54 PM | #51 |
Bookish
Posts: 969
Karma: 1807784
Join Date: Jun 2011
Device: PC, t1, t2, t3, aura 2 v1, clara HD, Libra 2, Libra Color, Nxtpaper 11
|
When I just grab any ebook, I might see in certain files:
Code:
toc.ncx --> xml:lang="en-US"
content.opf --> <dc:language>en</dc:language>
nav.xhtml --> xml:lang="en"
*.xhtml --> <html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" xml:lang="en">
|
08-15-2024, 07:44 PM | #52 |
the rook, bossing Never.
Posts: 12,352
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
The most important one is the per file, and any alternate language in the file needs local lang property in whatever tags are appropriate (add extra span ii the language change is within a tag).
I'd imagine the system pop-up index needs the system level statements for hyphenation and TTS. |
08-16-2024, 06:04 PM | #53 |
Resident Curmudgeon
Posts: 76,402
Karma: 136466962
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
I'll be having a look for an Italian hyphenation dictionary tomorrow.
|
Tags |
hyphenation, kobo |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Hyphenation | skr1107 | Calibre | 1 | 09-27-2018 12:28 PM |
Hyphenation | Simboubou | PocketBook | 9 | 09-15-2014 06:21 AM |
Hyphenation does not work in E-book Viewer | Elancrom | Calibre | 2 | 06-18-2014 07:19 AM |
Hyphenation | Siard | Kobo Reader | 6 | 08-09-2013 08:40 AM |
hyphenation | CPatrick | OpenInkpot | 3 | 03-22-2010 07:06 AM |