12-03-2021, 04:29 AM | #1 |
Zealot
Posts: 101
Karma: 28496
Join Date: Feb 2010
Device: none
|
Highlights Export - Joined Words Problem
I am having a problem with EPUB highlights on the Nova Air 2, so far with both English and Russian content. Some words are merged together when I export the highlights, for example "gothere" instead of "go there".
I found a pattern, though perhaps not "the" pattern: words get merged when a sentence wraps, so that the last word on one line is joined to the first word on the next line. |
12-03-2021, 08:07 AM | #2 |
Wizard
Posts: 2,878
Karma: 12000011
Join Date: Feb 2012
Device: Nook NST, Glow2, 3, 4, '21, Kobo Aura2, Poke3, Poke5
|
You might look more closely to see whether there is a non-printing character between "go" and "there". Onyx sometimes uses very strange characters in its exports.
|
12-03-2021, 02:56 PM | #3 |
Zealot
Posts: 101
Karma: 28496
Join Date: Feb 2010
Device: none
|
@Renate no special symbols.
Screenshots: Last edited by robertpolson; 12-03-2021 at 03:01 PM. |
12-03-2021, 06:46 PM | #4 |
Wizard
Posts: 2,878
Karma: 12000011
Join Date: Feb 2012
Device: Nook NST, Glow2, 3, 4, '21, Kobo Aura2, Poke3, Poke5
|
Could you get the actual annotation file and post it? You can see that those "square brackets" are not really square brackets; they are U+3010 Left Black Lenticular Brackets. I'll bet that you've never typed one! Maybe there's a U+200C Zero Width Non-Joiner between those words, but I'm putting my money on U+2028 Line Separator. UTF-8 encoded, obviously.
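One way to check for hidden characters like these is to list every non-ASCII or control character in the exported text along with its code point and Unicode name. The following is only a minimal sketch: the function name `find_unusual_chars` and the sample string are made up for illustration and are not taken from an actual Onyx export.

```python
import unicodedata


def find_unusual_chars(text):
    """Return (index, 'U+XXXX', name) for each non-ASCII or control character."""
    hits = []
    for i, ch in enumerate(text):
        # Ordinary line breaks and tabs are expected, so skip them.
        if ch in "\n\r\t":
            continue
        code = ord(ch)
        if code < 0x20 or code > 0x7E:
            # Some code points (e.g. control characters) have no name.
            name = unicodedata.name(ch, "<unnamed>")
            hits.append((i, f"U+{code:04X}", name))
    return hits


# Hypothetical sample: lenticular brackets plus a zero width non-joiner.
sample = "\u3010go\u200cthere\u3011"
for hit in find_unusual_chars(sample):
    print(hit)
```

To run it against a real export, read the file as UTF-8 first, for example `find_unusual_chars(open(path, encoding="utf-8").read())`, where `path` is wherever the annotation file was saved. If the list comes back empty between two merged words, there is genuinely nothing there.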
|
12-03-2021, 07:15 PM | #5 |
Evangelist
Posts: 495
Karma: 2267928
Join Date: Nov 2015
Device: none
|
Chinese text doesn't put a space where a line wraps, so the exporter probably joins wrapped lines without adding one. That's probably it.
|
12-03-2021, 07:26 PM | #6 |
Wizard
Posts: 2,878
Karma: 12000011
Join Date: Feb 2012
Device: Nook NST, Glow2, 3, 4, '21, Kobo Aura2, Poke3, Poke5
|
German doesn't use apostrophes for possessives.
|
12-04-2021, 12:24 AM | #7 | |
Zealot
Posts: 101
Karma: 28496
Join Date: Feb 2010
Device: none
|
Quote:
Can you please explain what you mean? Are you referring to some settings in NeoReader? |
|
12-04-2021, 12:24 AM | #8 |
Zealot
Posts: 101
Karma: 28496
Join Date: Feb 2010
Device: none
|
|
12-04-2021, 12:43 AM | #9 |
Zealot
Posts: 101
Karma: 28496
Join Date: Feb 2010
Device: none
|
I updated the firmware yesterday to 2021-11-22_11-01_3_2.
That seems to have fixed the problem: I highlighted the same text, exported it again, and the export looks correct now. |
12-04-2021, 07:51 AM | #10 |
Wizard
Posts: 2,878
Karma: 12000011
Join Date: Feb 2012
Device: Nook NST, Glow2, 3, 4, '21, Kobo Aura2, Poke3, Poke5
|
So, you're right. There is nothing between the squashed words, but we didn't know that until we looked at the binary file.
In case you intend to do anything with those annotations outside the Onyx ecosystem, be aware that the export contains some unusual character choices.
Code:
U+00A0 No-Break Space
U+2013 En Dash (in source text)
U+2019 Right Single Quotation Mark (source text not using apostrophe correctly)
U+3010 Left Black Lenticular Bracket
U+3011 Right Black Lenticular Bracket
U+FF1A Fullwidth Colon |
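For anyone post-processing an export, the characters listed above can be mapped to plain ASCII with `str.translate`. This is a rough sketch: the choice of replacements and the name `normalize_export` are assumptions, not anything Onyx documents, and you may prefer to keep the typographic characters that come from the book itself.

```python
# Assumed ASCII stand-ins for the characters seen in the export.
REPLACEMENTS = {
    0x00A0: " ",  # No-Break Space
    0x2013: "-",  # En Dash
    0x2019: "'",  # Right Single Quotation Mark
    0x3010: "[",  # Left Black Lenticular Bracket
    0x3011: "]",  # Right Black Lenticular Bracket
    0xFF1A: ":",  # Fullwidth Colon
}


def normalize_export(text):
    """Replace the unusual characters with their ASCII stand-ins."""
    return text.translate(REPLACEMENTS)


# Hypothetical line, not copied from a real export.
print(normalize_export("\u3010Note\u3011\uff1a it\u2019s\u00a0here"))
```

This only swaps characters that are actually present. It cannot restore a space between squashed words, because nothing is stored between them in the file.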
02-20-2022, 09:14 AM | #11 | |
Zealot
Posts: 138
Karma: 380090
Join Date: Feb 2013
Device: Kindle Paperwhite (11th Gen) v5.14.2
|
Quote:
Would it be possible to post the original highlight export file again? Even a small sample would be helpful. I'm experimenting with the format (https://www.mobileread.com/forums/sh...d.php?t=345266) to see whether I can produce a clean version with a script. |
|