08-06-2022, 12:31 PM | #1 |
Member
Posts: 13
Karma: 10
Join Date: Oct 2016
Device: Onyx Boox N96ML
|
Apostrophes in Full text search
Still "playing" with this powerful new feature, I ran into a minor problem that I fear is insoluble, but I will try anyway.
As an Italian-speaking user (French has the same problem), I often search for terms that, in my books, may be preceded by an article or preposition joined to them by an apostrophe. For example: "dall'alto monte" (from the high mountain), "un'ottima cena" (a great dinner), and so on. When I search for the words alone, without the article ("alto monte" / "ottima cena"), I expect to also find the occurrences preceded by an article. Instead, the books that contain these forms do not appear among the results. In practice, it seems that the forms with article and apostrophe ("dall'alto", "un'ottima") are treated as single terms. I suspect the engine does not interpret the apostrophe as a word boundary, or is it something else? Is there a way to work around this problem, or is it structural? Many thanks! |
08-07-2022, 12:24 AM | #2 |
creator of calibre
Posts: 44,572
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There's no way to bypass it. Tokenization of text into words is done at indexing time, and once done, it's done. calibre uses the ICU library to do this tokenization, and ICU uses language-sensitive rules for a number of languages, including European ones.
|
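To illustrate the point about tokenization at indexing time, here is a minimal Python sketch. It is not calibre's or ICU's code: both tokenizers, the `build_index` and `matches` helpers, and the sample sentence are hypothetical, and matching is simplified to "every query token is in the index" rather than true phrase search. It only shows why a term indexed as "dall'alto" cannot be found by searching for "alto".

```python
import re

# Hypothetical tokenizers for illustration only; neither reproduces
# the rules that ICU or calibre actually apply.

def tokenize_keep_apostrophe(text):
    """Treat an apostrophe between word characters as part of the word."""
    return re.findall(r"\w+(?:['’]\w+)*", text.lower())

def tokenize_split_apostrophe(text):
    """Treat an apostrophe as a word boundary."""
    return re.findall(r"\w+", text.lower())

def build_index(text, tokenizer):
    """Index the text once; the token set is fixed from this point on."""
    return set(tokenizer(text))

def matches(query, index, tokenizer):
    """Simplified search: every query token must exist in the index."""
    return all(token in index for token in tokenizer(query))

book = "Scese dall'alto monte dopo un'ottima cena."

kept = build_index(book, tokenize_keep_apostrophe)
split = build_index(book, tokenize_split_apostrophe)

# With the apostrophe kept inside the word, "alto" was never indexed.
print(matches("alto monte", kept, tokenize_keep_apostrophe))        # False
print(matches("dall'alto monte", kept, tokenize_keep_apostrophe))   # True

# With the apostrophe treated as a boundary, the bare words are indexed.
print(matches("alto monte", split, tokenize_split_apostrophe))      # True
```

Because the index stores only the tokens produced when the book was indexed, changing how a query is typed cannot recover a word that was never stored as a separate token.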