Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 01-23-2024, 12:41 PM   #1
Doum
Connoisseur
Doum began at the beginning.
 
Posts: 60
Karma: 10
Join Date: Jun 2020
Device: Kobo Aura HD
search whith Æ ou Œ

hi,
When i run search with title:"~Ætern" or title:"~ætern" i find all my books with this type of character ( lettre soudée in french, but in english i don't know the translation , welded letter perhaps ) (idem with Œ)

But this character in french meens "ae" or "oe" with 2 letters, so i would like to find also all books like "Ad Vitam Aeternam" or "Oedipe" .
Have you an idea ?

The strange thing is the function strtolower PHP("rÆd") give rÆd and not ræd, but calibre (pyton?) do the good job and find lower/upper title. (like the sql LIKE ).

Last edited by Doum; 01-23-2024 at 12:43 PM.
Doum is offline   Reply With Quote
Old 01-23-2024, 01:11 PM   #2
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 11,610
Karma: 87456643
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
I don't quite understand where or how you are searching as that makes a difference. Calibre main screen GUI, command line, a program script etc.

Titles in filenames probably won't use Æ etc.


Some background for searching in an eBook.
Æ Œ and similar combinations can either be a ligature (any two letters run together such as fi sometimes) or a single letter, a diagraph, made from two letters. A diagraph is entered as single character in the source text.

Quote:
In the modern French alphabet, æ (called e-dans-l'a, 'e in the a') is used to spell Latin and Greek borrowings like curriculum vitæ, et cætera, ex æquo, tænia, and the first name Lætitia. It is mentioned in the name of Serge Gainsbourg's song Elaeudanla Téïtéïa, a reading of the French spelling of the name Lætitia: "L, E dans l'A, T, I, T, I, A."
However ebooks are usually coded in HTML. Most ebooks manage Æ æ Œ œ as characters fine, but the publisher designer may use an HTML entity such as Æ æ Œ or œ

So you may have to search for the actual character or an HTML entity: https://en.wikipedia.org/wiki/List_o...ity_references
Quoth is offline   Reply With Quote
Advert
Old 01-23-2024, 01:49 PM   #3
Doum
Connoisseur
Doum began at the beginning.
 
Posts: 60
Karma: 10
Join Date: Jun 2020
Device: Kobo Aura HD
google book do the job

look google book on https://www.google.com/search?q=r%C3...=gws-wiz-books
Doum is offline   Reply With Quote
Old 01-23-2024, 01:56 PM   #4
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 11,610
Karma: 87456643
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
I know what they are, I just don't understand exactly where and how you are searching.

æ in a title can be searched for on Calibre GUI.

Also I can easily type them as I use Linux. Compose Key a e gives æ etc. Common in old Anglo-Saxon names.

Last edited by Quoth; 01-23-2024 at 02:01 PM.
Quoth is offline   Reply With Quote
Old 01-23-2024, 08:51 PM   #5
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 44,006
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
calibre's search will match accented to unaccented characters and ignore case as well controllable via Preferences->Searching. However it doesnt do the transform you are asking for. Searching isnt language specific and there is no general rule that says ae and æ should be the same.
kovidgoyal is offline   Reply With Quote
Advert
Old 01-23-2024, 11:58 PM   #6
Sarmat89
Evangelist
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 483
Karma: 2267928
Join Date: Nov 2015
Device: none
Quote:
Originally Posted by kovidgoyal View Post
Searching isnt language specific and there is no general rule that says ae and æ should be the same.
Calibre already does that for normal search, but not for regex one.
Sarmat89 is offline   Reply With Quote
Old 01-24-2024, 08:24 AM   #7
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 11,610
Karma: 87456643
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
In general ae is two characters and æ is one. The a e can be in words never spelled with an æ such as Fae, so ae and æ shouldn't be treated the same, unlike say i and ï (dais) or e and é (cafe) etc which the exact same word can be spelled without the accents. Admittedly some words in English can be spelled with a diaeresis or not, such as mediaeval.
Quoth is offline   Reply With Quote
Old 01-24-2024, 10:00 AM   #8
Sarmat89
Evangelist
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 483
Karma: 2267928
Join Date: Nov 2015
Device: none
Quote:
Originally Posted by Quoth View Post
In general ae is two characters and æ is one.
Not unless you have a tailored collation for some Scandinavian language. By default, it is A+E.
Sarmat89 is offline   Reply With Quote
Old 01-24-2024, 11:14 AM   #9
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 11,610
Karma: 87456643
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
Quote:
Originally Posted by Sarmat89 View Post
Not unless you have a tailored collation for some Scandinavian language. By default, it is A+E.
I was referring to spelling and typography were the source can be separate a and e or there can be a æ single character. What any particular search or sort system/program does is a separate issue.

Checking with caja file manager sort: aertel is always before ærtel, no matter the creation order. Both are after adept and before afro.

My GUI Language setting is English UK, Region & Time is English Ireland. (Spelling checkers have no English Ireland setting). Installed Languages are English UK, English Ireland and English USA.

In general the sort order will be different on some 8 bit ASCII DOS systems with æ in the upper 127 characters.

Last edited by Quoth; 01-24-2024 at 11:16 AM.
Quoth is offline   Reply With Quote
Old 01-26-2024, 11:23 AM   #10
Doum
Connoisseur
Doum began at the beginning.
 
Posts: 60
Karma: 10
Join Date: Jun 2020
Device: Kobo Aura HD
The use case is not to do a search with æ or œ which always succeeds (if exact character exists) but to do a search with 2 letters : ae or oe and also find the one letter æ or œ .

I readily recognize that this request is superfluous because I only need to make 2 requests or instead of one. It's purely theoretical
Anyway thank you for your comments

For the fun (with my keyboard) :
œ => Alt. + 0156
Π=> Alt. + 0140
æ => Alt. + 145 ou Alt. + 0230
Æ => Alt. + 146 ou Alt. + 0198
Doum is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
New Feature? Embed search term as a tag into Search Results with one click? mikhail_fil Library Management 1 04-02-2022 10:18 PM
Regex in search problems (NOT Search&Replace; the search bar) lairdb Calibre 3 03-15-2017 07:10 PM
Problem whith pdf landscape in k3 ersott Amazon Kindle 0 07-04-2011 03:21 AM
Little bug whith french installation Wapata Calibre 6 08-10-2010 02:30 PM
Google Book Search to search full-text books online Bob Russell Deals and Resources (No Self-Promotion or Affiliate Links) 1 08-19-2006 12:13 PM


All times are GMT -4. The time now is 07:25 AM.


MobileRead.com is a privately owned, operated and funded community.