06-24-2024, 12:35 PM | #1 |
Junior Member
Posts: 4
Karma: 10
Join Date: Jun 2024
Device: kindle
|
looking for the correct regex
Hi,
During the last years, I've been storing all my books using this simple format: El tutorial de python - Guido van rossum (2017).pdf Zed A. Shaw - Learn Python The Hard Way (2011).epub I'm lookinf for a regex template to be able to capture the metadata correctly. I've tried multiple kind of combinations, but I'm not being able to capture the year. One of the last examples I tried: (?P<author>[^_-]+) - (?P<title>[^_].+) (?P<published>[^_]).+? No matter the way I trie.. i never get the year. Could yopu please help me with this? Thanks in advance! |
07-09-2024, 08:30 AM | #2 |
Junior Member
Posts: 4
Karma: 10
Join Date: Jun 2024
Device: kindle
|
Anyone?
Just some help, please |
Advert | |
|
07-13-2024, 04:52 PM | #3 |
Wizard
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
|
You forgot the parentheses:
Code:
(?P<author>[^_-]+) - (?P<title>[^_].+) \((?P<published>.+)\) |
07-15-2024, 10:31 AM | #4 | |
Junior Member
Posts: 4
Karma: 10
Join Date: Jun 2024
Device: kindle
|
Quote:
thanks a lot for your answer. I've just tried the example you propposed, but it doesn't seem to work. The problem is that the year is not being detected. Do you have any other seugestion for me? Thanks a lot!! |
|
07-16-2024, 10:25 AM | #5 |
Wizard
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
|
Hm,
See attached pictures. Looks good for me. I attached two dummy files w/o content that I used as test. In the first two pictures I made a test with the import test dialog. You may recognize that you provide two different examples, one with [author - title (year).ext] and the other with [title - author (year).ext]. In this case you need two separate templates for matching the metadata sequence and then switch to the one needed for importing. For my example I use [author - title (year).ext] with: Code:
(?P<author>[^_-]+) - (?P<title>[^_].+) \((?P<published>.+)\) Variations for [author - title (year).ext]: Code:
(?P<author>[^_-]+) - (?P<title>[^_].+) \((?P<published>[0-9]{4})\) Code:
(?P<author>[^_-]+) - (?P<title>[^_].+) \((?P<published>\d{4})\) Template for [title - author (year).ext] should be: Code:
((?P<title>[^_].+) - ?P<author>[^_-]+) \((?P<published>.+)\) |
Advert | |
|
07-16-2024, 10:42 AM | #6 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: kobo glo
|
Hey! You can try this regex to capture the year:
(?P<title>.+?) - (?P<author>[^-]+?) \((?P<year>\d{4})\)\. |
07-16-2024, 11:30 AM | #7 |
Junior Member
Posts: 4
Karma: 10
Join Date: Jun 2024
Device: kindle
|
SOLVED - looking for the correct regex
Hi @Divingduck and Yves54.
I'be just tested " (?P<title>[^_-]+) - (?P<author>[^_].+) \((?P<published>.+)\) " and seems to work fine for me. Thanks a lot for your help!!! |
Tags |
calibre, metadata, regex |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Regex in Regex function mode | lindlind | Editor | 5 | 03-22-2024 04:41 AM |
pdf regex question - regex that wraps to a new line | flyash | Conversion | 1 | 09-05-2021 10:00 AM |
Predefined regex for Regex-function | sherman | Editor | 3 | 01-19-2020 06:32 AM |
RegEx to correct Single quote to Double? | phossler | Editor | 3 | 02-22-2015 02:48 PM |
Is this e-pub correct? | silviolorusso | ePub | 12 | 06-08-2011 04:52 PM |