03-02-2023, 11:56 PM | #1081 | |
Junior Member
Posts: 6
Karma: 10
Join Date: Jun 2021
Location: Taiwan, sovereign nation and #1
Device: kindle pw4
|
Quote:
Hey, sorry about that, I got caught up in some big work projects. It's OK that kiwidude doesn't want to include a fork's functionality. It's very normal in the open source community for people to have strong opinions, and such opinions are what it takes for really cool projects like theirs to get created for free. I don't actually disagree with their logic either: combining duplicates carries a high risk of screwing up your ebooks and metadata and losing copies! You have to be careful. But there are still use cases. I had one, so I made the tweak to the plugin. Others have use cases too. I'm working right now to get it working with up-to-date versions of calibre. Someone else also kindly emailed me to mention another bug that I'm looking at. As for @kiwidude, don't worry, I won't be making a PR, though I appreciate you linking your GitHub repo (https://github.com/kiwidude68/calibr...icates-v1.10.7), there's lots of really good information there. My primary motivation for creating the tweak was to try doing some open source work outside of web dev, so thanks for creating resources that help people do that! As your code is GPL-3, I'm going to change my repo to be a fork of yours with the same license. I really like GPL-3 as well. |
|
03-03-2023, 02:55 AM | #1082 |
Junior Member
Posts: 6
Karma: 10
Join Date: Jun 2021
Location: Taiwan, sovereign nation and #1
Device: kindle pw4
|
Sorry to double reply, but it looks like there's a time limit on editing a post.
The fork of kiwidude's excellent repo for finding duplicates that I made a while back has been updated to (hopefully) work on newer versions of Calibre, and also includes the massive improvements kiwidude made to their code in the meantime. https://github.com/komali2/calibre-find-duplicates Remember that this is only possible because of all of their hard work, credit goes to them! |
04-07-2023, 12:06 PM | #1084 |
Connoisseur
Posts: 59
Karma: 666
Join Date: May 2020
Location: Germany
Device: android smartphone + tablet
|
Key error in module book_algorithms.py
Hello kiwidude,
when Calibre was running in debug mode today, I noticed that a key error is reported in the book_algorithms.py module:
File "calibre_plugins.find_duplicates.book_algorithms", line 295, in _find_candidate_by_file_size
KeyError: 'mtime'
Statement 295 in version (1, 10, 7) is: mtime = stat_metadata['mtime'].
However, the plugin continued to run and returned a result.
Regards, |
04-14-2023, 06:52 AM | #1085 | |
Calibre Plugins Developer
Posts: 4,688
Karma: 2162246
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
The only way I can see this happening is if you somehow have invalid metadata in your library for a book format. Perhaps you deleted a book format from disk, but calibre still thinks it exists. I will tweak the code so the exception won't appear in future. As it is written, though, it would just skip that book format anyway, which is what it should do, so the exception is just extra noise in the log. |
|
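For anyone following along, here is a minimal sketch of the kind of defensive lookup described above. It is not the plugin's actual code: the helper name `formats_with_mtime` and the shape of the `formats` dictionary are assumptions made for illustration. Only the `stat_metadata['mtime']` lookup comes from the traceback.

```python
def formats_with_mtime(formats):
    """Return {format: mtime}, skipping formats whose stat metadata lacks 'mtime'.

    `formats` is a hypothetical mapping of format name -> stat metadata dict;
    the real plugin's data structures may look different.
    """
    kept = {}
    for fmt, stat_metadata in formats.items():
        # dict.get returns None instead of raising KeyError when the key is absent
        mtime = stat_metadata.get('mtime')
        if mtime is None:
            # Likely a format calibre still lists but that is gone from disk: skip it
            continue
        kept[fmt] = mtime
    return kept


if __name__ == "__main__":
    example = {
        'EPUB': {'size': 1024, 'mtime': 1681455120.0},
        'PDF': {},  # stale entry with no stat information
    }
    print(formats_with_mtime(example))
```

The behaviour is the same as before (the stale format is skipped), just without the exception showing up in the debug log.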
04-14-2023, 06:59 AM | #1086 |
Calibre Plugins Developer
Posts: 4,688
Karma: 2162246
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Find Duplicates v1.10.8 Released
Release Notes
https://github.com/kiwidude68/calibr...icates-v1.10.8 |
06-11-2023, 10:11 PM | #1087 |
Custom User Title
Posts: 9,575
Karma: 64960983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
Question: Is it possible to run a binary compare on just a few selected books, instead of the entire library?
|
06-12-2023, 06:17 AM | #1088 |
Calibre Plugins Developer
Posts: 4,688
Karma: 2162246
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
|
07-22-2023, 04:22 PM | #1089 | |
Leftutti
Posts: 490
Karma: 1153449
Join Date: Feb 2019
Location: Bavaria
Device: iPad Pro, Kobo Libra 2
|
Quote:
Edit: Nevermind. I swapped to this method, which works a bit better in my eyes. https://www.mobileread.com/forums/sh...77#post4333077 Last edited by Wiggo; 07-23-2023 at 06:19 AM. |
|
09-12-2023, 03:02 AM | #1090 | |
Zealot
Posts: 112
Karma: 53342
Join Date: Jun 2013
Device: Sony PRS-600
|
Quote:
One is SHA, the other is called 'CRC' but lies a little (it's actually CRC32-<type>-<size>). Your lovely plugin here makes it very easy to reliably dedupe based on this information. I still use a binary dupe check on occasion to look for files with different extensions (I found one set that had 'XML' and 'SRD' extensions -- one was the actual format, the other indicated the data set), but just having this is very handy. I can then, in a dedupe cleanup library, present the 'CRC identifier' in the tag browser. They're grouped by first character, giving me 16 groups of CRC values to quickly skim looking for 'more than 1' entries and cull. It's a little more convenient for me than the options built into the plugin. |
|
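A rough sketch of the identifier scheme described above, for illustration only. Assumptions: `<type>` is taken to be the lowercased file format, the SHA variant is taken to be SHA-256 (the post only says "SHA"), and all function names are hypothetical. The poster's actual tooling may differ on every one of these points.

```python
import hashlib
import zlib
from collections import Counter, defaultdict


def crc_identifier(data: bytes, fmt: str) -> str:
    """Build a 'CRC32-<type>-<size>' style identifier (layout assumed)."""
    crc = zlib.crc32(data) & 0xFFFFFFFF  # unsigned 32-bit checksum
    return f"{crc:08x}-{fmt.lower()}-{len(data)}"


def sha_identifier(data: bytes) -> str:
    """SHA-256 hex digest; the specific SHA variant is an assumption."""
    return hashlib.sha256(data).hexdigest()


def group_by_first_char(identifiers):
    """Group identifiers by first hex character, giving at most 16 groups."""
    groups = defaultdict(list)
    for ident in identifiers:
        groups[ident[0]].append(ident)
    return dict(groups)


def duplicated(identifiers):
    """Return the identifiers that occur more than once, sorted."""
    counts = Counter(identifiers)
    return sorted(ident for ident, n in counts.items() if n > 1)


if __name__ == "__main__":
    ids = [
        crc_identifier(b"same bytes", "EPUB"),
        crc_identifier(b"same bytes", "epub"),
        crc_identifier(b"other bytes", "PDF"),
    ]
    print(group_by_first_char(ids))
    print(duplicated(ids))
```

Because the size and type are part of the identifier, two files only collide when checksum, format and length all agree, which is what makes skimming for "more than 1" entries practical.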
01-24-2024, 05:37 AM | #1091 |
Custom User Title
Posts: 9,575
Karma: 64960983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
How does "Similar" in Find Metadata Variations work? I can't remember the last time I even got a match with it, so I'm not even entirely sure it does work.
|
01-24-2024, 11:36 PM | #1092 | |
Calibre Plugins Developer
Posts: 4,688
Karma: 2162246
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
For similar authors it strips initials, punctuation and ignores order of the names so John Smith and Smith John will match. |
|
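To make the description above concrete, here is a minimal sketch of a "similar author" key. It is not the plugin's implementation: the function name `similar_author_key` and the exact rules (lowercasing, treating any single-character token as an initial) are assumptions for illustration.

```python
import re


def similar_author_key(author: str) -> str:
    """Reduce an author name to a key that ignores punctuation, initials and order."""
    # Replace punctuation with spaces so "Smith, John" splits cleanly
    cleaned = re.sub(r"[^\w\s]", " ", author.lower())
    # Treat single-character tokens as initials and drop them (assumed rule)
    tokens = [t for t in cleaned.split() if len(t) > 1]
    # Sort the remaining name parts so word order does not matter
    return " ".join(sorted(tokens))


if __name__ == "__main__":
    for name in ("John Smith", "Smith John", "Smith, John Q.", "Jane Smith"):
        print(name, "->", similar_author_key(name))
```

Two authors would then count as "similar" when their keys are equal, so John Smith and Smith John land in the same group while Jane Smith does not.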
02-11-2024, 02:44 PM | #1093 |
Junior Member
Posts: 2
Karma: 10
Join Date: Feb 2024
Device: kindle
|
Duplicate finder (binary compare)
Hello guys,
I have been using calibre as an e-book/e-document manager for many years. Now I am facing a new challenge: I merged two medium-sized libraries, and the resulting library contains more duplicates than I initially expected. The problem: I ran a binary compare to find true document duplicates (without the option to delete duplicate formats during the operation), and the result is quite a big list of groups. Can I save the result so that I can process the metadata/format merges afterwards, without needing to rerun the duplicate finder? In my case it is a long operation (the library is about 500 GB with ~200k entries). Any help or suggestions are welcome. Thanks for your time! |
02-11-2024, 04:23 PM | #1094 | |
null operator (he/him)
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR |
|
02-12-2024, 05:22 AM | #1095 | |
Enthusiast
Posts: 39
Karma: 10
Join Date: May 2012
Device: android
|
Quote:
L |
|