Old 04-03-2019, 09:19 AM   #1
lbutlr
Enthusiast
Posts: 31
Karma: 103134
Join Date: Feb 2010
Device: iPhone
Scan for duplicates

Probably as a result of an rsync error at some point, I have a lot of duplicates in my library, and cannot find any way to automatically find them all and merge them. I've seen mention of a "Merge Duplicates" plugin that doesn't seem to exist anymore, and I found a thread that is many years old.

https://www.mobileread.com/forums/sh...d.php?t=255394

But it refers to the plugin. I've scoured the menus and the preferences, but I am not seeing anything helpful. I estimate that I have at least 600 books which appear twice in my library, so I do not want to do this by hand.

Short of creating an entirely new library and importing all 9000 books, is there anything I can do with the current version of Calibre to fix this?
Old 04-03-2019, 09:41 AM   #2
ilovejedd
hopeless n00b
Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
Merging book records is a built-in function of Calibre (keyboard shortcut M).

First you select the book record you want to keep, then select all other records you want to merge into that and press M.

Last edited by ilovejedd; 04-03-2019 at 09:55 AM.
Old 04-03-2019, 09:54 AM   #3
lbutlr
Enthusiast
Posts: 31
Karma: 103134
Join Date: Feb 2010
Device: iPhone
Sorry, "Find Duplicates" was not listed on the plugins page and I didn't remember that the "Find plugins" button searched more plugin sources. I've found the plugin and am taking a look at it now.
Old 04-03-2019, 09:55 AM   #4
lbutlr
Enthusiast
Posts: 31
Karma: 103134
Join Date: Feb 2010
Device: iPhone
Quote:
Originally Posted by ilovejedd View Post
Merging book records is a built-in function of Calibre now (keyboard shortcut M).

First you select the book record you want to keep, then select all other records you want to merge into that and press M.
That would require me to select two books and hit "M" at least 600 times.
Old 04-03-2019, 09:59 AM   #5
ilovejedd
hopeless n00b
Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
Quote:
Originally Posted by lbutlr View Post
That would require me to select two books and hit "M" at least 600 times.
There's no completely automated way to do it.

The "Find Duplicates" plugin only identifies the duplicates. It doesn't delete them (unless they're binary identical).
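For the "binary identical" case, comparing content hashes is one way to see which files are exact copies. The sketch below is an illustration only, not how the plugin works internally: the function names and the list of file suffixes are assumptions, and it only reports groups without deleting anything.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_binary_duplicates(root, suffixes=(".epub", ".mobi", ".azw3", ".pdf")):
    """Group book files under `root` by content hash; keep groups of 2 or more."""
    by_digest = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in suffixes:
            by_digest[file_digest(path)].append(path)
    return [paths for paths in by_digest.values() if len(paths) > 1]
```

Run it against a copy of the library folder rather than the live one, and treat each group as a list of candidates to review.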
Old 04-03-2019, 11:22 AM   #6
theducks
Well trained by Cats
Posts: 30,405
Karma: 58055234
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
There is no SAFE, completely automated way to do this.
Find Duplicates does get false positives. That is why it offers more than 1 'dup' detector.
Nothing beats the ole' Eyeball Mk 1.
And what if they are 'almost' dups, that you want to keep BOTH? (orig edition, 2nd edition? English and French?)
Maybe if you were not in such a rush to begin, you would not be spending all this effort cleaning up?
One of the first things I did with Calibre was document my 900+ paperback collection. It took me over a year of metadata cleanup, a dozen at a whack.

GIGO
Old 04-03-2019, 11:41 AM   #7
Tarana
Wizard
Posts: 4,003
Karma: 38840460
Join Date: Sep 2012
Location: Minneapolis
Device: PWSE, Voyage, K3, HDX, KBasic 7 & 8, Nook Glo3, Echos, Nanos
Quote:
Originally Posted by lbutlr View Post
That would require me to select two books and hit "M" at least 600 times.
Find duplicates just finds the duplicates. Then you have to perform an action on 1/2 of them (either merge or delete). Good project while you are on hold. Happened to me once resulting in over 1000 duplicates. Currently combining a couple of libraries (created before Virtual was available or before I understood how it worked). Time consuming but more productive than doodling while on hold.

Also, it is worth checking each book before deletion. In one of these libraries, I discovered a bunch of books that still had DRM, from when BooksOnTheKnob had lots of freebies. Some of them I am getting resurrected on Amazon so that I can reload them into Calibre, but with some I'm just out of luck. Plus some I am no longer interested in. No guilt, these were free.

Last edited by Tarana; 04-03-2019 at 11:44 AM.
Old 04-03-2019, 08:14 PM   #8
yonkyunior
Cultivator
Posts: 94
Karma: 216
Join Date: Feb 2015
Device: PRST2
In the Add books settings, enable auto-merge and set it to overwrite.
Attached image: AutoMerge-Addbooks-Setting.JPG
Old 04-06-2019, 05:42 PM   #9
lbutlr
Enthusiast
Posts: 31
Karma: 103134
Join Date: Feb 2010
Device: iPhone
It's a shame that I can't use two features that make a computer super useful (telling when things are identical and telling when they are nearly identical) to automate a task instead of clicking through it manually. But I have cobbled together a different solution and created a new calibre library that is free of duplicates, in several orders of magnitude less time than it would have taken me to manually sort through the dupes in calibre.

Still, it seems a shame that "this might not be safe" is treated as "No, no matter what, you'll have to do this the hard way". Sure, put up warnings, but if I want you to automatically merge anything with identical titles and names without starting over, let me.
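A rough sketch of what "identical titles and names" matching could look like outside calibre, assuming the book list has already been exported to a list of records with `id`, `title` and `authors` fields. That record shape, the sample data and the function names are all hypothetical. It only groups ids; any merging would still be done in calibre.

```python
from collections import defaultdict


def exact_key(record):
    """Build a comparison key from title and authors, ignoring case and extra spaces."""
    title = " ".join(record["title"].split()).casefold()
    authors = tuple(sorted(" ".join(a.split()).casefold() for a in record["authors"]))
    return title, authors


def group_exact_duplicates(records):
    """Return lists of record ids whose title and authors match exactly."""
    groups = defaultdict(list)
    for record in records:
        groups[exact_key(record)].append(record["id"])
    return [ids for ids in groups.values() if len(ids) > 1]


# Hypothetical sample records, not taken from any real library.
books = [
    {"id": 1, "title": "Emma", "authors": ["Jane Austen"]},
    {"id": 2, "title": "emma ", "authors": ["Jane Austen"]},
    {"id": 3, "title": "Emma", "authors": ["J Austin"]},
]
print(group_exact_duplicates(books))  # [[1, 2]]
```

Record 3 stays out of the group because its author string differs, which is the limitation of exact matching.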
Old 04-10-2019, 05:50 AM   #10
mbovenka
Wizard
Posts: 2,050
Karma: 13579113
Join Date: Oct 2007
Location: Almere, The Netherlands
Device: Kobo Sage
If titles and authors are indeed *identical*, dup checking on import will catch them. Create a new library, set 'automerge' to 'overwrite' as yonkyunior says, and copy all books in your existing library to the new one.

Duplicates will merge automatically.
Old 04-12-2019, 09:13 AM   #11
lbutlr
Enthusiast
Posts: 31
Karma: 103134
Join Date: Feb 2010
Device: iPhone
Quote:
Originally Posted by mbovenka View Post
If titles and authors are indeed *identical*, dup checking on import will catch them. Create a new library, set 'automerge' to 'overwrite' as yonkyunior says, and copy all books in your existing library to the new one.

Duplicates will merge automatically.
This is what I ended up doing, though it meant rebuilding the entire library and redownloading metadata. And "identical" is a stretch, since sources cannot agree on naming conventions. Importing the library with the metadata failed because that got me back to exactly what I had before, with duplicate books listed as separate entries.

C S Lewis, C S Lewis, Clive S Lewis, Clive Staples Lews, Clive S. Lewis. It's even worse for JRR Tolkien since JRR, J R R, J. R. R are all frequent variants, along with various random seeming expansions of his names.

To say nothing of the disaster that is marking an editor as the author, which adds many more variations to try to sort through. I must have seen 50 variations on the tags for books edited by George RR Martin.

Heck, even with Jane Austen I had one book with the author tagged simply as "Austen", one as J. Austen, and one tagged as J Austin. ¯\_(ツ)_/¯

(Since the metadata doesn't modify the original book, I essentially had to start all over from the original sources)
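To illustrate why these variants defeat exact matching, here is a minimal sketch of a looser author key: surname plus first initial. The helper names are hypothetical. The key is deliberately crude, so it will also lump together different authors who share a surname and first initial, and it will not rescue misspellings such as "Lews" or "Austin". Treat its output as a list of candidates to review, not as something to merge blindly.

```python
from collections import defaultdict


def author_key(name):
    """Reduce an author name to (surname, first initial) for loose matching."""
    # Turn "Lewis, C. S." into "C. S. Lewis" before splitting.
    if "," in name:
        last, _, first = name.partition(",")
        name = f"{first} {last}"
    parts = name.replace(".", " ").split()
    if not parts:
        return ("", "")
    surname = parts[-1].casefold()
    initial = parts[0][0].casefold() if len(parts) > 1 else ""
    return (surname, initial)


def group_author_variants(names):
    """Map each loose key to the distinct spellings that share it (2 or more)."""
    groups = defaultdict(set)
    for name in names:
        groups[author_key(name)].add(name)
    return {key: sorted(v) for key, v in groups.items() if len(v) > 1}


variants = group_author_variants([
    "C S Lewis", "Clive S Lewis", "Clive Staples Lews", "Clive S. Lewis",
    "JRR Tolkien", "J. R. R. Tolkien",
])
for key, spellings in variants.items():
    print(key, spellings)
```

Each group that comes back could then be collapsed to one spelling by hand in calibre.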
Old 04-12-2019, 12:50 PM   #12
DNSB
Bibliophagist
Posts: 39,812
Karma: 154147706
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by lbutlr View Post
This is what I ended up doing, though it meant rebuilding the entire library and redownloading metadata. And "identical" is a stretch, since sources cannot agree on naming conventions. Importing the library with the metadata failed because that got me back to exactly what I had before, with duplicate books listed as separate entries.

C S Lewis, C S Lewis, Clive S Lewis, Clive Staples Lews, Clive S. Lewis. It's even worse for JRR Tolkien since JRR, J R R, J. R. R are all frequent variants, along with various random seeming expansions of his names.

To say nothing of the disaster that is marking an editor as the author, which adds many more variations to try to sort through. I must have seen 50 variations on the tags for books edited by George RR Martin.

Heck, even with Jane Austen I had one book with the author tagged simply as "Austen", one as J. Austen, and one tagged as J Austin. ¯\_(ツ)_/¯

(Since the metadata doesn't modify the original book, I essentially had to start all over from the original sources)
That is why quite a few people use an Intake library. An ebook is imported to the Intake library, the metadata is updated and massaged to match your standards, the modified metadata is added to the ebook, and only then is it moved to the main calibre library. For the examples you gave, I would use bulk metadata edit to set the author name/sort for consistency.

A pain if you already have a large collection but once you have gone through the pain, it makes for a much cleaner and easier to search library.
Old 04-12-2019, 02:29 PM   #13
ilovejedd
hopeless n00b
Posts: 5,110
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
Quote:
Originally Posted by DNSB View Post
For the examples you gave, I would use bulk metadata edit to set the author name/sort for consistency.
I find standardizing authors, tags, etc. much easier using "Manage {field}" in Tag Browser rather than Bulk Metadata Edit.
Old 04-12-2019, 09:56 PM   #14
DNSB
Bibliophagist
Posts: 39,812
Karma: 154147706
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by ilovejedd View Post
I find standardizing authors, tags, etc. much easier using "Manage {field}" in Tag Browser rather than Bulk Metadata Edit.
In the main library, it might be easier but given the small size of my Intake library, BME works for those rare occasions where I want to modify more than one book's author.
Old 04-12-2019, 10:35 PM   #15
BetterRed
null operator (he/him)
Posts: 20,954
Karma: 27620688
Join Date: Mar 2012
Location: Sydney Australia
Device: none
In the Intake library I don't normally bother with Manage or BME. I edit cells in the book list, and if it's worthwhile (e.g. several books in a series) I use Copy & Paste to copy columns from book A to books B, C and D. For me, jumping in and out of dialogue boxes is time-wasting. I also find it very bureaucratic.

BR
Tags
duplicates, merge records


All times are GMT -4.