Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 12-24-2014, 04:02 PM   #1
JGB
Groupie
JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.
 
Posts: 168
Karma: 1010000
Join Date: Jul 2008
Device: PRS505
Pulling specific metadata from files?

I've got a fairly good sized ebook library, and I've been cleaning it up lately, some things I prefer to set manually, but some fields I would like to have some info but am less fussy about it being perfect. The big one is published date, I don't really care if it's totally accurate, but it's nice for most books to have a general idea of when it was written so I can pick what I feel like reading.

Is there a way to have it pull just this data or specific columns of data from the file's meta-data without having it mess up all the series info I manually entered? I've tried on a few files and it overwrites the series.
JGB is offline   Reply With Quote
Old 12-24-2014, 05:52 PM   #2
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 75,722
Karma: 134321338
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
The date of publish will be picked up if it's in the metadata. If not, you will have to enter it manually.

How are you updating your metadata such that is overwrites the series info?
JSWolf is online now   Reply With Quote
Advert
Old 12-24-2014, 06:09 PM   #3
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,861
Karma: 27620684
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by JGB View Post
I've got a fairly good sized ebook library, and I've been cleaning it up lately, some things I prefer to set manually, but some fields I would like to have some info but am less fussy about it being perfect. The big one is published date, I don't really care if it's totally accurate, but it's nice for most books to have a general idea of when it was written so I can pick what I feel like reading.

Is there a way to have it pull just this data or specific columns of data from the file's meta-data without having it mess up all the series info I manually entered? I've tried on a few files and it overwrites the series.
@JGB - see Preferences ->Metadata download, here you can configure what columns you want to download. You can also configure what comes from each source via the Configure selected source button.

BR
BetterRed is offline   Reply With Quote
Old 12-24-2014, 06:50 PM   #4
JGB
Groupie
JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.
 
Posts: 168
Karma: 1010000
Join Date: Jul 2008
Device: PRS505
Hi, thanks for the info,
however I am not trying to download the information from the internet, I'm trying to pull it from the books already existing metadata. The published column is not overwriting the series column, the series info from the metadata is overwriting my correct manually entered info with wrong info.
In the edit metadata mode I can load data from the book itself all or nothing.
Download mode works ok for selective data, but it takes forever vs pulling it from the metadata of the book itself when that's possible.
JGB is offline   Reply With Quote
Old 12-24-2014, 07:02 PM   #5
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,861
Karma: 27620684
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Aah - I don't think there is any way to selectively pull metadata elements from the format files, its all or nothing.

Are you interested in the pubdate of the edition you have, or the pubdate of the original, if the latter then I don't know of anyway to download or extract, I cut and paste from the viewer - if its there?

You could set the bulk metadata edit download off to just get pubdate from one source and let it run overnight. The metadata download speed is choked, this is to prevent calibre users overloading the source providers servers. If calibre didn't do that then they would complain to Kovid. They provide the information for free with no ad revenue, so they deserve a fair shake of the sauce bottle

BR

Last edited by BetterRed; 12-24-2014 at 07:12 PM.
BetterRed is offline   Reply With Quote
Advert
Old 12-24-2014, 08:20 PM   #6
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 75,722
Karma: 134321338
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by JGB View Post
Hi, thanks for the info,
however I am not trying to download the information from the internet, I'm trying to pull it from the books already existing metadata. The published column is not overwriting the series column, the series info from the metadata is overwriting my correct manually entered info with wrong info.
In the edit metadata mode I can load data from the book itself all or nothing.
Download mode works ok for selective data, but it takes forever vs pulling it from the metadata of the book itself when that's possible.
You are doing it backwards. Load the eBook and THEN fix the metadata. Don't have Calibre pull the metadata from the eBook once you've fixed things.
JSWolf is online now   Reply With Quote
Old 12-31-2014, 05:07 PM   #7
JGB
Groupie
JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.
 
Posts: 168
Karma: 1010000
Join Date: Jul 2008
Device: PRS505
Quote:
Originally Posted by JSWolf View Post
You are doing it backwards. Load the eBook and THEN fix the metadata. Don't have Calibre pull the metadata from the eBook once you've fixed things.
That could work if there's a way to specify during the import to pull the data from the file names as the first priority, filling in data from the metadata present in the file only when that's missing, or to restrict the use of that metadata to only covers, publisher, published date and comments.
Because the metadata is not as accurate as the filename data for author and series, I can't permit caliber to pull only metadata from the internal information, it makes a mess of the imported books.
I don't need information from the online sources and that speed limit is annoying when the data is already accurate and present in the file names for most sections I care about(author, series) from the filenames, it's just a few pieces of missing information I'd like to pull from the file's metadata.

EDIT: Or is there a way to have Calibre use the file's metadata during import, and then over-write author and series from the filename data which I've already got corrected?

Last edited by JGB; 12-31-2014 at 05:14 PM.
JGB is offline   Reply With Quote
Old 12-31-2014, 05:11 PM   #8
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 75,722
Karma: 134321338
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by JGB View Post
That could work if there's a way to specify during the import to pull the data from the file names as the first priority, filling in data from the metadata present in the file only when that's missing, or to restrict the use of that metadata to only covers, publisher, published date and comments.
Because the metadata is not as accurate as the filename data for author and series, I can't permit caliber to pull only metadata from the internal information, it makes a mess of the imported books.
I don't need information from the online sources and that speed limit is annoying when the data is already accurate and present in the file names for most sections I care about(author, series) from the filenames, it's just a few pieces of missing information I'd like to pull from the file's metadata.
It doesn't matter where the data is coming from. You then fix the entry's information and use Polish Books to update the metadata and/or cover in the eBook. So you do get things fixed without having to worry about what the original metadata or filename is.
JSWolf is online now   Reply With Quote
Old 12-31-2014, 05:20 PM   #9
JGB
Groupie
JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.JGB ought to be getting tired of karma fortunes by now.
 
Posts: 168
Karma: 1010000
Join Date: Jul 2008
Device: PRS505
Quote:
Originally Posted by JSWolf View Post
It doesn't matter where the data is coming from. You then fix the entry's information and use Polish Books to update the metadata and/or cover in the eBook. So you do get things fixed without having to worry about what the original metadata or filename is.
That would seem to require a line by line editing of each individual book though, when there is already accurate data available, it's just split between two locations(filename and metadata).
Sorry if I'm being dense, I'm not very good at all of this, which is why I'm asking for help.
JGB is offline   Reply With Quote
Old 12-31-2014, 05:59 PM   #10
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,861
Karma: 27620684
Join Date: Mar 2012
Location: Sydney Australia
Device: none
@JGB - You can extract the metadata from a format file into an OPF file with the ebook-meta command

With a bit of yourFavouriteTextEditor-fu you could wrangle/mangle the OPF into a CSV file and input that into the Import List plugin to update publisher and publication date.

The OPF file is XML, there are some command line XML2CSV converters around maybe you could make use of one of them.

Would be nice if the ebook-meta program had a --to-csv option

BR

Last edited by BetterRed; 12-31-2014 at 06:06 PM.
BetterRed is offline   Reply With Quote
Old 12-31-2014, 10:46 PM   #11
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 75,722
Karma: 134321338
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by JGB View Post
That would seem to require a line by line editing of each individual book though, when there is already accurate data available, it's just split between two locations(filename and metadata).
Sorry if I'm being dense, I'm not very good at all of this, which is why I'm asking for help.
Read the following and see if it helps.

http://manual.calibre-ebook.com/gui.html#add-books
JSWolf is online now   Reply With Quote
Old 01-05-2015, 02:47 AM   #12
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,421
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
Quote:
Originally Posted by JGB View Post
That could work if there's a way to specify during the import to pull the data from the file names as the first priority, filling in data from the metadata present in the file only when that's missing, or to restrict the use of that metadata to only covers, publisher, published date and comments.
Because the metadata is not as accurate as the filename data for author and series, I can't permit caliber to pull only metadata from the internal information, it makes a mess of the imported books.
I don't need information from the online sources and that speed limit is annoying when the data is already accurate and present in the file names for most sections I care about(author, series) from the filenames, it's just a few pieces of missing information I'd like to pull from the file's metadata.

EDIT: Or is there a way to have Calibre use the file's metadata during import, and then over-write author and series from the filename data which I've already got corrected?
I would like to be able to do this too. I want to set a custom column to equal the original filename.

However, at the current time it is impossible.

I am wondering now if I can do something with a custom calibre-debug script...

Last edited by eschwartz; 01-05-2015 at 02:52 AM.
eschwartz is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Viewing format-specific metadata Chad Cloman Library Management 4 12-10-2013 10:16 PM
Download metadata from country specific site? gapstar Library Management 1 04-16-2013 05:57 AM
How to delete/suppress Calibre-specific metadata in .opf file? Doitsu Calibre 1 10-30-2012 06:31 AM
recent update no longer pulling cover from dotepub files sdow1 Calibre 3 02-14-2012 10:14 PM


All times are GMT -4. The time now is 07:46 AM.


MobileRead.com is a privately owned, operated and funded community.