11-02-2018, 03:18 PM | #1 |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
New ToC filter for "duplicates" (problem case) - oops
The Aug, 24 2018 Calibre release 3.30 has the following new feature:
"ToC Editor: When generating ToCs using headings/XPath ignore duplicate entries at the same level that have the same text." This (new) feature is currently non-optional and one circumstance where it causes a problem is with ePubs that have multiple parts and each part re-starts the chapter numbering. If the first part has five chapters (#1-5) and the second has seven (#1-7), then the first five chapters in the second part are excluded when trying to build the ToC. A ToC generator that focuses only on the chapter numbers will display chapters 1-7 with no indication that there are two parts and that five of the twelve chapters are not displayed. ________________________________________________ ERROR - the above comments refer to the new feature as "non-optional" but as the Ducks points out in later comments below, there is an easy option to turn off the feature. Last edited by Rob557; 11-05-2018 at 07:01 AM. Reason: belatedly inserting "ERROR" comment at bottom |
11-02-2018, 04:40 PM | #2 |
Well trained by Cats
Posts: 30,441
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Oooo! This is common in Omnibus editions.
I expect that some sort of user smarts would be expected (to turn off this option). (just another example of why I avoid BULK conversions. It only takes one "I forgot about that" ) |
Advert | |
|
11-02-2018, 05:24 PM | #3 |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
I'm not sure the ToC glitch arises as a direct result of bulk conversions (?).
When the ToC for a single ePub (new or converted) is being created (or regenerated to make corrections) using various "edit TOC" functions, there will be a problem if the ePub has multiple parts and each part re-starts the numbering for the chapters. As an example, if a book has five parts, and the parts have from three to eight numbered chapters each, then the total number of chapters in the ToC will be eight because all the other chapters will be considered to be "duplicative" Multi-part Omnibus editions would be a problem only where the chapter headers are non-descriptive (e.g. Chapter 1), and therefore would appear duplicative from one part to the next. |
11-02-2018, 06:31 PM | #4 | |
Well trained by Cats
Posts: 30,441
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
It is the USER who tosses a book to conversion (singly or in bulk) without considering WHAT needs (and absolutely, doesn't) to be done. For the most part, I avoid same format (eg Epub to Epub) conversions. I would rather hand edit (I have a lot of code snippets I use) and KNOW what got changed. |
|
11-02-2018, 07:48 PM | #5 |
null operator (he/him)
Posts: 20,989
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
|
Advert | |
|
11-03-2018, 03:42 AM | #6 |
creator of calibre
Posts: 44,509
Karma: 24495778
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
This feature refers to auto-generating tocs from contents. This kinf of auto generation can never be made bulletproof. If the heuristics calibre uses dont work for a particular book, you can always crate the toc by hand.
If a book already has a toc, this is not relevant to it. |
11-03-2018, 01:19 PM | #7 | |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
Quote:
There are a number of circumstances where it is necessary to regenerate a ToC. I can see where the issue of duplicates from the ToC generation process could at times be frustrating for less experienced users (moreso using XPath, whereas I don't recall auto-generation from headers producing duplicates). The change introduced Aug 24 will be quite helpful for those users less familiar with the ToC generation techniques, but if there is no way for a user to switch off that filter then the change can produce unexpected errors and block efforts to produce a correct ToC. I'll post a list of some books from which examples can be selected, where the question becomes HOW would the ToC generation process be able to re-generate the ToC for those books. The new filter seems to block any efforts. |
|
11-03-2018, 01:23 PM | #8 | |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
Quote:
Arthur Hailey - Airport Helen Hollick - Shadow of the King John Grisham - Rogue Lawyer Joseph Conrad - Victory Leo Tolstoy - Anna Karenina Leon Uris - Mila 18 Norman Mailer - The Naked and the Dead Stephen King - The Dead Zone How then would the duplicate-filtered ToC generation process be able to regenerate the ToC for any of the above books (assuming the ePub version you have access to have the same multi-part chapter renumbering as appears in my versions)? |
|
11-03-2018, 01:52 PM | #9 |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
PS. If the new duplicate-filter could be switched off by a user then it is true that the user may subsequently have to deal with the generation of duplicates within a ToC, but it is generally very easy to simply highlight the duplicates and delete them.
|
11-03-2018, 06:35 PM | #10 | |
Not Quite Dead
Posts: 195
Karma: 654170
Join Date: Jul 2015
Device: Paperwhite 4; Galaxy Tab
|
Quote:
I routinely re-generate TOCs using xpath. Science and history books especially have chapter structure in parts with numbered chapters. It seems unnecessarily tedious to have to hunt and peck to find and recreate missing chapters given how powerful and convenient the Calibre xpath facility is in my older copy of Calibre... |
|
11-03-2018, 09:01 PM | #11 | |
Well trained by Cats
Posts: 30,441
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
|
|
11-03-2018, 11:48 PM | #12 |
Zealot
Posts: 108
Karma: 810
Join Date: Jul 2012
Device: Kobo
|
oops
Oops. My mistake. Thank you to the Ducks for pointing out that at the same time the ToC duplicate-filtering was introduced (Aug 24), there was also a tick box option provided at the bottom of the screen display (after selecting the XPath option) labeled "do not add duplicate entries at the same level", so that the option can be turned on or off. Silly me.
In fact I think I ticked that box when the feature was introduced, and then forgot about it when I later encountered the problem case described above. I did an internet search when I encountered the problem and only saw a comment from 'Ben L' in a different forum asking for the feature to be optional ... without realizing (recalling) that the optional feature was in fact built in by Kovid prior to releasing the new feature. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Several questions about the "BBB Filter" | Mr.Samuel | Kindle Developer's Corner | 1 | 05-02-2016 06:20 AM |
My "quest" for a JBL case (Case\Cover/Sleeve\Skin links) | tomereader | Ectaco jetBook | 18 | 12-02-2011 03:31 PM |
Getting calibre to detect "Prologue" and "Epilogue" for TOC | sherman | Calibre | 2 | 09-20-2010 03:21 AM |