08-03-2024, 05:35 PM | #1 |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Convert comments field to HTML in bulk
Hi!
I have read in these forums that while comments are displayed/edited as HTML they are not always stored internally as HTML code, depending on where they originally come from. Sometimes they are just stored as plain text. After you edit a comment using the calibre metadata editor it is always stored as HTML internally. That said, is there any way to get all the comments in a bunch of books converted to HTML the same way the metadata editor would do? Some template that performs this operation in bulk? |
08-03-2024, 06:06 PM | #2 |
Well trained by Cats
Posts: 30,452
Karma: 58055868
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
While you may paste plain text, there is always HTML on that tab.
It is just simple. <p>words</p> Same as you would get by 'using the remove formatting (of selected text) |
08-03-2024, 06:25 PM | #3 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I have lots of e-books in ePub format whose Comments field was filled during the import process and it is stored internally in plain text (in the Comments field in the calibre database). I want to convert the Comments format for those e-books to HTML prior to some regex Search & Replace tweak. To that end, I need that all my e-books have their Comments field stored internally in HTML. |
|
08-03-2024, 06:48 PM | #4 | |
Resident Curmudgeon
Posts: 76,492
Karma: 136564766
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
All of your comments are converted to HTML if they are not HTML. So you don't need to convert them. Select a book you think has the comments not in HTML. Press the E key and then under the blurb, click the HTML Source tab. You'll see that it is HTML. |
|
08-03-2024, 06:55 PM | #5 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
|
|
08-03-2024, 06:58 PM | #6 | |
Resident Curmudgeon
Posts: 76,492
Karma: 136564766
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
Why is it you need the blurb to be HTML? |
|
08-03-2024, 07:13 PM | #7 |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
It’s bold because calibre stores it as HTML But that is AFTER you edited the Comments field with the GUI metadata editor. If it was imported as plain text and you left the Comments field untouched in the GUI it is kept as plain text internally in calibre database. The HTML code you see in the Comments metadata editor is not an accurate representation of the real information stored in the database. That can be plain text!
|
08-03-2024, 07:47 PM | #8 |
Bibliophagist
Posts: 40,603
Karma: 157444382
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Odd. When I download metadata, it shows bold, italic, underlines, etc. and those seem to survive closing and re-opening calibre. Perhaps what is stored in the database is not HTML code but the HTML code seems to survive that storage.
|
08-03-2024, 09:28 PM | #9 |
null operator (he/him)
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
I have books with plain text comments
I'll try to find a way to HTML-ise them in bulk. @d3m0sth3n3s - I suggest you ask un_pogaz if the Comments Cleaner plugin can convert plain text to simple HTML. BR Last edited by BetterRed; 08-03-2024 at 09:48 PM. |
08-04-2024, 12:42 AM | #10 |
creator of calibre
Posts: 44,565
Karma: 24495948
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
THere isnt any such tool, though I dont really understand why you need it. All such a tool would do is mostly just wrap blocks in <p> or <div> tags ( dont recall the details).
|
08-04-2024, 06:09 AM | #11 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I’m thinking that I would be better coding a more complex template that takes plain text comments into account. Just more work, I’m afraid. PS: I know it’s not a big problem, it just seems inconsistent to me that the calibre database may store imported comments without styles both in HTML code and in plain text just because you edited them in the GUI metadata editor or not. |
|
08-04-2024, 07:32 AM | #12 | |
null operator (he/him)
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
The books in my Test library that have plain text comments would have acquired them from the format files I added… e.g. Book 801 came as a PDF from The Lancet website - calibre extracts the Description property from the PDF metadata and puts it in the Comments column, viz: Code:
The Lancet, Corrected proof. doi:10.1016/S0140-6736(22)01585-9 Code:
<description xmlns="http://purl.org/dc/elements/1.1/"><i>&amp;ldquo;It would be an endless task to trace the variety of meannesses, cares, and sorrows, into which women are plunged by the prevailing opinion that they were created rather to feel than reason, and that all the power they obtain, must be obtained by their charms and weakness.&amp;rdquo;</i>&#8212;Mary Wollstonecraft <br>Composed in 1790, Mary Wollstonecraft&amp;rsquo;s seminal feminist tract <i>A Vindication of the Rights of Woman</i> broke new ground in its demand for women&amp;rsquo;s education. A Vindication remains one of history&amp;rsquo;s most important and elegant broadsides against sexual oppression. In her introduction, renowned socialist feminist Sheila Rowbotham casts Wollstonecraft&amp;rsquo;s life and work in a new light.</description> |
|
08-04-2024, 08:41 AM | #13 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I just think that if calibre turns a simple comment into HTML code just by, for example, simply adding a comma with no styling involved, whenever you edit comments with the GUI metadata editor, then it would be better for consistency to just store them all in HTML from the very beginning, no matter how comments were defined in the description embedded within the source e-book file you import into your library. That’s just my opinion on the subject. Last edited by d3m0sth3n3s; 08-04-2024 at 08:44 AM. |
|
08-04-2024, 04:36 PM | #14 |
Chalut o/
Posts: 421
Karma: 145424
Join Date: Dec 2017
Device: Kobo
|
The Plugin Comments Cleaner can convert plain text Comments to HTML comment.
(caution: default settings do many other things) To search the Not-HTML comment use the query: Code:
comments:true and not comments:"^</div>"
Note: A html comment has at least one </div> Last edited by un_pogaz; 08-05-2024 at 06:05 AM. |
08-05-2024, 07:23 PM | #15 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
Also, the tricks to highlight books without HTML comments are quite useful, too. |
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Bulk convert HTML characters for epub | christopher88 | ePub | 13 | 12-02-2013 01:23 PM |
Metadata Comments Field - How to best tweak the html? | Chris_Snow | Library Management | 5 | 08-13-2013 03:58 AM |
Comments field | emeybee | Calibre | 1 | 05-27-2010 03:01 PM |
Comments field | banjomike | Calibre | 7 | 04-08-2010 03:18 PM |
Comments field | hyperact | Calibre | 3 | 12-28-2008 02:38 PM |