#1 |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Convert comments field to HTML in bulk
Hi!
I have read in these forums that while comments are displayed/edited as HTML they are not always stored internally as HTML code, depending on where they originally come from. Sometimes they are just stored as plain text. After you edit a comment using the calibre metadata editor it is always stored as HTML internally. That said, is there any way to get all the comments in a bunch of books converted to HTML the same way the metadata editor would do? Some template that performs this operation in bulk? |
#2 |
Well trained by Cats
Posts: 30,678
Karma: 58985420
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
While you may paste plain text, there is always HTML on that tab.
It is just simple HTML: <p>words</p>, the same as you would get by using 'remove formatting' (of selected text).
#3 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I have lots of e-books in ePub format whose Comments field was filled in during import and is stored internally as plain text in the calibre database. I want to convert those comments to HTML before applying some regex Search & Replace tweaks. To that end, I need all my e-books to have their Comments field stored internally as HTML.
|
#4 | |
Resident Curmudgeon
Posts: 78,018
Karma: 142517494
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
All of your comments are converted to HTML if they are not HTML. So you don't need to convert them. Select a book you think has the comments not in HTML. Press the E key and then under the blurb, click the HTML Source tab. You'll see that it is HTML. |
|
#5 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
|
|
#6 | |
Resident Curmudgeon
Posts: 78,018
Karma: 142517494
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
Why is it you need the blurb to be HTML? |
|
#7 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
|
#8 |
Bibliophagist
Posts: 42,699
Karma: 163285328
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Odd. When I download metadata, it shows bold, italic, underlines, etc. and those seem to survive closing and re-opening calibre. Perhaps what is stored in the database is not HTML code but the HTML code seems to survive that storage.
|
#9 |
null operator (he/him)
Posts: 21,359
Karma: 29268140
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
I have books with plain text comments.
I'll try to find a way to HTML-ise them in bulk.
@d3m0sth3n3s - I suggest you ask un_pogaz if the Comments Cleaner plugin can convert plain text to simple HTML.
BR
Last edited by BetterRed; 08-03-2024 at 09:48 PM.
#10 |
creator of calibre
Posts: 44,843
Karma: 26594666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There isn't any such tool, though I don't really understand why you need it. All such a tool would do is mostly just wrap blocks in <p> or <div> tags (I don't recall the details).
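For illustration only, here is a minimal Python sketch of the kind of wrapping described above. It is not calibre's own conversion code: the function name `plain_text_to_html`, the choice to treat blank lines as paragraph breaks, and the outer <div> wrapper are all assumptions made for this example.

```python
import html
import re


def plain_text_to_html(text: str) -> str:
    """Wrap blank-line-separated blocks of plain text in <p> tags.

    Hypothetical helper for illustration; calibre's real conversion
    may differ in its details.
    """
    # Normalise line endings and trim surrounding whitespace.
    text = text.replace("\r\n", "\n").replace("\r", "\n").strip()
    if not text:
        return ""
    paragraphs = []
    # One or more blank lines separate paragraphs (an assumption).
    for block in re.split(r"\n\s*\n", text):
        # Escape &, < and > so the text cannot be read as markup.
        escaped = html.escape(block.strip(), quote=False)
        # Keep single line breaks inside a paragraph as <br>.
        escaped = escaped.replace("\n", "<br>\n")
        paragraphs.append(f"<p>{escaped}</p>")
    # The outer <div> is an assumption, not a documented calibre format.
    return "<div>\n" + "\n".join(paragraphs) + "\n</div>"


if __name__ == "__main__":
    print(plain_text_to_html("The Lancet, Corrected proof."))
```

Escaping before wrapping matters if the resulting HTML is later processed with regex Search & Replace, since a stray `<` or `&` in the original text would otherwise look like markup.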
|
#11 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I’m thinking that I would be better off coding a more complex template that takes plain text comments into account. Just more work, I’m afraid.
PS: I know it’s not a big problem; it just seems inconsistent to me that the calibre database may store imported, unstyled comments either as HTML code or as plain text depending only on whether you have edited them in the GUI metadata editor.
|
#12 | |
null operator (he/him)
Posts: 21,359
Karma: 29268140
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
The books in my Test library that have plain text comments would have acquired them from the format files I added… e.g. Book 801 came as a PDF from The Lancet website - calibre extracts the Description property from the PDF metadata and puts it in the Comments column, viz:
Code:
The Lancet, Corrected proof. doi:10.1016/S0140-6736(22)01585-9
Code:
<description xmlns="http://purl.org/dc/elements/1.1/"><i>&amp;ldquo;It would be an endless task to trace the variety of meannesses, cares, and sorrows, into which women are plunged by the prevailing opinion that they were created rather to feel than reason, and that all the power they obtain, must be obtained by their charms and weakness.&amp;rdquo;</i>&#8212;Mary Wollstonecraft <br>Composed in 1790, Mary Wollstonecraft&amp;rsquo;s seminal feminist tract <i>A Vindication of the Rights of Woman</i> broke new ground in its demand for women&amp;rsquo;s education. A Vindication remains one of history&amp;rsquo;s most important and elegant broadsides against sexual oppression. In her introduction, renowned socialist feminist Sheila Rowbotham casts Wollstonecraft&amp;rsquo;s life and work in a new light.</description>
|
#13 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
I just think that if calibre turns a simple comment into HTML code just by, for example, simply adding a comma with no styling involved, whenever you edit comments with the GUI metadata editor, then it would be better for consistency to just store them all in HTML from the very beginning, no matter how comments were defined in the description embedded within the source e-book file you import into your library. That’s just my opinion on the subject.
Last edited by d3m0sth3n3s; 08-04-2024 at 08:44 AM.
|
#14 |
Chalut o/
Posts: 436
Karma: 145424
Join Date: Dec 2017
Device: Kobo
|
The Comments Cleaner plugin can convert plain text comments to HTML comments.
(Caution: the default settings do many other things.)
To search for the non-HTML comments, use the query:
Code:
comments:true and not comments:"^</div>"
Note: An HTML comment has at least one </div>
Last edited by un_pogaz; 08-05-2024 at 06:05 AM.
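The same rule of thumb (a comment counts as HTML if it contains at least one </div>) can be sketched outside calibre in a few lines of Python. This is only an illustration of the heuristic from the note above: the function names and the `{book_id: comment}` dictionary are hypothetical and are not part of calibre or the Comments Cleaner plugin.

```python
def is_html_comment(comment: str) -> bool:
    """Heuristic from the note above: HTML comments contain a </div>."""
    return "</div>" in comment.lower()


def non_html_ids(comments_by_id: dict) -> list:
    """Return the ids of books whose comment is non-empty but not HTML.

    `comments_by_id` is a hypothetical {book_id: comment_text} mapping
    used only for this sketch.
    """
    return [
        book_id
        for book_id, comment in comments_by_id.items()
        # Skip empty comments, mirroring the `comments:true` part of the query.
        if comment and not is_html_comment(comment)
    ]


if __name__ == "__main__":
    sample = {
        1: "<div><p>Already HTML</p></div>",
        2: "The Lancet, Corrected proof.",
        3: "",
    }
    print(non_html_ids(sample))
```

Like the search query, this would misclassify a comment that is HTML but happens to contain no <div> element, so it is best treated as a way to shortlist books for review.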
#15 | |
Member
Posts: 23
Karma: 5854
Join Date: Jun 2012
Location: Spain
Device: Kindle DX, Kindle Paperwhite, and Kindle Scribe
|
Quote:
Also, the tricks to highlight books without HTML comments are quite useful, too. |
|