Old 09-08-2024, 02:16 PM   #1
Jasonc310771
Member
Posts: 11
Karma: 10
Join Date: Sep 2024
Device: calibre
Converting Word doc to EPUB: page break shows when it is not present in the Word file

I am using calibre 64-bit, version 7.17.0, and Word 2010.
I am having issues with page breaks appearing at parts of the book where there are no page breaks. Attached is the test Word file with the actual title. I have reinstalled Word, reinstalled calibre, reset the Word template, and tried a load of other things to fix this issue. There are no visible page breaks in the document, yet they still show in the converted EPUB file!
The page break shows before the title "The book of hymns", yet it is not in the Word file.
I have not changed any configs in Word or calibre.
Can anyone tell me why this might be happening?
TIA
Attached Files
File Type: docx NEW INSTALL TEST1.docx (13.3 KB, 12 views)
Old 09-08-2024, 02:39 PM   #2
Jasonc310771
This is another Word doc that shows all the chapters correctly, and there are no page breaks. I have tried to use the copy format method, but this did not work.
Attached Files
File Type: docx TEST 1 this work.docx (14.1 KB, 6 views)
Old 09-08-2024, 02:42 PM   #3
theducks
Well trained by Cats
Posts: 30,326
Karma: 58000000
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Imports typically insert page breaks before headings (h1, h2, h3).
Do not use headings just for STYLE. You can make a stylesheet class that LOOKS just like a heading but will not be processed like one (e.g. for the semi-automatically generated TOC).
Old 09-08-2024, 03:07 PM   #4
Jasonc310771
It is only this book that the issue happens to. The other book is fine and the chapter markers work fine. If I remove all headings, there are no issues with the page breaking, but then there are no markers to show the start of a chapter. I would like to keep it simple and not have to edit the book once it is imported into calibre.
Is there another way to mark the chapter titles, so that readers can swipe to see the chapter names and jump to one without having to scroll through the whole book to find where they got to (should they view another book in between)?
Old 09-08-2024, 03:11 PM   #5
Jasonc310771
Oh, I posted but the post disappeared.
I would like to keep things simple, without having to edit the book after import into calibre. What other methods could I use to set the chapter titles? I set these up so it is easier for people to find where they got to, say if they viewed another book in between.
Old 09-08-2024, 04:26 PM   #6
theducks
Quote:
Originally Posted by Jasonc310771 View Post
It is only this book that the issue happens to. The other book is fine and the chapter markers work fine. If I remove all headings, there are no issues with the page breaking, but then there are no markers to show the start of a chapter. I would like to keep it simple and not have to edit the book once it is imported into calibre.
Is there another way to mark the chapter titles, so that readers can swipe to see the chapter names and jump to one without having to scroll through the whole book to find where they got to (should they view another book in between)?
You do not REMOVE headings. You simply change the H tags to a P or Div.
BTW, this really only applies to incorrect usage (something that is not a real heading, e.g. a bold sign).
Old 09-08-2024, 06:47 PM   #7
BetterRed
null operator (he/him)
Posts: 20,894
Karma: 27620684
Join Date: Mar 2012
Location: Sydney Australia
Device: none
The two DOCXs you posted use different templates, hence different styles.

NEW INSTALL TEST1
[Image: NEW INSTALL TEST1.jpg]

TEST 1 this work
[Image: TEST 1 this work.jpg]

Consequently we end up with different stylesheets and a different XHTML file break-up in the EPUB.

[Image: CSS files.jpg]

I convert up to a dozen DOCX to EPUB a week. The only thing I do to the EPUBs is to remove superfluous metadata from the OPF and rename the .XHTML files to match chapter numbers and/or names. I never touch the formatting. If I see something wrong, I correct the manuscript DOCX and redo the conversion.

BR
Old 09-08-2024, 08:18 PM   #8
retiredbiker
Evangelist
Posts: 416
Karma: 2289864
Join Date: May 2013
Location: Ontario, Canada
Device: Kindle KB, Oasis, Pop_Os!, Jutoh, Kobo Forma
Quote:
Originally Posted by Jasonc310771 View Post
The page break shows before the title "The book of hymns" yet it is not in the word file.
In your test file, THE BOOK OF HYMNS has a style of "Heading 1", just like the other headings -- "This is story 1" and so on. So the conversion is treating them just the same.

Give your THE BOOK OF HYMNS line a different, non-heading style in Word and it will no longer appear as a break in the conversion. You can make the style LOOK like anything you want, just get the "heading" type off it.

Word will not show a page break just because of a heading. Page breaks in Word are completely different things than they are in an epub.
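
To see which paragraphs in a DOCX actually carry a heading style without opening Word, the file can be read as a zip archive. What follows is a minimal Python sketch, not a calibre feature. `heading_paragraphs` is a hypothetical helper name, and the sketch assumes the style IDs start with "Heading" (as in "Heading1" for English-language Word; custom or localized templates may use other IDs).

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def heading_paragraphs(docx_path):
    """Return (style_id, text) for each paragraph whose style ID looks like a heading."""
    with zipfile.ZipFile(docx_path) as z:
        root = ET.fromstring(z.read("word/document.xml"))
    found = []
    for p in root.iter(W + "p"):
        style = p.find(f"{W}pPr/{W}pStyle")
        style_id = style.get(W + "val") if style is not None else None
        # Assumption: heading styles have IDs beginning with "Heading".
        if style_id and re.match(r"heading", style_id, re.IGNORECASE):
            text = "".join(t.text or "" for t in p.iter(W + "t"))
            found.append((style_id, text))
    return found
```

Calling `heading_paragraphs("NEW INSTALL TEST1.docx")` on the attached file should list every line that Word has tagged as a heading, which makes it easy to spot a title page line that shares a style with the story titles.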
Old 09-08-2024, 09:22 PM   #9
BetterRed
Quote:
Originally Posted by retiredbiker View Post
Word will not show a page break just because of a heading. Page breaks in Word are completely different things than they are in an epub.
Word will if you 'ask' it nicely:

[Image: Clipboard.jpg]

As will LO Writer, WordPerfect, etc., and I'm pretty sure DISOSS and PROFS did too.

BR
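
The screenshot is not reproduced here. Assuming the setting meant is Word's paragraph option "Page break before", it is stored in the DOCX as a `w:pageBreakBefore` element, either on a style or on an individual paragraph. The Python sketch below looks for that flag in both places. `page_break_before` and `_is_on` are hypothetical helper names, the sketch does not follow style inheritance (`w:basedOn`), and it cannot tell you whether a given converter acts on the flag.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used in document.xml and styles.xml
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def _is_on(el):
    # An on/off element counts as enabled unless w:val explicitly disables it.
    return el is not None and el.get(W + "val", "true") not in ("0", "false", "off")


def page_break_before(docx_path):
    """Return (style IDs, paragraph texts) that carry the 'Page break before' flag."""
    with zipfile.ZipFile(docx_path) as z:
        doc = ET.fromstring(z.read("word/document.xml"))
        styles = ET.fromstring(z.read("word/styles.xml"))
    flagged_styles = [
        s.get(W + "styleId")
        for s in styles.iter(W + "style")
        if _is_on(s.find(f"{W}pPr/{W}pageBreakBefore"))
    ]
    flagged_paragraphs = [
        "".join(t.text or "" for t in p.iter(W + "t"))
        for p in doc.iter(W + "p")
        if _is_on(p.find(f"{W}pPr/{W}pageBreakBefore"))
    ]
    return flagged_styles, flagged_paragraphs
```

If either list comes back non-empty for the problem document and empty for the one that converts cleanly, that difference is a good place to start looking.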
Old Today, 06:18 AM   #10
Jasonc310771
I have checked all headings and all have the same style/font/spacing/margin; I just apply the heading and change the colour and size. I have tried copy/format and paste, but this still does not fix it. Why would all other headings work as expected, yet this one fails? I have deleted the heading and a few lines of text before and after it, then manually retyped the text. But the page still breaks at this very same section, yet there are no hidden breaks or special characters.

In calibre I go into Edit book and view the HTML code. Here is the end of the HTML before the unwanted page break:

<p class="block_4"><span class="calibre5"> </span>‘It’s all right,’<span class="calibre5"> he says, softly. </span>‘<span class="calibre5">I’m not blaming you. It’s only natural really, that you should feel like that. Only</span>…’ He pauses, and I look up. ‘Your home is in England now, Odette,’<span class="calibre5"> he continues, very gently. </span>‘Don’t you think you should be making the best of your life here?’</p>
<p class="block_4"><span class="calibre5"> I bite my lip. </span>‘<i class="calibre6">J’essaie,</i>’<span class="calibre5"> I reply. Then, </span><i class="calibre6">‘C’est difficile.</i>’<span class="calibre5"> And then I realise. I’m still speaking in French. And in fact, in all the years I live in England, I never lose my French accent. And that night, and every other night, for all the rest of my life, I dream in French.</span></p>
<p class="block_5">*</p>
</body></html>


This is the HTML after the unwanted page break:

Code:
<?xml version='1.0' encoding='utf-8'?>
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
  <head>
    <title>Unknown</title>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8"/>
  <link rel="stylesheet" type="text/css" href="stylesheet.css"/>
<link rel="stylesheet" type="text/css" href="page_styles.css"/>
</head>
  <body class="calibre">
<h1 class="block_9" id="toc_id_138">THE BASKET</h1>
	<p class="block_10">*</p>
	<p class="block_4"><span class="calibre5">On my birthday, there’s a letter from Maman.  Crisp blue envelope covered in thin strokes of ink.  Send it back.  I don’t want to read it.  I don’t want to read the word we can’t ever bring ourselves to say.</span> <i class="calibre6">Cancer</i>. The same in both languages.</p>


page_styles.css:
@page {
  margin-bottom: 5pt;
  margin-top: 5pt;
}

This is stylesheet.css:

.block_ {
  display: block;
  font-family: serif;
  font-size: 0.75em;
  line-height: 1.08;
  text-align: center;
  margin: 0 0 8pt;
  padding: 0;
}
.block_1 {
  display: block;
  font-size: 2em;
  line-height: 1.08;
  text-align: center;
  margin: 0 0 8pt;
  padding: 0;
}
.block_2 {
  display: block;
  line-height: 1.2;
  text-align: center;
  margin: 0 0 8pt;
  padding: 0;
}
.block_3 {
  display: block;
  line-height: 1.2;
  page-break-after: always;
  margin: 0 0 8pt;
  padding: 0;
}
.block_4 {
  display: block;
  line-height: 1.2;
  margin: 0 0 8pt;
  padding: 0;
}
.block_5 {
  display: block;
  font-family: serif;
  font-size: 0.75em;
  line-height: 1.08;
  margin: 0 0 8pt;
  padding: 0;
}
.block_6 {
  display: block;
  font-style: italic;
  line-height: 1.2;
  margin: 0 0 8pt;
  padding: 0;
}
.block_7 {
  display: block;
  font-size: 1.125em;
  font-weight: bold;
  line-height: 1.2;
  page-break-after: avoid;
  page-break-inside: avoid;
  margin: 12pt 0 0;
  padding: 0;
}
.block_8 {
  display: block;
  font-size: 1.125em;
  line-height: 1.2;
  margin: 0 0 8pt;
  padding: 0;
}
.block_9 {
  display: block;
  font-size: 1.125em;
  font-weight: normal;
  line-height: 1.2;
  page-break-after: avoid;
  page-break-inside: avoid;
  margin: 12pt 0 0;
  padding: 0;
}
.block_10 {
  display: block;
  font-family: serif;
  font-size: 0.75em;
  margin: 0;
  padding: 0;
}
.block_11 {
  display: block;
  line-height: 1.2;
  text-align: justify;
  margin: 0 0 8pt;
  padding: 0;
}
.block_12 {
  display: block;
  font-family: serif;
  font-size: 0.75em;
  line-height: 1.08;
  text-align: justify;
  margin: 0 0 8pt;
  padding: 0;
}
.block_13 {
  display: block;
  font-size: 1.125em;
  line-height: 1.2;
  page-break-after: always;
  margin: 0 0 8pt;
  padding: 0;
}
.block_14 {
  display: block;
  font-size: 0.75em;
  line-height: 1.08;
  margin: 0 0 8pt;
  padding: 0;
}
.block_15 {
  display: block;
  font-size: 1.125em;
  font-style: italic;
  line-height: 1.2;
  margin: 0 0 8pt;
  padding: 0;
}
.block_16 {
  display: block;
  font-size: 1.125em;
  line-height: 1.2;
  text-align: center;
  margin: 0 0 8pt;
  padding: 0;
}
.calibre {
  display: block;
  font-family: "Garamond", serif;
  font-size: 1em;
  padding-left: 0;
  padding-right: 0;
  margin: 0 5pt;
}
.calibre1 {
  line-height: 1.2;
}
.calibre2 {
  height: 52.5pt;
  line-height: 1.2;
  width: 116pt;
}
.calibre3 {
  display: block;
  page-break-after: always;
  margin: 0;
  padding: 0;
}
.calibre4 {
  display: block;
  line-height: 1.2;
}
.calibre5 {
  line-height: 1.2;
  white-space: pre-wrap;
}
.calibre6 {
  font-style: italic;
  line-height: 1.2;
}
.calibre7 {
  font-weight: bold;
  line-height: 1.2;
}
.calibre8 {
  font-size: 0.75em;
  line-height: normal;
  vertical-align: super;
}
.tab {
  line-height: 1.2;
  white-space: pre-wrap;
}
.text_ {
  font-style: italic;
  letter-spacing: -0.4pt;
  line-height: 1.2;
}
.text_1 {
  background-color: #FFF;
  font-style: italic;
  line-height: 1.2;
}
.text_2 {
  font-style: italic;
  font-weight: bold;
  line-height: 1.2;
}
.text_3 {
  font-style: italic;
  line-height: 1.2;
  text-decoration: underline solid;
}
.text_4 {
  font-size: 1.33333em;
  line-height: 1.2;
}
.text_5 {
  font-size: 1.33333em;
  font-style: italic;
  line-height: 1.2;
}
.text_6 {
  font-size: 1.5em;
  line-height: 1.2;
}

I do not want to be editing these files if I can help it; I just want to fix the issue in the original Word document.
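
In the stylesheet above, `.block_3`, `.block_13` and `.calibre3` carry `page-break-after: always`, while `.block_9` (the class on the THE BASKET `h1`) only carries `avoid` values. For a longer stylesheet, picking those rules out by eye is tedious, so here is a minimal Python sketch that lists them. `page_break_rules` is a hypothetical helper name, and the regex only handles flat rules like the ones shown (no nested `@media` blocks).

```python
import re


def page_break_rules(css_text):
    """Map each selector to its page-break-* declarations (flat rules only)."""
    rules = {}
    # Split the stylesheet into "selector { body }" pairs.
    for selector, body in re.findall(r"([^{}]+)\{([^{}]*)\}", css_text):
        decls = re.findall(
            r"(page-break-(?:before|after|inside))\s*:\s*([^;]+)", body
        )
        if decls:
            rules[selector.strip()] = [(prop, value.strip()) for prop, value in decls]
    return rules
```

Only rules whose value is `always` (or `left`/`right`) force a break; `avoid` does the opposite, so a class like `.block_9` is not by itself the cause of a new page.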

Last edited by issybird; Today at 07:14 AM.
Old Today, 08:55 AM   #11
theducks
Code:
<h1 class="block_9" id="toc_id_138">THE BASKET</h1>
This is the reason you get page breaks AND TOC entries.
You have set a Page HEADER in your document. This is on every PRINTED PAGE, and you don't want to do that for an EPUB, as there are no 'pages', only screenfuls that vary with the user-selected size and font.
Old Today, 09:28 AM   #12
Jasonc310771
OK, but I do not have any issues with the other story titles, which all use the same formatting method.
Old Today, 10:34 AM   #13
theducks
Are you the author of these Word files? Calibre, by default, breaks on H1 and H2, so something IS slightly different. Note: changing the h1 to a p will not, in this case, change the look, because it is STYLED to look the same.
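
One way to find what is "slightly different" is to list which titles actually end up as `h1` or `h2` in the converted XHTML files and compare them with the titles that do not break. The Python sketch below does that for a single XHTML string. `break_candidates` is a hypothetical helper name, and the idea that a break is applied to exactly these tags is an assumption to check against the Structure detection settings in calibre's conversion dialog.

```python
import xml.etree.ElementTree as ET

# XHTML namespace declared on the <html> element of each file in the EPUB
XHTML = "{http://www.w3.org/1999/xhtml}"


def break_candidates(xhtml_text, tags=("h1", "h2")):
    """Return (tag, text) for every element a break-before-heading rule would hit."""
    root = ET.fromstring(xhtml_text)
    wanted = {XHTML + t for t in tags}
    return [
        (el.tag[len(XHTML):], "".join(el.itertext()).strip())
        for el in root.iter()
        if el.tag in wanted
    ]
```

A title that looks like a heading but is missing from this list was converted to something other than `h1`/`h2` (for example a styled `p`), which would explain why it does not start a new page.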
Old Today, 12:10 PM   #14
Jasonc310771
I am not the author; I am helping them convert the file for Kindle.
Calibre is not breaking at all H1 and H2 headings, only two; all the others are unaffected. The other book I converted did not have this issue, and all the formatting methods used are the same.
I have even tried to copy the format from the unaffected book's title to the book I am working on now, and this did not make any difference. I even cleared the formatting and tried copy/format again to make sure it was changing it, which it did, but the page break still shows at this title and the other one, yet all other titles show fine without a page break.
Old Today, 12:15 PM   #15
Jasonc310771
Quote:
Originally Posted by retiredbiker View Post
In your test file, THE BOOK OF HYMNS has a style of "Heading 1", just like the other headings -- "This is story 1" and so on. So the conversion is treating them just the same.

Give your THE BOOK OF HYMNS line a different, non-heading style in Word and it will no longer appear as a break in the conversion. You can make the style LOOK like anything you want, just get the "heading" type off it.

Word will not show a page break just because of a heading. Page breaks in Word are completely different things than they are in an epub.
If I remove all headings, how can I mark the sections of text as chapters, so that the readers show the titles of each story?