07-17-2012, 03:39 PM | #1 |
Member
Posts: 17
Karma: 10
Join Date: Jul 2012
Location: Left Coast
Device: Kindle/generic android tablet
|
Best Pre-Sigil word processor tool/workflow?
Hi,
I've been a lurker for a while now, but after working on several ebook projects using Sigil I have a question for others about best practices and tools for getting an ebook manuscript into shape BEFORE importing it into Sigil. I know it isn't technically a Sigil question, but after struggling with _horrible_ HTML output from MS Word 97 (really, really bad) and Word 2003 (better, but so, so ugly and bloated), I figured there were better workflow options and tools that would help me avoid fixing hundreds of EPUB validation issues with every book. My key questions are: 1) Which word processors can export HTML that is more EPUB/XHTML clean? 2) What is your workflow like? I.e., original manuscript in MS Word (as most start there) to application X to do Y --> then use __ to ___ --> import to Sigil --> save as .epub. |
07-17-2012, 04:38 PM | #2 |
Junior Member
Posts: 2
Karma: 10
Join Date: Dec 2011
Location: Northern California
Device: Kindle Touch
|
Great question! I have it too, so thanks for posting it. I look forward to someone's reply. I just got Scrivener and am still learning what it can do. I know it has EPUB export options, so I wonder if it would fit the bill.
Anyway, thanks again for posting a question, I'm sure a lot of us have. |
07-17-2012, 05:49 PM | #3 | |
Imperfect Perfectionist
Posts: 549
Karma: 863576
Join Date: Dec 2011
Location: Ølstykke, Denmark
Device: none
|
Quote:
- but that's of course just my $0.02, and I'm looking forward to the discussion coming up. Regards, Kim. Last edited by elibrarian; 07-17-2012 at 05:50 PM. Reason: Typos |
|
07-18-2012, 03:55 AM | #4 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
I just do it in Word. I convert the Word document to HTML with my own macro; I do not use Word's save as (filtered) HTML option. I import the result into Sigil and continue there.
I have also run quite a few tests with DocToHTML. It produces rather clean code, and it can convert to XHTML in the same pass. It can also create a stylesheet. Others use OpenOffice with the Writer2Epub add-on. |
07-18-2012, 05:52 AM | #5 | |
Bookmaker & Cat Slave
Posts: 11,495
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
|
Quote:
The bottom line for me is that you need to know regex. At least a little. At the end of the day, I haven't found a single magic bullet that will automagically clean up Word or any other word-processing output.
We very simply clean up the Word file if necessary (we get a lotta, lotta, LOTTA crappy files--I mean, really awful), but mostly we clean the files in HTML, and the tool of choice here is NoteTab Pro. Not NotePad, NoteTab. We extract the HTML and then run a variety of standardized clips to clean it; then we clean up any residual oddities. So, our process is: Word (or other input source)-->HTML-->NoteTab Pro-->Sigil.
From the ePUBs, we have custom Perl scripts and, again, NTP clips that we use to create an inline TOC from the NCX, as well as to make some other mods (usually to the guide), and then we drop the result on Kindle Previewer/KindleGen for MOBI versions.
So, our "magic bullet" is simply to work in a super HTML editor. We do the finalization in Sigil, and any post-production copyedits there as well. That's it. HTH, Hitch |
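The regex-driven cleanup described in this post can be sketched roughly as follows. This is a minimal Python illustration, not the NoteTab Pro clips themselves: the rule list and the function name `clean_word_html` are assumptions made up for the example, and real Word output usually needs many more rules than these.

```python
import re

# Ordered (pattern, replacement) pairs. Each targets one common kind of
# Word-generated clutter. Illustrative only, not an exhaustive rule set.
CLEANUP_RULES = [
    # Drop Office namespace elements such as <o:p>...</o:p>.
    (re.compile(r"<o:p>.*?</o:p>", re.DOTALL | re.IGNORECASE), ""),
    # Drop class attributes whose value starts with "Mso" (quoted or not).
    (re.compile(r'\s+class="?Mso\w*"?', re.IGNORECASE), ""),
    # Drop double-quoted inline style attributes.
    (re.compile(r'\s+style="[^"]*"', re.IGNORECASE), ""),
    # Unwrap spans that no longer carry any attributes.
    (re.compile(r"<span>(.*?)</span>", re.DOTALL | re.IGNORECASE), r"\1"),
    # Remove paragraphs that contain only whitespace or &nbsp;.
    (re.compile(r"<p>(?:\s|&nbsp;)*</p>", re.IGNORECASE), ""),
]


def clean_word_html(html: str) -> str:
    """Apply every cleanup rule once, in order, and return the result."""
    for pattern, replacement in CLEANUP_RULES:
        html = pattern.sub(replacement, html)
    return html


if __name__ == "__main__":
    messy = (
        '<p class="MsoNormal" style="margin:0">'
        '<span style="color:black">Hello</span><o:p></o:p></p>'
        "<p class=MsoNormal>&nbsp;</p>"
    )
    print(clean_word_html(messy))
```

Regexes are not an HTML parser, so a single pass like this will not fully unwrap nested spans, for instance. That matches the workflow in the post: run the standard rules first, then clean up any residual oddities by hand.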
|
07-18-2012, 06:36 AM | #6 |
Imperfect Perfectionist
Posts: 549
Karma: 863576
Join Date: Dec 2011
Location: Ølstykke, Denmark
Device: none
|
Oh well, forgot the question of our workflow, here goes:
(Seems like a lot of work, but there aren't really any shortcuts available if you want to produce output of a reasonable quality ... the longest part of the process is of course the proofreading, and we like reading here.) Regards, Kim |
07-18-2012, 05:25 PM | #7 | |
Bookmaker & Cat Slave
Posts: 11,495
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
|
Quote:
You know me...always trolling the software, like a great white shark, looking for easier ways to do stuff... H. |
|
07-19-2012, 03:32 AM | #8 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
It is rather clean, yes. It retains styles in a separate stylesheet (or an internal one, whatever you want) and has many bells and whistles to tune it to your liking. For example, you can tell it to retain italics in the stylesheet but ignore color usage in the styles.
It is relatively fast and does not rely on Word's internal HTML conversion engine. Some things I like better in my macro, others better in the program. The creator is also very helpful. I requested an option to specify which styles to retain and to convert all others to a standard paragraph, and it is on the to-do list. I found a small bug and it was fixed the next day. One other thing that might interest you is the option to run various regexes after the conversion as part of the process. I use it to convert placeholders at the last stage and also to clean a few things up. If you want, I can convert a document for you so you can see the result. You know how to reach me. |
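A post-conversion regex pass of the kind mentioned here might look like the sketch below. It is written in Python rather than in DocToHTML's own settings, and the placeholder tokens (`[[SCENEBREAK]]`, `[[PAGEBREAK]]`), the CSS class names, and the function name `post_process` are all hypothetical, invented for illustration.

```python
import re

# Hypothetical placeholder tokens typed into the manuscript, each alone in
# its own paragraph, plus one small whitespace cleanup rule.
POST_RULES = [
    (re.compile(r"<p>\s*\[\[SCENEBREAK\]\]\s*</p>"),
     '<p class="scenebreak">* * *</p>'),
    (re.compile(r"<p>\s*\[\[PAGEBREAK\]\]\s*</p>"),
     '<div class="pagebreak"></div>'),
    # Collapse runs of spaces or tabs into a single space.
    (re.compile(r"[ \t]{2,}"), " "),
]


def post_process(xhtml: str):
    """Run each rule in order; return the new text and a hit count per rule."""
    counts = []
    for pattern, replacement in POST_RULES:
        xhtml, hits = pattern.subn(replacement, xhtml)
        counts.append(hits)
    return xhtml, counts


if __name__ == "__main__":
    converted = "<p>One  two</p>\n<p>[[SCENEBREAK]]</p>\n<p> [[PAGEBREAK]] </p>"
    cleaned, counts = post_process(converted)
    print(cleaned)
    print(counts)
```

Returning the hit counts makes it easy to spot a placeholder that was mistyped in the manuscript, since its rule will report zero replacements.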
07-19-2012, 05:50 AM | #9 |
Bookmaker & Cat Slave
Posts: 11,495
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
|
Thanks, Tox! I'll take a look at it.
Hitch |
07-20-2012, 03:37 AM | #10 | |
Fanatic
Posts: 527
Karma: 1048576
Join Date: May 2009
Device: bebook; prs-950; nook simple touch; HTC Jetstream tablet
|
Quote:
Finally I purchased the upgraded FR XIX Fraktur edition (Recognition Server v3) when ABBYY greatly reduced its price. Still pretty expensive but certainly worth the price if one works with 18th/early 19th century books. What a difference! RS3 reduced correction/proofing time by many days per book! |
|
07-20-2012, 04:12 AM | #11 | |
Imperfect Perfectionist
Posts: 549
Karma: 863576
Join Date: Dec 2011
Location: Ølstykke, Denmark
Device: none
|
Quote:
It's funny how the human eye and brain are able to fill in the gaps in such a document, so we can actually get something legible out of it, isn't it? Regards, Kim |
|
07-27-2012, 10:36 AM | #12 |
Author
Posts: 8
Karma: 10
Join Date: Jul 2012
Device: Samsung Galaxy Tab 7+
|
Libre to Sigil to Vendor
I take an MS Word manuscript (or one in any other format) and open it in LibreOffice. All additional content needed for the actual "ebook" is added and positioned, though not the cover and sometimes not other graphics. Styles are cleaned up and streamlined.
Using the "Writer2xhtml" plugin, I export the prepared ebook straight to EPUB format. This is then opened in Sigil for cover addition, tweaking, etc. I have used multiple layout programs, and things like Dreamweaver, Komodo Edit, etc. I own InDesign and a full Adobe suite, and three other packages for layout of traditional publications with export to EPUB as well. I own multiple office suites... The workflow above is the leanest, quickest, cleanest way to produce purely an ebook and nothing else, if that is what you are after. And it's nearly glitch free. I rarely see anything go wrong during validation unless I've personally tried to do something odd. I would not do a pure ebook release any other way, especially since I don't use aggregators and only distribute through direct publishing portals such as KDP, PubIt, WritingLife, etc. And only once has an EPUB run into a problem, due to a slightly non-standard cover image. |
08-03-2012, 01:58 AM | #13 | |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Quote:
I complement Writer2xhtml with Sigil, mainly to add drop caps and a few decorations. Here is the latest example EPUB. |
|
08-03-2012, 10:38 PM | #14 |
Witless protection Agent
Posts: 290
Karma: 1002898
Join Date: Nov 2009
Location: Los Angeles
Device: Kindle
|
Here is something to add to your toolkit: jEdit
This is a free programmer's editor. It is outstanding with text. But the best part is the macro language. You can record temporary macros, and it shows you all the commands it created. You can then use this as the basis for a more permanent macro. You do NOT want to use this to create content, but for sucking in .txt or .html files and cleaning them up or doing regular expression search and replace, it works great. I take text files and re-format the paragraphs, then add <p> tags. Then another macro searches for chapter breaks and converts them to <div id='chapter'> tags. Etc. |
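The two macro steps described in this post (re-flow paragraphs and wrap them in <p> tags, then turn chapter breaks into <div> elements) can be sketched in Python as below. This is not the poster's jEdit macro code. It assumes that blank lines separate paragraphs and that a chapter break is a block consisting only of "Chapter" plus one word; both conventions, and the name `text_to_xhtml`, are assumptions for the example. Because an id must be unique within an XHTML document, the sketch numbers the ids (`chapter-1`, `chapter-2`, ...) and uses a shared class instead of repeating one id.

```python
import re
from html import escape

# Assumed heading convention: a block that is exactly "Chapter <word>".
CHAPTER_RE = re.compile(r"chapter\s+\w+", re.IGNORECASE)


def text_to_xhtml(text: str) -> str:
    """Wrap plain-text paragraphs in <p> and group them into chapter divs."""
    out = []
    chapter = 0
    # Blank lines separate blocks; split() + join re-flows hard-wrapped lines.
    for raw in re.split(r"\n\s*\n", text.strip()):
        block = " ".join(raw.split())
        if not block:
            continue
        if CHAPTER_RE.fullmatch(block):
            if chapter:
                out.append("</div>")  # close the previous chapter
            chapter += 1
            out.append(f'<div class="chapter" id="chapter-{chapter}">')
            out.append(f"<h2>{escape(block, quote=False)}</h2>")
        else:
            out.append(f"<p>{escape(block, quote=False)}</p>")
    if chapter:
        out.append("</div>")  # close the last chapter
    return "\n".join(out)


if __name__ == "__main__":
    sample = (
        "Chapter One\n\nIt was a dark\nand stormy night.\n\n"
        "Chapter Two\n\nTom & Jerry."
    )
    print(text_to_xhtml(sample))
```

The output is a fragment for pasting into a chapter file, not a complete XHTML document; the surrounding <html>, <head>, and <body> would still come from Sigil or a template.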
08-04-2012, 02:25 PM | #15 |
Evangelist
Posts: 416
Karma: 1045911
Join Date: Sep 2011
Location: Cape Town, South Africa
Device: Kindle 3
|
I just use plaintext markup and txt2tags to output XHTML. I have a little wiki-like app for keeping it all together and providing a Kate KPart-based interface. I might release it in the future; it's pretty similar to old Sigil in some ways.
I made a few modifications to the markup and generation to make things nicer. The output is pretty much just thrown into Sigil to be compiled into an EPUB; very little editing needs to be done unless different styles need to be applied.
Tags |
html, tool, word processors |