06-03-2013, 02:01 PM | #1 |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Word > DocToHtml > Sigil
I am surprised there are only 2 very old threads in this forum discussing DocToHtml. I just had a look at their documentation. The tool does everything I need, including a batch mode and command-line support.
I noticed in the example they provide that the generated HTML is pretty clean and could easily be passed through a battery of regex search-and-replace passes. For instance, the tool creates: Code:
<h3><a name="_Analysis_document">A</a>nalysis document</h3>
which a regex could turn into: Code:
<h3 id="_Analysis_document">Analysis document</h3>
If their marketing is correct (hum, hum), I already have a lot of ideas to improve my workflow. Zen. I plan to start the evaluation next Monday. However, the silence in this forum makes me suspicious. Any feedback before I invest time in the evaluation? Thanks. |
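The heading clean-up described in the post above can be sketched as a single regex substitution. This is only an illustration in Python, not something DocToHtml or Sigil provides: the function name `anchors_to_ids` is hypothetical, and the pattern assumes the heading tag carries no attributes and that the `<a name="...">` anchor comes first inside the heading, as in the example.

```python
import re

# Matches a heading whose first child is a named anchor, e.g.
# <h3><a name="_Analysis_document">A</a>nalysis document</h3>
# Assumption: the heading tag itself has no attributes.
ANCHOR_HEADING = re.compile(
    r'<(h[1-6])><a name="([^"]+)">(.*?)</a>(.*?)</\1>',
    re.DOTALL,
)


def anchors_to_ids(html: str) -> str:
    """Move the anchor name onto the heading as an id and drop the <a> wrapper."""
    return ANCHOR_HEADING.sub(r'<\1 id="\2">\3\4</\1>', html)


if __name__ == "__main__":
    sample = '<h3><a name="_Analysis_document">A</a>nalysis document</h3>'
    print(anchors_to_ids(sample))
```

The same pattern and replacement should also work in a regex-capable search-and-replace dialog, provided it supports numbered backreferences.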
06-03-2013, 02:45 PM | #2 | |
Grand Sorcerer
Posts: 27,903
Karma: 198500000
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
|
|
|
06-03-2013, 02:57 PM | #3 | |
Well trained by Cats
Posts: 30,378
Karma: 58053698
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Another way of saying it: just because you use Sigil (or xyz) in your process does not make those forums the best place to pose a question. Your (the OP's) job, besides creating the work, is to determine where things go wrong and to get help from the dedicated forum in keeping things on the straight and narrow. Good luck. |
|
06-03-2013, 03:00 PM | #4 |
Resident Curmudgeon
Posts: 75,917
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
This belongs in the Conversion forum, where there is a thread on the new tool by Toxaris that takes Word's mess and cleans it up for conversion to ePub.
|
06-03-2013, 04:01 PM | #5 |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Oops. Sorry if I picked the wrong forum. I meant there are only 2 threads in the entire MobileRead forum, not just in the Sigil section.
I know Toxaris's tool and started testing it, but I gave up. It doesn't work in its current state, it doesn't implement what I need, and I am not sure I am being understood. Gentle moderator, please move this thread to whatever forum you want. Thanks. |
|
06-03-2013, 04:10 PM | #6 |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Silence and aggressiveness. Interesting.
|
06-03-2013, 05:20 PM | #7 |
Resident Curmudgeon
Posts: 75,917
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
06-03-2013, 05:23 PM | #8 | |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
You are not listening to me, my friend. I said ...
Quote:
|
|
06-03-2013, 05:44 PM | #9 |
Resident Curmudgeon
Posts: 75,917
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
06-03-2013, 05:55 PM | #10 |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
|
06-03-2013, 06:03 PM | #11 |
mostly an observer
Posts: 1,515
Karma: 987654
Join Date: Dec 2012
Device: Kindle
|
I use word2cleanhtml.com to accomplish this, and I welcome seeing discussions of this problem in the Sigil forum. I had no idea they were frowned on!
|
06-03-2013, 06:10 PM | #12 |
A Hairy Wizard
Posts: 3,186
Karma: 18843349
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
Perhaps you can write your own set of macros that do what you want, how you want it?
There is no need to use some other regex search-and-replace program before Sigil; Sigil has it built in. If you don't know how to write macros, you can save common regexes in Sigil, so you don't need to write a macro at all. Wolf actually had a very good suggestion for you if you are unwilling or unable to use the recommended Toxaris plugin. If you are asking for our feedback or recommendations, then the silence IS feedback of a sort: people have nothing good to say about it, either because it's not a good tool or because there are better methods and we don't use doc2html. But by all means, try out the doc2html program. Let us know how it works for you. |
06-03-2013, 06:20 PM | #13 | |
A Hairy Wizard
Posts: 3,186
Karma: 18843349
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
Quote:
There is no need to use calibre to convert to ePub first; just add the HTML file directly to Sigil. It will create the ePub without your needing to clean up all the calibre mess. So:
1) Use Toxaris' plugin to clean up the document and save it as HTML (or ePub).
2) Open the resulting file in Sigil and make final corrections (s/r, regex).
Or:
1) Save As filtered HTML.
2) Open the resulting file in Sigil and make final corrections (s/r, regex).
Cheers! |
|
06-03-2013, 06:29 PM | #14 | |
eBook DIYer
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
I know word2cleanhtml.com; it's the best solution I have found so far, apart from doing everything in Sigil. It is a shame it can't be run from a command file.
Quote:
Anyway, I like this kind of acid environment. It makes me stronger. |
|
06-03-2013, 06:30 PM | #15 | |
Guru
Posts: 631
Karma: 7544080
Join Date: Apr 2013
Location: Berlin
Device: PRS 350, Kobo Aura
|
Quote:
If you have a good Word document (one with styles), this will produce a very clean ePub. You may want to run a few regexes over it, for example to get rid of MsoNormal. If you always use consistent styles in Word, you can reuse your ePub stylesheet for each converted document. If you have footnotes or other more complicated things, you will want to "fix" them with regexes as well. They do work, but could be a little bit prettier. Just remember: use styles in Word. |
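As a rough illustration of the kind of regex meant above, here is a minimal Python sketch that strips the default Word paragraph class (Word writes it as `MsoNormal`, with or without quotes) from the exported HTML. The function name `strip_mso_normal` is hypothetical, and the pattern assumes the class attribute holds that single value only.

```python
import re

# Matches class="MsoNormal" or class=MsoNormal (quoted or unquoted).
# The lookahead keeps longer names such as MsoNormalTable untouched.
# Assumption: the attribute contains only this one class name.
MSO_NORMAL = re.compile(r'\s+class="?MsoNormal"?(?=[\s>/])', re.IGNORECASE)


def strip_mso_normal(html: str) -> str:
    """Remove the default Word paragraph class from every tag."""
    return MSO_NORMAL.sub("", html)


if __name__ == "__main__":
    print(strip_mso_normal('<p class="MsoNormal">Some text</p>'))
    print(strip_mso_normal('<p class=MsoNormal>Some text</p>'))
```

Paragraphs that carry a real named style keep their class, so a reusable stylesheet can still target them.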
|
|