MobileRead Forums > E-Book Formats > Other formats > IMP

Old 08-30-2010, 05:10 PM   #61
nrapallo
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by Westlyn View Post
I find that any .IMP larger than about 100 KB takes an ever-increasing time to convert: 400 KB takes about 20 minutes, 1500 KB takes about 4 hours, and a 2500 KB IMP hadn't produced HTML after 7 or 8 hours. So there seems to be an exponential-like increase in processing time as the file size increases.

IMP_DUMP seems much less affected by size: the same 1500 KB file takes less than 30 seconds to produce the .txt file.
IMP_DUMP just decompresses the text that was compressed as part of the .imp build process. It basically is a decompressor written in C and is extremely fast since it does no "processing" on THAT text.

I too find the writing of this HTML file extremely slow, but it is a more complex process than just decompressing the text. It has to look up the 'styles' used for each span of text and insert the proper paragraph and new-page breaks. There are even "special code" spots that deal with images, tables, etc.

Perhaps Michael can review the code for efficiency and try to curb the exponential growth in time relative to file size.
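
As a rough check on how the run time scales, the two timings Westlyn reports can be fitted to a power law. This is only a sketch from two data points, and it assumes the run time follows t = c * n**k, which nothing in the thread confirms. Under that assumption the exponent comes out a little under 2, so the growth looks closer to quadratic than truly exponential, and the 2500 KB file would be expected to need roughly ten hours.

```python
import math

# Timings reported in the quoted post: file size in KB, run time in minutes.
small_kb, small_min = 400, 20
large_kb, large_min = 1500, 4 * 60

# Assumed model: t = c * n**k. Two points give k = log(t2/t1) / log(n2/n1).
k = math.log(large_min / small_min) / math.log(large_kb / small_kb)

# Extrapolate the same power law to the 2500 KB file that never finished.
predicted_min = large_min * (2500 / large_kb) ** k

print(f"estimated exponent k = {k:.2f}")
print(f"predicted time for 2500 KB = {predicted_min / 60:.1f} hours")
```

An exponent near 2 is what one would typically see if some per-span step rescans or recopies everything processed so far, but that is a guess about the cause, not something the measurements prove.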

Quote:
The file that takes 4 hours also generates a .html file where the text is unreadable but the imp_dump .txt file is readable

eg Output html in browser from ConvertIMPGUI:

Code:
„ ™ P) M a o “ % 
…s C i’ +' · 

è’ 7 < F q F k •s ™¹, s Á™ ™ K ˆ 

7t 7 ™ ™ K r ¡ ˜ sf aÁ™ ]4c q ‘{ Í£ ±c’ F\2 ˜ { ^ cv  › K
The imp_dump output, however, is readable, although it uses Unix rather than Windows line endings. I have marked the line-end characters as "LF" below.

Code:
"LF"PROLOGUE "LF"	"LF"I have a story to tell you. It has many beginnings, and perhaps one ending. Perhaps not. Beginnings and endings are contingent things anyway; inventions, devices. Where does any story really begin? There is always context, always an encompassingly greater epic, always something before the described events, unless we are to start every story with, 'BANG! Expand! Sssss . . .', then itemise the whole subsequent history of the universe before settling down, at last, to the particular tale in question. Similarly, no ending is final, unless it is the end of all things . . . "LF"Nevertheless
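
If the Unix line endings are a nuisance on Windows, they can be converted after the fact. The following is a minimal sketch, not part of imp_dump; the function name `lf_to_crlf` and the file names in the usage note are made up for illustration.

```python
def lf_to_crlf(data: bytes) -> bytes:
    """Convert Unix (LF) line endings to Windows (CRLF).

    Existing CRLF pairs are first normalised to LF so they are not doubled.
    """
    return data.replace(b"\r\n", b"\n").replace(b"\n", b"\r\n")


if __name__ == "__main__":
    sample = b"PROLOGUE\nI have a story to tell you.\n"
    print(lf_to_crlf(sample))
```

For a whole file, something like `Path("book_crlf.txt").write_bytes(lf_to_crlf(Path("book.txt").read_bytes()))` with `pathlib.Path` would do. Working on bytes leaves the text encoding untouched.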
I must be being dense today but I couldn't see how to attach a file to a private email in this forum. So bear with me until I find out how.
You can't, so stop looking... Best to send Michael a private message asking him for his email or better still provide him with your off-site email and then you can make a "connection" for further email attachments...

Quote:
I'll send you a zip file with the .imp and the .html and .txt output from ConvertIMPGUI and imp_dump. Hopefully that will make it easier to track down the issue. I'm assuming the massive performance hit may be related to the output formatting issue.

Thanks again for being willing to take a look
I'm interested in this too!
Old 09-03-2010, 12:21 PM   #62
mscott161
Enthusiast
Posts: 42
Karma: 370
Join Date: Dec 2008
Device: ebookwise, sony
Is it possible that the IMP file is also encrypted? ConvertIMP does not handle encrypted IMP files and would probably have a tough time with one. I can add the code for it if I have one of your IMP books to test with.
Michael
Old 09-07-2010, 06:50 AM   #63
Westlyn
Enthusiast
Posts: 33
Karma: 14
Join Date: Jul 2010
Device: Windows Mobile and Android
Attachment 57817
Quote:
Originally Posted by mscott161 View Post
Is it possible that the IMP file is also encrypted? ConvertIMP does not handle encrypted IMP files and would probably have a tough time with one. I can add the code for it if I have one of your IMP books to test with.
Michael
I'm pretty sure that the IMP is not encrypted, not least because I don't think DeIMP can handle encrypted files, and in any case I'm not providing any key when de-IMPing.

I have attached an encrypted zip file with the relevant files inside. The password has been sent by private message.
Attached Files
File Type: zip IMPand outputs.zip (1.66 MB, 1129 views)

Last edited by Westlyn; 09-07-2010 at 06:55 AM. Reason: zip may have been corrupt
Old 09-10-2010, 01:13 PM   #64
mscott161
Enthusiast
Westlyn,

I found the problem. The book is uncompressed. I have fixed the code and updated the attachments and the download link in the first post of this thread.

Michael
Old 01-02-2011, 01:39 PM   #65
Susant1
Junior Member
Posts: 1
Karma: 10
Join Date: Jan 2011
Device: eBookwise
Help please?

Quote:
Originally Posted by mscott161 View Post
Westlyn,

I found the problem. The book is uncompressed. I have fixed the code and updated the attachments and the download link in the first post of this thread.

Michael
I'm new here and have read the thread from the beginning. I installed the executable as suggested by nrapallo on 11-26-2009, but when I try to open it I get a box saying "Application failed to initialize properly (0xc0000135)". I'm running WinXP SP2. Any suggestions would be greatly appreciated. TIA, Susan
Old 05-23-2011, 01:46 AM   #66
Hat8ee
Junior Member
Posts: 1
Karma: 10
Join Date: May 2011
Device: Nook Color
Thanks

Thanks so much for all your work.
Old 11-06-2012, 11:07 PM   #67
GrannyGrump
Obsessively Dedicated...
Posts: 3,200
Karma: 34978132
Join Date: May 2011
Location: JAPAN (US expatriate)
Device: Sony PRS-T2, ADE on PC
NO HTML EXTRACTION AVAILABLE

I am throwing this question out into cyberspace, with little hope, as it looks like the IMP format has been mostly abandoned.

A few titles in the library that I want are only in IMP format. So I tried this converter, but with mixed results.

I "installed" IMP GUI Converter v 1.36 as instructed --- mscott161 stated that all that is required is the ConvertIMPGUI executable and the ICSharpCode.SharpZipLib DLL. I launched the GUI and loaded a book into it. I CAN get the text extraction, but the HTML tool on the menu is grayed out / disabled.

The text extraction pulled out the text well enough, but apparently cannot deal with Unicode characters: there are lots of question marks/null glyphs in place of curly quotes, em dashes, diacritics, etc. I don't know if the HTML extraction would give better results, because it is disabled and not usable.

Any suggestions how to get the HTML extraction working?

Last edited by GrannyGrump; 11-06-2012 at 11:15 PM.
Old 11-07-2012, 06:47 PM   #68
nrapallo
GuteBook/Mobi2IMP Creator
Quote:
Originally Posted by GrannyGrump View Post
I am throwing this question out into cyberspace, with little hope, as it looks like the IMP format has been mostly abandoned.
I rarely use the .imp format anymore, but still am willing to dabble with it....

Quote:
A few titles in the library that I want are only in IMP format. So I tried this converter, but with mixed results.
Which ones in particular? Early on, there were two distinct ways to produce .imp files, one using compressed text (the norm) and one using just images for each page (this doesn't have anything to extract/convert).

Perhaps, I can have a go at converting it, if you give me a link to the ebook...

Quote:
I "installed" IMP GUI Converter v 1.36 as instructed --- mscott161 stated that all that is required is the ConvertIMPGUI executable and the ICSharpCode.SharpZipLib DLL. I launched the GUI and loaded a book into it. I CAN get the text extraction, but the HTML tool on the menu is grayed out / disabled.
Best to leave the .exe and its associated files in the Debug directory under the bin folder; otherwise, you may encounter problems like the one you are seeing.

Quote:
The text extraction pulled out the text well enough, but apparently cannot deal with Unicode characters: there are lots of question marks/null glyphs in place of curly quotes, em dashes, diacritics, etc. I don't know if the HTML extraction would give better results, because it is disabled and not usable.
My imp_dump tool just extracts the raw text as well. If you are adventurous, try the CPAN EBook-Tools imp extraction discussed in this thread.

Quote:
Any suggestions how to get the HTML extraction working?
I don't think HTML extraction was working 100% to begin with. We never got that far...
Old 11-11-2012, 06:59 AM   #69
GrannyGrump
Obsessively Dedicated...
Hi nrapallo, so glad to get your reply!
Sorry to be so late coming back.

Just as background, all I did was download the zip file for the IMP GUI Converter, and extracted that entire folder to my drive, then drilled down to the Debug / bin folder and launched the executable. It runs, I just don't know if I need to do anything else to get the html extraction to work.

One of the files I was trying was Zelda Pinwheel's "Jumping Frog" by Mark Twain, because I am doing that book in ePub, and wanted to see if she had restored the diacritics (I don't speak French). This is the link .

I tried a couple of other IMP files as well, but can't remember titles just now.

I also googled unsuccessfully for a reader app that will display IMP files on my computer.

All I want to do is extract a reasonably clean text or html file that I can convert to ePub. Did I misunderstand what this tool does?

I will give a try to the other extraction tool you linked to, and see how that goes.

Thanks for the advice!
Old 11-12-2012, 06:03 PM   #70
nrapallo
GuteBook/Mobi2IMP Creator
Quote:
Originally Posted by GrannyGrump View Post
Hi nrapallo, so glad to get your reply!
Sorry to be so late coming back.
No worries...

Quote:
Just as background, all I did was download the zip file for the IMP GUI Converter, and extracted that entire folder to my drive, then drilled down to the Debug / bin folder and launched the executable. It runs, I just don't know if I need to do anything else to get the html extraction to work.
Using WinXP, when I download the latest ConvertIMPGUI.zip I can produce the .txt, .html and even the .lrf conversions. I attach same below for your perusal.

Quote:
One of the files I was trying was Zelda Pinwheel's "Jumping Frog" by Mark Twain, because I am doing that book in ePub, and wanted to see if she had restored the diacritics (I don't speak French). This is the link .
Ouch, diacritics support IS very limited or even non-existent!!!

Quote:
I tried a couple of other IMP files as well, but can't remember titles just now.

I also googled unsuccessfully for a reader app that will display IMP files on my computer.
You can get an imp_viewer for your PC when you install the eBook-Publisher software available from here. It even allows you to print the .imp to a printer or to .pdf file using a PDF printer driver like PrimoPDF (it's free).

Quote:
All I want to do is extract a reasonably clean text or html file that I can convert to ePub. Did I misunderstand what this tool does?
I usually click the tabs in order, i.e. General, Book Properties, and then Text Content. Once there, I click the Save Text button first, then the Save Html button. The latter launches a browser to display the results, but saves the .html anyway.

Quote:
I will give a try to the other extraction tool you linked to, and see how that goes.
I used EBook-Tools v0.4.6 to convert the .imp to .html and the results were quite acceptable. I did notice that the diacritics didn't translate well, and discovered that the .imp stores them in the "MacRoman" (Mac OS) text encoding! I knew the original SoftBook (like my REB1200) used the Mac OS as the base for its GUI, and now I know it also used it for the text encoding. That was news to me! [BTW, I used 'iconv' to convert from MacRoman text to ASCII text.]
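
For anyone without iconv to hand, the same re-encoding can be sketched in Python, whose standard library ships a `mac_roman` codec. This is a minimal illustration, not part of any of the tools in this thread: the function name is made up, and it converts to UTF-8 rather than ASCII so that curly quotes and accented letters survive instead of being replaced.

```python
def macroman_to_utf8(raw: bytes) -> bytes:
    """Re-encode MacRoman bytes (as found in extracted .imp text) as UTF-8."""
    text = raw.decode("mac_roman")  # every byte value has a mapping in this codec
    return text.encode("utf-8")


if __name__ == "__main__":
    # 0xD2 / 0xD3 are curly double quotes and 0x8E is e-acute in MacRoman.
    sample = b"\xd2caf\x8e\xd3"
    print(macroman_to_utf8(sample).decode("utf-8"))
```

To convert a raw imp_dump .txt file, read it with `open(path, "rb")`, pass the bytes through the function, and write the result back out in binary mode.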

Anyway, I attach the various conversions using ConvertIMPGUI, Ebook-Tools, imp_dump and PrimoPDF/imp_viewer!

Quote:
Thanks for the advice!
No problem, this was a worthwhile exercise for me.

Oh, and the .html source (now attached as "frog-html.zip" below) used for Zelda's .imp version has already been converted to .epub and .pdf at Feedbooks.com here! Just use these instead of converting...

Enjoy!
Attached Files
File Type: zip ConvertIMPGUI.v.1.3.6a-imp-conversion.zip (78.4 KB, 950 views)
File Type: zip EBook-Tools-0.4.6-imp-conversion.zip (61.7 KB, 912 views)
File Type: zip impviewer-print-to-pdf-imp-conversion.zip (1,002.1 KB, 893 views)
File Type: zip Imp_dump_v0.1_windows_executable-imp-conversion.zip (46.9 KB, 875 views)
File Type: zip frog-html.zip (17.4 KB, 861 views)

Last edited by nrapallo; 11-13-2012 at 10:43 PM. Reason: attached original source .html as retrieved by WayBackMachine at archive.org
Old 11-28-2012, 10:17 AM   #71
GrannyGrump
Obsessively Dedicated...
I am SO sorry for the belated response, I didn't see a notification of another reply.

Thank you so much for doing these conversions, and maybe you gave me a boost by the file name with v 1.3.6 A. I will be trying this out again with that version, and keeping my fingers crossed.

And thank you for the FeedBooks link. I had downloaded books from there quite some time ago, and they all seemed to be in plain-text, a la Gutenberg. Who knew they have nicely formatted books... (would be nice if they gave a hint how the book looks)

Thank you for the time and trouble, Santa better be generous with you for your good deeds.

Old 04-04-2015, 07:10 PM   #72
svru
Junior Member
Posts: 1
Karma: 10
Join Date: Jan 2013
Device: PocketBook Touch 622
German Umlauts

Hello mscott161,

Your great tool "Convert IMP" doesn't work with German umlauts! That's very sad for my German eBooks. I am German, so sorry for my possibly bad English.

Would it be possible to add support for them?
I can find no way to use German IMP files in Calibre.

Instead of the German umlauts, Calibre shows arbitrary special characters.


Sven
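
A possible workaround, assuming the umlaut problem has the same cause as the MacRoman encoding reported in post #70: if the extracted text was MacRoman but got interpreted as Windows-1252 somewhere along the way, the damage can often be undone by reversing that step. Both the function name and the choice of `cp1252` as the wrong codec are assumptions for illustration; the thread does not say which encoding Calibre or the converter actually applied.

```python
def repair_mojibake(garbled: str, wrong_codec: str = "cp1252") -> str:
    """Undo a wrong decode: turn the text back into the original bytes,
    then decode those bytes as MacRoman."""
    return garbled.encode(wrong_codec).decode("mac_roman")


if __name__ == "__main__":
    # MacRoman bytes for the German word "Gruesse" spelled with u-umlaut and sharp s.
    original_bytes = b"Gr\x9f\xa7e"
    garbled = original_bytes.decode("cp1252")   # what a wrong decode would display
    print(garbled, "->", repair_mojibake(garbled))
```

This only works when the wrong decode was lossless. Characters that were already replaced by question marks cannot be recovered this way, so it is better to re-extract from the .imp and decode the raw bytes as MacRoman directly.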