06-15-2010, 05:35 AM | #31 |
Zealot
Posts: 115
Karma: 150
Join Date: Jul 2008
Location: Netherlands Veenendaal
Device: Palm T5, Sony PRS-505, Nook Color
|
OK, here we go:
Download from http://sourceforge.net/projects/unxutils/files/ the following package: UnxUtils.zip Unpack the UnxUtils.zip where you want it (ex: c:\unixutils), note where it ends up since we need the path to the bin folder and the path to usr\local\wbin Download pdftotext from foolabs.com: ftp://ftp.foolabs.com/pub/xpdf/xpdf-3.02pl4-win32.zip Extract the contents, or pdftotext.exe alone, to the usr\local\wbin folder Open a command prompt type: set PATH=c:\unixutils\bin;c:\unixutils\usr\local\wbin; %PATH% type: sh (this should give you a unix like shell which will run the batch file) type: ls (this should give you a listing of files in your current directory) type: exit You're back at the command prompt of Windows. Copy the attached isbn.bat into the wbin folder and rename it to isbn.zsh, edit the first line to reflect your installation of UnxUtils. (For unknow reasons to me I can't upload isbn.zsh ) Copy your PDF's, if you're not trusting me that they won't be touched , into a temporary folder, no sub folders allowed, for example C:\PDF. In the command prompt type: cd c: cd \pdf type: sh type: isbn.zsh The script will try to find the following order of ISBN's: ISBN-13 ISBN-10 ISBN: ISBN If found it will rename the file and move it into the done folder. Locations can be changed in the script if needed but first leave the defaults since it will make finding problems much easier. After that make sure that Calibre is set to read the isbn number from the filename and start importing. Preferences->Add/Save->Adding Books>Regular expression: (?P<isbn>[0-9].+$) Further I have 'Read metadata only from file name' not checked. That should do the trick, I hope ;-) Regards, Joop |
06-15-2010, 12:02 PM | #32 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
Thanks a lot for your great script and help!
Can't run unixutils it in Win7 X64 (abnormal program termination) So I did it in WinXP in a VMware. I have a very high ISBN output which is great because it is a folder of files which failed all in e-library, but the script often is stopping without any reason and without any output so I interrupt it and start it again, may be it has something to do with the VMware Shared Folder where the files are in? |
Advert | |
|
06-16-2010, 04:01 AM | #33 | ||
Zealot
Posts: 115
Karma: 150
Join Date: Jul 2008
Location: Netherlands Veenendaal
Device: Palm T5, Sony PRS-505, Nook Color
|
Quote:
Quote:
If it has something todo with the shared folder I would suspect that every file failed. Can you show some of the output of the script? Regards, Joop |
||
06-16-2010, 04:35 PM | #34 | |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
Quote:
When I have a lot of time I will try to rewrite your script with windows powershell if it is possible |
|
06-17-2010, 04:12 AM | #35 | ||
Zealot
Posts: 115
Karma: 150
Join Date: Jul 2008
Location: Netherlands Veenendaal
Device: Palm T5, Sony PRS-505, Nook Color
|
Quote:
Its really wierd because if the pdf doesn't contain an ISBN number the script will print the name of the PDF and then process the next one, printing the name and move on. One thing that might cause this is a pdf file which isn't strictly conforming to the specs. We have an application which import a pdf which is first split into single pages and the application which generates this multipage pdf generates an invalid pdf. (MS reporting service) Quote:
Regards, Joop |
||
Advert | |
|
06-18-2010, 04:42 AM | #36 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
I already made a working PowerShell Script....
Just needs Windows PowerShell and the pdftotext.exe The next thing I will figure out is to find ISBN-13, ISBN-10, ISBN in one loop to make it a bit faster. |
06-18-2010, 10:16 PM | #37 |
Junior Member
Posts: 8
Karma: 10
Join Date: Jul 2009
Device: none
|
Still alive?
I'm surprised this thread is still alive. Unfortunately, my ebook collection has grown to the thousands, and I still have many that have sparse metadata. Good luck guys and keep up the good work!
|
06-19-2010, 09:23 AM | #38 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
If someone is interested in a PDF > ISBN Extractor PowerShell Script, let me know.
|
06-20-2010, 08:25 AM | #39 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
Does anybody know if it is possible to limit the lines/pages output txt with ebook-convert?
I can't find a parameter for it. I will try to make ISBN detection for pdf, epub and may be more and use only calibre own commands. |
06-20-2010, 10:29 AM | #40 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
If you are asking about limiting articles and feeds during news fetching (my primary use for ebook-convert) the answer is yes, but not by page or line, only by article, feed, date, etc. |
|
06-21-2010, 03:43 AM | #41 |
Zealot
Posts: 115
Karma: 150
Join Date: Jul 2008
Location: Netherlands Veenendaal
Device: Palm T5, Sony PRS-505, Nook Color
|
|
06-22-2010, 02:06 PM | #42 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
Last edited by chrisix; 06-22-2010 at 02:08 PM. |
06-22-2010, 02:07 PM | #43 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
|
06-22-2010, 05:38 PM | #44 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
PDF ISBN Extractor for Windows PowerShell
Here it is, put pdftotext.exe in same path.
I still have a problem with some situations, but may be you find a better much cleaner way, I already get crazy..... 0.2 > small changes. 0.3 > better results. Last edited by chrisix; 07-07-2010 at 01:02 PM. |
07-01-2010, 06:22 PM | #45 |
Enthusiast
Posts: 31
Karma: 12
Join Date: Jun 2010
Device: iPad
|
8 downloads no comment?
is it usable for anybody? chris |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Kobo future firmware feature request thread | sabredog | Kobo Reader | 2175 | 10-04-2024 08:29 PM |
Extract ISBN from PDF? | mdroberts | Calibre | 14 | 12-16-2016 07:32 AM |
Kobo future Hardware feature request thread | Psyke | Kobo Reader | 1 | 01-07-2011 06:09 PM |
[Old Thread] Calibre 'feature request' thread | Waba | Calibre | 2 | 02-10-2010 07:52 PM |
Feature request thread? | Dahak | Calibre | 1 | 08-02-2009 12:51 AM |