11-20-2008, 02:27 AM | #31 |
Member
Posts: 15
Karma: 10
Join Date: Oct 2008
Device: PRS 500
|
Very impressive!
This kind of design style is just what I dreamed: I press one button and you do all the work!!!!! From algorithm's point of view, I think the most interesting part is the algorithm that detects blocks of contents from the pdf layout. If the program can incorporate the algorithm that breaks lines, it will be able to break 1-page wide blocks to 1/2-page wide block. The program will be a star to handle academic papers then (what a desperate target?). There may be many small things need to be tweaked, but the major function (handle sing-column and double-column) is there. And, please, please, please keep the current operation style. I understand that you experts like commend line a lot. But this types of clean and intuitive UI is very important to the newbies (Do I need some HCI refences to support this argument?). At last, a small thing I think may make the output looks better: when segmenting the small block to fit into the 6'' pages, you may want to detect whether a cut may cross some content or not. If it does, we may need to move the cut a little bit higher. This is only a small problem. The current tool is fantastic at this stage. |
11-20-2008, 02:28 AM | #32 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2008
Device: Hanlin V3
|
|
Advert | |
|
11-20-2008, 07:35 AM | #33 |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
|
11-20-2008, 08:47 AM | #34 |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Taesoo Kwon:
As a challenge, this is the first .pdf (see attachment below) I tried to convert to ebook (.html and images) that I was not successful in getting anywhere near a satisfactory result. I tried converting it with early version of PDFRead and it did work quite well for me. That convinced me that .pdf to images was the only solution for complex .pdfs. It made me then request that color support be added to PDFRead v1.7; and that's why I'm involved with PDFRead v1.8! I just tried to get two column output using PaperCrop 0.3 and this .pdf, but wasn't able to. What algorithm would you suggest in getting this .pdf converted using a combination of reflow and/or two-column support? Like I said before, a challenge! |
11-20-2008, 10:01 PM | #35 | |
Enthusiast
Posts: 27
Karma: 163
Join Date: Nov 2008
Device: Kobo wifi
|
CAA.pdf
Quote:
Of course, it is easy to segment the pdf file in a PDFRead-way (simply dividing a page into two regions.) I would include this over-simplified but robust segmentation method into PaperCrop as an option someday. Last edited by Taesoo Kwon; 11-20-2008 at 10:12 PM. |
|
Advert | |
|
11-20-2008, 10:15 PM | #36 |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Yep, it is! Given that that .pdf is difficult to segment, I hope it doesn't discourage you. I was only trying to see if you may have had any insights on how best to tackle it's conversion.
Your program couldn't figure it out because it's not two-column but behaves like it is and reflow get confused with large blocks of center inset images. I guess we'll leave it until version 0.4 ... Thanks for trying! |
11-23-2008, 03:06 PM | #37 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2008
Device: Hanlin V3
|
Where are the pictures stored? I mean I don't see anything in a folder which is created together with PDF. Is there a way to get those pictures?
|
11-24-2008, 12:37 PM | #38 | |
useR!
Posts: 299
Karma: 651
Join Date: Nov 2007
Location: NY
Device: Onyx Boox Max 2, Kobo Libra H2O, iRiver Story HD
|
Quote:
The option 'output_to_pdf' should be changed to 'false' as follows. Code:
output_to_pdf=false |
|
12-01-2008, 04:22 AM | #39 |
Zealot
Posts: 112
Karma: 113786
Join Date: Jul 2008
Location: Germany
Device: Sony PRS-T3S, CoolReader on 4'' Android phone
|
withdrawn
Last edited by hansl; 12-18-2008 at 05:49 AM. |
12-08-2008, 05:10 AM | #40 |
Zealot
Posts: 112
Karma: 113786
Join Date: Jul 2008
Location: Germany
Device: Sony PRS-T3S, CoolReader on 4'' Android phone
|
withdrawn
Last edited by hansl; 12-18-2008 at 05:49 AM. |
12-13-2008, 12:52 PM | #41 |
Enthusiast
Posts: 27
Karma: 163
Join Date: Nov 2008
Device: Kobo wifi
|
Sorry for a late response.
Hansl, could you please send me the PDF file to me? (taesoobear@gmail.com) I may not be able to fix the bug soon but I will try when I have time. |
12-18-2008, 05:48 AM | #42 |
Zealot
Posts: 112
Karma: 113786
Join Date: Jul 2008
Location: Germany
Device: Sony PRS-T3S, CoolReader on 4'' Android phone
|
Sorry, I removed the generated pdf.
I'm too busy currently to deal with the issue, but thanks for your reply. hansl |
12-22-2008, 03:08 PM | #43 |
Wizard
Posts: 3,671
Karma: 12205348
Join Date: Mar 2008
Device: Galaxy S, Nook w/CM7
|
I have a PDf with 300DPI images. I'd like to crop the PDF but keep the 300DPI. However PaperCrop is converting the image to 72DPI. Is there a setting that I can set to keep the DPI the same resolution as the original PDF (e.g. 300DPI) ?
Thank you, =X= |
12-26-2008, 09:24 AM | #44 |
Enthusiast
Posts: 27
Karma: 163
Join Date: Nov 2008
Device: Kobo wifi
|
|
01-08-2009, 01:47 AM | #45 |
Wizard
Posts: 3,671
Karma: 12205348
Join Date: Mar 2008
Device: Galaxy S, Nook w/CM7
|
I'm embarrassed to ask, but how do I get Caritas' Reflow to work. And how can I validate that it works.
=X= |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Q: multi-column PDF to single column mobi format converstion | auburn1975 | Calibre | 7 | 01-28-2012 06:11 PM |
eBook PDF - free tool for creating PDF eBooks from text files | KACartlidge | 6 | 01-04-2012 09:41 AM | |
Multi column sort? | nexus100 | Calibre | 1 | 07-11-2010 11:19 PM |
Multi-column articles in PDF | tdido | OpenInkpot | 7 | 06-30-2009 11:13 AM |