12-17-2022, 05:12 PM | #1 |
Wizard
Posts: 1,363
Karma: 6794938
Join Date: Sep 2021
Location: Australia
Device: Kobo Libra 2
|
Scanning books
Oh wow. I just spotted this... https://imgur.com/gallery/8WvxKsm
I had always assumed the books were torn apart and fed through a sheet feeder. Who has one? |
12-17-2022, 05:53 PM | #2 |
Enthusiast
Posts: 38
Karma: 20000
Join Date: Jun 2010
Device: ipad
|
Seems like it could easily pull pages out of the binding. Here’s an article about how Google does it using cameras and image processing software to undo the distortion.
https://www.npr.org/sections/library...t_7508978.html |
Advert | |
|
12-17-2022, 07:42 PM | #3 | |
Grand Sorcerer
Posts: 6,772
Karma: 26974049
Join Date: Apr 2009
Location: USA
Device: iPhone 15PM, Kindle Scribe, iPad mini 6, PocketBook InkPad Color 3
|
Quote:
|
|
12-17-2022, 08:01 PM | #4 | |
Wizard
Posts: 1,363
Karma: 6794938
Join Date: Sep 2021
Location: Australia
Device: Kobo Libra 2
|
Quote:
But I have captured a screenshot of the short 7 second video that shows the mechanical arm scanning a book page then moving onto the next page to scan... |
|
12-17-2022, 08:25 PM | #5 | |
Grand Sorcerer
Posts: 6,772
Karma: 26974049
Join Date: Apr 2009
Location: USA
Device: iPhone 15PM, Kindle Scribe, iPad mini 6, PocketBook InkPad Color 3
|
Quote:
I bought an overhead scanner, mostly to digitize sheet music. It doesn't actually 'scan', rather captures image with a camera. The material does not need to be perfectly flat, just flat enough so the camera can peer into the crevice of an open book. Then software de-skews the image to restore the proper geometry. It has multiple LED lights to eliminate shadows and they sell a fold-up light box that goes over everything, with interior reflective surface to further even out the light and ensure the best results. You still have to physically turn pages, it will detect page turns automatically, or you can use the supplied foot switch to trigger capture. In theory it will do an entire book in the time it takes you to turn pages. Of course you need to develop some technique. The real work comes after the scanning is done: verify everything looks good, rescan pages that aren't, organizing and tagging the content, and storing the digital files somewhere. I have yet to use it for a real project, but I think it's better than anything out there for under $500. My only worry is that there is no 3rd party software that works with it: https://www.amazon.com/gp/product/B07JMTPJ8S/ I don't think I'll be scanning books with it, but there might be a few exceptions. I haven't even gotten started with the sheet music. I also got a state of the art film scanner. I retired recently and thought I'd have all this time for such long deferred projects. I was wrong. Last edited by tomsem; 12-17-2022 at 08:39 PM. |
|
Advert | |
|
12-17-2022, 10:17 PM | #6 | |
Wizard
Posts: 1,571
Karma: 11380098
Join Date: Aug 2010
Location: NE Oregon
Device: Kobo Sage, Pocketbook Era, Kobo Forma, Kindle Oasis 2
|
Quote:
|
|
12-18-2022, 06:14 AM | #7 | |
the rook, bossing Never.
Posts: 12,378
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
Big official outfits have used page turning and cameras for years and avoid opening the books too far. The camera and lighting part is cheap enough; the page turner is more expensive. Google is breaking copyright but they lobbied to win a US court case on it. They claim to only make a "fair use" amount available online though they store the entire book and OCR for search on servers on the internet. Google and Microsoft supply the Internet Archive with copies. The Internet Archive has also been scanning copyright material for about 20 years. They don't pay for content in their Open Library. |
|
12-19-2022, 06:15 AM | #8 | |
Avid reader
Posts: 868
Karma: 6399168
Join Date: Apr 2009
Location: UK
Device: Samsung Galaxy Z Flip 4 / Kindle Paperwhite / TCL Nxtpaper 14
|
Quote:
Andrew PS This used to be the go to place for book scanning but it looks like the web site is fast succumbing to bit rot: https://www.diybookscanner.org/ Last edited by andyh2000; 12-19-2022 at 06:18 AM. |
|
12-19-2022, 05:56 PM | #9 | |||||
Wizard
Posts: 2,304
Karma: 12587727
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
Destructive is way cheaper + faster, but you lose the book. You cut the binding off + feed it into a sheet feeder. Non-Destructive is more expensive + slower, but the book stays whole. You flip the pages, taking a photo (or scan) of each page. Scanning is slower, like the video you linked to. This will flatten the pages + try to create a very good scan. You don't need fancy lighting setups or anything, because the scanner itself makes sure there's "consistent lighting" across the entire page. Photographing is faster, but you need to do a lot more error-correcting later. (Warped pages, bad lighting, etc.) B&W vs. Color adds a whole other issue. (Color is harder+slower.) - - - There are various cost-levels with each of these categories. What you linked to is probably a very expensive machine, but it means someone doesn't have to be standing there flipping the pages or pushing buttons. But, things can be done at decent quality very cheaply nowadays, especially with that fancy phone in your pocket. See anonlivros's great tutorial: and my post-processing advice to turn it into a PDF->Ebook. He shows you how you could use a:
to help take photographs of your books. Yes, you'll still have to flip the pages, but he gives quite a few tips on how to speed up the process (like using your voice to activate the camera!). - - - Quote:
and, if you want to see them digitizing vinyl albums:
Quote:
This type of lock-in is completely unneeded, especially nowadays. With high-quality cameras in cellphones now, you can replicate a lot of this stuff. (See the anonlivros's tutorial above.) Quote:
To a human, things may look fine... but when you try to convert color photo->B&W, you could get some serious "haloing". I don't have an image on hand of haloing, but you can see a related concept of the previous-page bleeding through to the front in my recent post: In the "Minimize the Colors" section, you can see:
That's one issue where a high-quality scanner may take care of it automatically. It may auto-adjust the lighting to try to capture ONLY the front page, while ignoring the light text bleeding through. While a human can look at the photo + easily "ignore" that, and still be able to read the text... the computer might have a hard time doing OCR or making the photos B&W. (Leading to less accurate text and/or you'll get extremely bloated filesizes.) Quote:
Not much has changed in all those years though. The same problems still exist:
But the surrounding tools have gotten much cheaper/faster/better. Like back then, you needed expensive cameras to get "high enough DPI" for good OCR. Now, a cellphone in your pocket can take just as good (if not better) photos. Phones also have voice-activation, so you don't need those extra buttons/foot-pedals. Phones also have gyroscopes, so you can more easily align your photos so you don't need to do as much trapezoidal correction. Phones have fancy screens, so you don't need to be hooked up to a computer + can instantly see the photos and correct there, etc., etc. Last edited by Tex2002ans; 12-19-2022 at 06:16 PM. |
|||||
12-20-2022, 10:29 AM | #10 |
the rook, bossing Never.
Posts: 12,378
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Excellent very clear tutorial on it!
|
12-20-2022, 11:00 AM | #11 |
Wizard
Posts: 1,016
Karma: 4407864
Join Date: Feb 2009
Location: North Carolina
Device: Kobo Aura1, Nexus7.2, Galaxy Tab A 8.4, Ipad Air & Mini 6
|
"See anonlivros's great tutorial:
2022: "Tutorial-from Paper Book to Ebook PDF - 400 pages in 4 hours" "But you can check it out with the auto-translated version of the link below: https://translate.google.com/transla...ros/anonlivros" I can't get it to display. Google just says it can't translate the page. Any help would be appreciated. |
12-20-2022, 12:20 PM | #12 | |
Avid reader
Posts: 868
Karma: 6399168
Join Date: Apr 2009
Location: UK
Device: Samsung Galaxy Z Flip 4 / Kindle Paperwhite / TCL Nxtpaper 14
|
Quote:
Andrew |
|
12-20-2022, 05:28 PM | #13 | ||
Wizard
Posts: 2,304
Karma: 12587727
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
https://github.com/anonlivros/anonlivros It's in Portuguese, so just run that through whatever translation site you prefer. Quote:
|
||
12-21-2022, 08:38 AM | #14 |
Wizard
Posts: 1,016
Karma: 4407864
Join Date: Feb 2009
Location: North Carolina
Device: Kobo Aura1, Nexus7.2, Galaxy Tab A 8.4, Ipad Air & Mini 6
|
Thanks andyh2000 and Tex2002ans. I see what I can do with translations!
|
12-21-2022, 11:53 PM | #15 |
Guru
Posts: 993
Karma: 12000001
Join Date: Nov 2009
Location: Seattle Wahington U.S.
Device: kindle
|
Almost seems it might be easier to simply re-type it. Depending on how fast you type.
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Scanning in your own books | gazza | News | 125 | 01-24-2016 05:42 PM |
New to Scanning Books | Ceryta | Workshop | 6 | 04-26-2011 07:23 PM |
new to scanning books | sovre | Workshop | 5 | 01-09-2011 07:50 PM |
Scanning books - New need help | Sporadic | Workshop | 9 | 04-19-2009 02:11 PM |
Scanning books | Nate the great | Lounge | 10 | 11-04-2007 02:20 AM |