Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 07-08-2020, 02:10 AM   #1
NeHe
Junior Member
NeHe began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Jul 2020
Device: Android Tablet
Searching within books, is it possible

Hi there. I am a new user, and have a Calibre library including several hundred pdfs of scanned books (textbooks from my University days and a large cookbook collection) converted to PDF and OCRed so they are indexed and individually searchable.

Is there a plugin or other companion app that would be able to search content within a book? So for example I could search for "banana bread" and find which cookbooks had a banana bread recipe in them.

There are desktop search tools (Recoll for example) that I could point at the library directory, but I was wondering if there is something more integrated that people have used?

Thanks.

--Neil.
NeHe is offline   Reply With Quote
Old 07-08-2020, 04:05 PM   #2
capink
Wizard
capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.capink ought to be getting tired of karma fortunes by now.
 
Posts: 1,139
Karma: 1954142
Join Date: Aug 2015
Device: Kindle
There is a plugin called Quality Check that allows you to search within books for words. You can search the whole library or narrow the scope to selected books only. It only works for books with epub format.

edit: there is an abandoned a recoll plugin that I never used and I don't think it works on recent calibre versions.
capink is offline   Reply With Quote
Old 07-08-2020, 05:12 PM   #3
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 76,491
Karma: 136564766
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quality Check will not search PDF. I think QC only is able to search ePub.
JSWolf is offline   Reply With Quote
Old 07-08-2020, 05:13 PM   #4
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 76,491
Karma: 136564766
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by NeHe View Post
Hi there. I am a new user, and have a Calibre library including several hundred pdfs of scanned books (textbooks from my University days and a large cookbook collection) converted to PDF and OCRed so they are indexed and individually searchable.

Is there a plugin or other companion app that would be able to search content within a book? So for example I could search for "banana bread" and find which cookbooks had a banana bread recipe in them.

There are desktop search tools (Recoll for example) that I could point at the library directory, but I was wondering if there is something more integrated that people have used?

Thanks.

--Neil.
Sorry, there are no plugins that will search the text layer of PDF.
JSWolf is offline   Reply With Quote
Old 07-08-2020, 05:27 PM   #5
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 9,575
Karma: 64960983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
If you can find an external program to search the PDFs (I use Agent Ransack, but it's a bit expensive), you can use the Drop Search Results plugin to drag and drop the results in and have them display in Calibre. It's not really the integrated solution you were looking for but it's the best we have for now.

Last edited by ownedbycats; 07-08-2020 at 05:30 PM.
ownedbycats is online now   Reply With Quote
Old 07-08-2020, 07:32 PM   #6
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by ownedbycats View Post
If you can find an external program to search the PDFs (I use Agent Ransack, but it's a bit expensive), you can use the Drop Search Results plugin to drag and drop the results in and have them display in Calibre. It's not really the integrated solution you were looking for but it's the best we have for now.
Agent Ransack/File Locator LITE is free, it's maybe good enough for some. AFAIK the MythicSoft products are only available for MS Windows.

But, IMO the Windows 10 Search feature is good enough for most people, especially if they use the Advanced Query Syntax

An ePub iFilter for Windows is available, there's a link in the Useful tools sticky in the Related Tools sub-forum.

Quote:
Originally Posted by NeHe View Post
Hi there. I am a new user, and have a Calibre library including several hundred pdfs of scanned books (textbooks from my University days and a large cookbook collection) converted to PDF and OCRed so they are indexed and individually searchable.

Is there a plugin or other companion app that would be able to search content within a book? So for example I could search for "banana bread" and find which cookbooks had a banana bread recipe in them.

There are desktop search tools (Recoll for example) that I could point at the library directory, but I was wondering if there is something more integrated that people have used?

Thanks.

--Neil.
Since you mention Recoll I assume you're using Linux

There is a Recoll plugin, see Index of plugins. But I'm pretty sure it only works on very old versions of calibre, and that the originator has abandoned it. You could modify it for your own use.

Or do as you suggest, use Recoll, or another Linux full text utilities directly. I'm not sure which Linux search utilities work with the Drop Search Results; have a look in the DSR thread.

Full text search is on Kovid's longer term to do list, he's mentioned using Lucene as the search engine.

BR
BetterRed is offline   Reply With Quote
Old 07-08-2020, 08:05 PM   #7
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 9,575
Karma: 64960983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
Quote:
Originally Posted by BetterRed View Post
Agent Ransack/File Locator LITE is free, it's maybe good enough for some. AFAIK the MythicSoft products are only available for MS Windows.
From what I recall epub/pdf searching is only available in the paid version.
ownedbycats is online now   Reply With Quote
Old 07-08-2020, 08:35 PM   #8
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Win 10 Search with the free ePub and PDF iFilters installed will search the content of ePubs and PDFs.

I'm not criticising Mythicsofts products, I've been using FileLocator Pro for years. But I find Windows 10 Search and AQS more than good enough for most searches, and for some searches better than File Locator… or X1. Its generally a tad faster, and it's free.

BR
BetterRed is offline   Reply With Quote
Old 07-08-2020, 08:56 PM   #9
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 9,575
Karma: 64960983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
Personally I had way too many annoyances with the Windows search indexer (runaway CPU usage! even after rebuilding the index and restarting the service!) that I just disabled it entirely. I use Everything + Agent Ransack most of the time these days. Sometimes NirSoft's SearchMyFiles.
ownedbycats is online now   Reply With Quote
Old 07-08-2020, 09:27 PM   #10
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,006
Karma: 27620706
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Yeah, I had those problems on XP and to a lesser extent on Win 7, that's why I have File Locator Pro and X1. But I've not had them on Win 10, maybe because I only index my data drives and not my system drive. To search for folders and files based on their names I use the xplorer² (file manager) Find tool.

Oh, and I do content searches from File Explorer or xplorer² - not the Winkey+S applet.

BR

Last edited by BetterRed; 07-08-2020 at 09:42 PM.
BetterRed is offline   Reply With Quote
Reply

Tags
content, indexing, library, searching


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Searching Books Samurai Calibre 2 07-24-2017 07:51 AM
Need help in searching books. Aaryan.25 Introduce Yourself 2 11-06-2016 08:51 PM
Searching books crutledge Upload Help 1 04-06-2015 09:10 AM
Books Metadata Sync and Searching Books gavinjb Kobo Reader 7 05-29-2014 11:58 AM
searching for books on the go curious Library Management 10 07-23-2013 03:18 PM


All times are GMT -4. The time now is 04:52 AM.


MobileRead.com is a privately owned, operated and funded community.