05-18-2024, 09:38 AM | #16 | |
the rook, bossing Never.
Posts: 12,388
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
Proof read with your Deja Vu set to 11? The concordance tool sounds good to detect repetitive writing, which really annoys people if it slips past the proof reading. |
|
05-18-2024, 09:48 AM | #17 | |
the rook, bossing Never.
Posts: 12,388
Karma: 92073397
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
Indeed any search tool is useless unless you know the exact repeating text. [* Unless it has a project / session mode that remembers previous files?] |
|
Advert | |
|
05-18-2024, 10:15 AM | #18 | |
Resident Curmudgeon
Posts: 76,535
Karma: 136565488
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
05-18-2024, 11:00 AM | #19 | |
Perfectionist
Posts: 72
Karma: 12802
Join Date: Apr 2014
Device: none
|
Quote:
And double-clicking a hit gives you short context. And double-clicking the context brings up the whole text, with repetition highlighted. Perfect. Doitsu, thank you so much! See above. |
|
10-14-2024, 11:17 PM | #20 | |
Wizard
Posts: 2,304
Karma: 12587727
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
But like Doitsu pointed out, what you want to look for is called:
So:
You can also use Calibre to temporarily convert your book to a TXT, and then there are plenty of "n-gram" tools out there to try and test out. - - - Side Note: I've written about "List-Based Spellchecking" + n-grams in detail, and have been using this to rip apart + edit books... for over 10 years now. For some of my recent posts, see:
I cover stuff like how I use Spellcheck Lists to catch:
then how I use n-grams to catch repetitious repetitions throughout the books! I also use Regular Expressions to quickly catch/refine/clean up a lot of this repetitious crap too! - - - Side Note #2: I even gave a talk about this last year in the:
- - - Side Note #3: If you're interested, just last week I wrote an "article" on how I use n-grams. This past month, I've been working on (conversion+proofing of) a 450k word beast of an ebook... The author wanted me to copyedit/proofread, so I:
Here's a little sample: - - - - - - - - - - N-grams These show you how many times you "repeat a phrase"/"chunk of words". So a list of "3-grams" would show you every "chunk of 3 words in a row". So if you took:
and ran 3-grams on it, the output would show:
You repeated "an example sentence" twice! When you run this across the entire book, these "repetitive patterns" pop right out! How I Use Them 1. I start with the biggest n-grams first... • Then work my way down. • 6-grams, 5-grams, 4-grams, ... 2. When I find an interesting phrase + high number... • I search the entire book for it. 3. I read the sentence... • Use this to chop/refine! • Fix/reword sentences as needed. 4. Repeat Step 2 in passes. In your case, we can skip the 7-grams and 6-grams (it's mostly just these super-long titles like "Chairman of the Joint Chiefs of Staff"). 5-grams is where we start seeing really interesting patterns. [... It then goes through 5-grams, 4-grams, 3-grams, 2-grams... showing the types of things/patterns that can be found with each. ...] Last edited by Tex2002ans; 10-15-2024 at 12:04 AM. |
|
Advert | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Repeated text after page turn. | Pawel1212 | Onyx Boox | 6 | 01-13-2023 04:09 AM |
Replace repeated item with the number of times it is repeated | 1v4n0 | Sigil | 3 | 04-01-2021 05:41 PM |
Find duplicate books | MOJOJE | Library Management | 1 | 08-13-2020 06:59 PM |
Repeated text pdf to epub conversion | magicman1223 | Conversion | 3 | 04-25-2014 03:02 PM |
Find duplicate books... | silentguy | Calibre | 10 | 12-10-2010 12:03 PM |