03-18-2011, 02:18 PM | #1 |
Connoisseur
Posts: 99
Karma: 170
Join Date: Nov 2010
Location: Airdrie Alberta
Device: Sony 650
|
Trouble removing span class
This is in my recipe
remove_tags = [dict(name='span', attrs={'class':'articledate'})] Below is the relevant piece of html of one of the pages. Why does this not remove the dates ? Thanks Spoiler:
|
03-18-2011, 02:21 PM | #2 |
creator of calibre
Posts: 44,380
Karma: 23766374
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Because remove_tags does not apply to calibre generated content
|
Advert | |
|
03-18-2011, 02:22 PM | #3 |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
You have quoted and are looking at the output html, but remove_tags operates on the input html. They can be significantly different. You have to match the html of the input.
|
03-18-2011, 03:29 PM | #4 |
Connoisseur
Posts: 99
Karma: 170
Join Date: Nov 2010
Location: Airdrie Alberta
Device: Sony 650
|
Thanks
Sometimes something as obvious as that sometimes needs another eye |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PRS-650 SD Card Importance? SDHC, SDHC Class 4, Class 10 etc is it important | Renji | Sony Reader | 11 | 12-03-2011 12:30 PM |
Why define a paragraph as a span with no different or extra formatting? | bfollowell | ePub | 7 | 03-16-2011 10:30 PM |
'Heading color' and 'p class span' | mufc | Recipes | 7 | 12-22-2010 09:02 PM |
Span tags, h1s and emspaces | ConorHughes | ePub | 11 | 09-30-2010 05:00 PM |
STREET & CLAIRVOYANCE by Ryan A. Span | Winter | Self-Promotions by Authors and Publishers | 36 | 09-01-2010 11:09 AM |