03-05-2019, 04:58 PM | #1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Mar 2019
Device: PocketBook Touch HD 3
|
derStandard default recipe not working
The default recipe for one for one of the most famous quality newspapers in Austria, der Standard (or derStandard), is pretty outdated and doesn't work anymore at all.
This is what the current default looks like: Spoiler:
While this might've worked a few years ago, it seems the Standard has changed it's feeds and some other stuff regarding their online presence as well. As you can see here (attention: German. Use a translator if you need to understand that, but it should be rather self explanatory), the feeds list is no longer up to date. It should be like that instead if I understood that part correctly: Code:
feeds = [ (u'Newsroom', u'http://derStandard.at/?page=rss&ressort=Seite1'), (u'International', u'http://derstandard.at/?page=rss&ressort=International'), (u'Inland', u'http://derstandard.at/?page=rss&ressort=Inland'), (u'Wirtschaft', u'http://derStandard.at/?page=rss&ressort=Wirtschaft'), (u'Web', u'http://derStandard.at/?page=rss&ressort=Web'), (u'Sport', u'http://derStandard.at/?page=rss&ressort=Sport'), (u'Panorama', u'http://derStandard.at/?page=rss&ressort=Panorama'), (u'Etat', u'http://derStandard.at/?page=rss&ressort=Etat'), (u'Kultur', u'http://derStandard.at/?page=rss&ressort=Kultur'), (u'Wissenschaft', u'http://derStandard.at/?page=rss&ressort=Wissenschaft'), (u'Gesundheit', u'http://derStandard.at/?page=rss&ressort=Gesundheit'), (u'Bildung', u'http://derStandard.at/?page=rss&ressort=Bildung'), (u'Meinung', u'http://derStandard.at/?page=rss&ressort=Meinung'), (u'Lifestyle', u'http://derStandard.at/?page=rss&ressort=Lifestyle'), (u'Reisen', u'http://derStandard.at/?page=rss&ressort=Reisen'), (u'Familie', u'http://derstandard.at/?page=rss&ressort=Familie'), (u'Meinung', u'http://derStandard.at/?page=rss&ressort=Meinung'), (u'User', u'http://derStandard.at/?page=rss&ressort=User'), (u'Karriere', u'http://derStandard.at/?page=rss&ressort=Karriere'), (u'Immobilien', u'http://derstandard.at/?page=rss&ressort=Immobilien'), (u'Automobil', u'http://derstandard.at/?page=rss&ressort=Automobil'), (u'dieStandard', u'http://derStandard.at/?page=rss&ressort=diestandard'), ] Thanks in advance, Flugschwein PS: according to https://calibre-ebook.com/dynamic/recipe-usage derStandard is the 4th most downloaded German (language wise, not nationality) newspaper using calibre |
03-07-2019, 10:43 PM | #2 |
creator of calibre
Posts: 44,407
Karma: 23977332
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
03-09-2019, 02:20 PM | #3 | |
Junior Member
Posts: 3
Karma: 10
Join Date: Mar 2019
Device: PocketBook Touch HD 3
|
Quote:
Thanks a lot! |
|
03-10-2019, 08:03 PM | #4 |
Enthusiast
Posts: 36
Karma: 10
Join Date: Dec 2017
Location: Los Angeles, CA
Device: Smart Phone
|
update to derStandard
I think this fixes the problem.
Recipe for derStandard: Code:
#!/usr/bin/env python2 # -*- coding: utf-8 -*- from __future__ import unicode_literals, division, absolute_import, print_function __license__ = 'GPL v3' __copyright__ = '2009, Gerhard Aigner <gerhard.aigner at gmail.com>' ''' http://www.derstandard.at - Austrian Newspaper ''' from calibre.web.feeds.news import BasicNewsRecipe def classes(classes): q = frozenset(classes.split(' ')) return dict(attrs={ 'class': lambda x: x and frozenset(x.split()).intersection(q)}) class DerStandardRecipe(BasicNewsRecipe): title = u'derStandard' __author__ = 'Gerhard Aigner and Sujata Raman and Marcel Jira and Peter Reschenhofer' description = u'Nachrichten aus Österreich' publisher = 'derStandard.at' category = 'news, politics, nachrichten, Austria' use_embedded_content = False remove_empty_feeds = True no_stylesheets = True encoding = 'utf-8' language = 'de_AT' oldest_article = 1 max_articles_per_feed = 100 ignore_duplicate_articles = {'title', 'url'} masthead_url = 'http://images.derstandard.at/2012/06/19/derStandardat_1417x274.gif' feeds = [ (u'Newsroom', u'https://derStandard.at/?page=rss&ressort=Seite1'), (u'International', u'https://derstandard.at/?page=rss&ressort=International'), (u'Inland', u'https://derstandard.at/?page=rss&ressort=Inland'), (u'Wirtschaft', u'https://derStandard.at/?page=rss&ressort=Wirtschaft'), (u'Web', u'https://derStandard.at/?page=rss&ressort=Web'), (u'Sport', u'https://derStandard.at/?page=rss&ressort=Sport'), (u'Panorama', u'https://derStandard.at/?page=rss&ressort=Panorama'), (u'Etat', u'https://derStandard.at/?page=rss&ressort=Etat'), (u'Kultur', u'https://derStandard.at/?page=rss&ressort=Kultur'), (u'Wissenschaft', u'https://derStandard.at/?page=rss&ressort=Wissenschaft'), (u'Gesundheit', u'https://derStandard.at/?page=rss&ressort=Gesundheit'), (u'Bildung', u'https://derStandard.at/?page=rss&ressort=Bildung'), (u'Meinung', u'https://derStandard.at/?page=rss&ressort=Meinung'), (u'Lifestyle', u'https://derStandard.at/?page=rss&ressort=Lifestyle'), (u'Reisen', u'https://derStandard.at/?page=rss&ressort=Reisen'), (u'Familie', u'https://derstandard.at/?page=rss&ressort=Familie'), (u'Meinung', u'https://derStandard.at/?page=rss&ressort=Meinung'), (u'User', u'https://derStandard.at/?page=rss&ressort=User'), (u'Karriere', u'https://derStandard.at/?page=rss&ressort=Karriere'), (u'Immobilien', u'https://derstandard.at/?page=rss&ressort=Immobilien'), (u'Automobil', u'https://derstandard.at/?page=rss&ressort=Automobil'), (u'dieStandard', u'https://derStandard.at/?page=rss&ressort=diestandard'), ] def get_browser(self): br = BasicNewsRecipe.get_browser(self) headers = { 'X-Requested-With': 'XMLHttpRequest', 'Content-Type': 'application/json; charset=UTF-8', 'DNT': '1', 'Pragma': 'no-cache', 'Cache-Control': 'no-cache' } import mechanize req = mechanize.Request(url='https://derstandard.at/privacyprotection/api/agree', data=None, headers=headers, method='POST') br.open(req) return br keep_only_tags = [ classes('artikel'), ] remove_tags = [ dict(name=['link', 'iframe', 'style', 'hr']), dict(attrs={'class': ['lookup-links', 'media-list']}), dict(name='form', attrs={'name': 'sitesearch'}), dict(name='div', attrs={'class': ['socialsharing', 'block video', 'blog-browsing section', 'diashow', 'supplemental']}), dict(name='div', attrs={'id': 'highlighted'}) ] remove_attributes = ['width', 'height'] |
03-12-2019, 01:00 PM | #5 |
Junior Member
Posts: 3
Karma: 10
Join Date: Mar 2019
Device: PocketBook Touch HD 3
|
|
Tags |
austria, calibre, derstandard, german, recipe |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
The Hindu Recipe(Better than the default provided with calibre) | sexymax15 | Recipes | 7 | 04-26-2017 11:43 AM |
Improving derStandard-recipe - how to get cover image? | Spindoctor | Recipes | 7 | 05-09-2012 12:57 PM |
Custom news recipe default language | ppclarke | Recipes | 2 | 04-13-2012 07:58 AM |
Strange problem with new default dictionary not working | frodon | Amazon Kindle | 4 | 04-24-2011 04:46 AM |
Recipe not working | phkoech | Calibre | 3 | 08-13-2009 05:41 PM |