09-22-2015, 06:59 PM | #1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Sep 2015
Location: Argentina
Device: Kindle
|
Spanish recipes not working properly
Hi guys! I was trying to download some news with Calibre and realized that a few recipes are not working properly. I tried some of the recipes under "Spanish (Argentina)" and found this:
- "Miradas al Sur": Fails with "No articles found, aborting".
- "Clarin": Creates the mobi file, but it contains the same short news items repeated about 200 times.
- "Veintitres": Creates a file with only code inside.
- "Perfil": Downloads only the news sections, but not the articles.
- "Infobae.com": Same as Clarin, short news items repeated over and over.
- "Telam": Fails with "No articles found, aborting".
- "ElArgentino.com": Same as "Veintitres".
- "Ambito Financiero": Fails to download any images, including the cover, but the news downloads fine.
I honestly would be happy with just the first three or four sources working, but I wanted to report the other broken sources as well. I don't understand the language/code (yet) well enough to fix them myself, so if anyone would like to give me a hand I'll be really grateful.
PS: Since this is my first post, I would like to thank Kovid for creating Calibre. It's an outstanding tool and I'm really impressed with it! It's a must-have, so thanks a lot for your hard work on it! And thanks to everyone else for helping with its development. |
09-23-2015, 03:50 AM | #2 |
creator of calibre
Posts: 44,387
Karma: 23798586
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
You're welcome
|
09-24-2015, 09:23 PM | #3 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
I got the info and will see what can be done.
|
09-25-2015, 09:06 AM | #4 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
Here is the updated version of the recipe for Clarin. It now requires a user account. Make sure not to create the account with Google+ or Facebook; it has to be an account created on the Clarin site.
https://bugs.launchpad.net/calibre/+bug/1499725
I will work on the others in the following days. |
09-25-2015, 11:44 AM | #5 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
Updated recipe for La Nacion
https://bugs.launchpad.net/calibre/+bug/1499772 |
10-11-2015, 11:24 AM | #6 |
Junior Member
Posts: 3
Karma: 10
Join Date: Sep 2015
Location: Argentina
Device: Kindle
|
Thanks A LOT for your work on this! I've been really busy so I couldn't check this earlier. I just upgraded Calibre and all the recipes have been upgraded. I really appreciate you taking the time to fix all those recipes!
|
05-31-2021, 07:00 PM | #7 |
Guru
Posts: 761
Karma: 308700
Join Date: Sep 2017
Location: Argentina
Device: moon+ reader, kindle paperwhite
|
Hello, apparently the thread is old.
The recipe for Ambito Financiero has stopped working. |
06-02-2021, 09:18 AM | #8 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
These issues have been resolved. Updated recipes should appear in the next release of Calibre.
|
11-28-2021, 09:34 AM | #9 |
Junior Member
Posts: 2
Karma: 10
Join Date: Nov 2021
Device: kindle
|
Hi there.
First of all, thank you all for putting so much effort into keeping this alive. That being said, I'd like to report that the Clarín recipe (as well as the La Nación one) is not working anymore (it fails to download all the articles). Thanks again for your efforts. |
11-28-2021, 10:02 AM | #10 |
Junior Member
Posts: 2
Karma: 10
Join Date: Nov 2021
Device: kindle
|
it works now!
Well, I have been tampering with the recipe (which seems to be the exact word, since I have literally no idea about it), and it seems to work now. I just erased the last commands for pre- and post-processing. It would probably benefit from some additional tweaking, but I will copy the new code here in case somebody wants to use it or, even better, improve it.
#!/usr/bin/env python
# -*- mode: python -*-
# -*- coding: utf-8 -*-

from __future__ import unicode_literals

__license__ = 'GPL v3'
__copyright__ = '2008-2016, Darko Miletic <darko.miletic at gmail.com>'
'''
clarin.com
'''

try:
    from urllib.parse import urlencode
except ImportError:
    from urllib import urlencode
from calibre import strftime
from calibre.web.feeds.news import BasicNewsRecipe


class Clarin(BasicNewsRecipe):
    title = 'Clarín'
    __author__ = 'Darko Miletic, updated by GGsalas'
    description = 'Clarin.com. Noticias de la Argentina y el mundo. Información actualizada las 24 horas y en español. Informate ya'
    publisher = 'Grupo Clarin'
    category = 'news, politics, Argentina'
    oldest_article = 1
    max_articles_per_feed = 100
    use_embedded_content = False
    no_stylesheets = True
    encoding = 'utf8'
    delay = 1
    language = 'es_AR'
    publication_type = 'newspaper'
    needs_subscription = 'optional'
    INDEX = 'http://www.clarin.com'
    LOGIN = 'https://app-pase.clarin.com/pase-registracion/app/pase/ingresarNavegable?execution=e1s1'
    masthead_url = 'http://www.clarin.com/images/logo_clarin.svg'
    cover_url = strftime('http://tapas.clarin.com/tapa/%Y/%m/%d/%Y%m%d_thumb.jpg')
    compress_news_images = True
    scale_news_images_to_device = True
    compress_news_images_max_size = 10  # kB
    scale_news_images = True
    handle_gzip = True

    # To get all the data (images)
    auto_cleanup = False

    extra_css = """
        h1#title {
            line-height: 1em;
            margin: 0 0 .5em 0;
        }
        p.volanta {
            font-size: .7em;
            margin-bottom: .5em;
        }
        .bajada h2 {
            font-size: 1em;
            line-height: 1em;
            color: #666666;
            margin: 0 0 1em 0;
        }
        .figcaption {
            font-style: italic;
            font-size: .9em;
            margin-bottom: .5em;
        }
    """

    conversion_options = {
        'comment': description,
        'tags': category,
        'publisher': publisher,
        'language': language
    }

    keep_only_tags = [
        dict(name='p', attrs={'class': 'volanta'}),
        dict(name='h1', attrs={'id': 'title'}),
        dict(name='div', attrs={'class': 'bajada'}),
        dict(name='div', attrs={'id': 'galeria-trigger'}),
        dict(name='div', attrs={'class': 'body-nota'})
    ]

    remove_tags = [
        dict(name=['meta', 'base', 'link', 'iframe', 'embed', 'object']),
        dict(attrs={'class': ['tags-bar', 'breadcrumb', 'share-bar', 'share', 'sp__SM']}),
        dict(name='div', attrs={'class': lambda x: x and 'r-nota' in x.split()}),
        dict(attrs={'id': ['relacionadas']}),
        dict(name='a', attrs={'class': 'content-new'})
    ]

    remove_tags_after = dict(name='div', attrs={'id': 'relacionadas'})
    remove_attributes = ['lang']

    # Images on highlights view
    def populate_article_metadata(self, article, soup, first):
        if first and hasattr(self, 'add_toc_thumbnail'):
            picdiv = soup.find('img')
            if picdiv is not None:
                self.add_toc_thumbnail(article, picdiv['src'])

    feeds = [
        (u'Lo Ultimo', u'http://www.clarin.com/rss/lo-ultimo/'),
        (u'Politica', u'http://www.clarin.com/rss/politica/'),
        (u'Opinion', u'https://www.clarin.com/rss/opinion/'),
        (u'Cultura', u'https://www.clarin.com/rss/cultura/'),
        (u'Economia', u'https://www.clarin.com/rss/economia/'),
        (u'Tecnologia', u'https://www.clarin.com/rss/tecnologia/'),
        (u'RevistaN', u'https://www.clarin.com/rss/revista-enie/'),
        (u'Viva', u'https://www.clarin.com/rss/viva/'),
        (u'Deportes', u'http://www.clarin.com/rss/deportes/'),
        (u'Mundo', u'http://www.clarin.com/rss/mundo/'),
        (u'Espectaculos', u'http://www.clarin.com/rss/espectaculos/'),
        (u'Sociedad', u'http://www.clarin.com/rss/sociedad/'),
        (u'Ciudades', u'http://www.clarin.com/rss/ciudades/'),
        (u'Policiales', u'http://www.clarin.com/rss/policiales/'),
        (u'Internet', u'http://www.clarin.com/rss/internet/')
    ]

    def get_browser(self):
        br = BasicNewsRecipe.get_browser(self)
        br.open(self.INDEX)
        return br
|
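For anyone tweaking the recipe further: the `remove_tags` entry that uses `lambda x: x and 'r-nota' in x.split()` is meant to match a `div` whose `class` attribute contains `r-nota` as a whole word, not as part of a longer class name. Here is a minimal standalone sketch of that check. The helper name `has_class_token` and the sample class strings are made up for illustration, and it assumes the matcher is handed the attribute as a string (or `None` when the attribute is missing).

```python
def has_class_token(token):
    """Build a predicate equivalent to the lambda used in remove_tags."""
    def predicate(class_value):
        # class_value is assumed to be the class attribute as a string,
        # or None when the tag has no class attribute at all.
        return bool(class_value) and token in class_value.split()
    return predicate


matches_r_nota = has_class_token('r-nota')

print(matches_r_nota('col r-nota wide'))  # whole-token match
print(matches_r_nota('r-nota-extra'))     # longer class name, no match
print(matches_r_nota(None))               # tag without a class attribute
```

Splitting on whitespace is what keeps a class like `r-nota-extra` from being removed by accident, which a plain substring test would not do.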
11-28-2021, 01:47 PM | #11 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
Both Clarin and La Nacion have a paywall that is more complex than a classic form-based user login. Not really sure if I care enough to do something about it.
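For context, a classic form login in a recipe usually amounts to opening the login page, filling in the first form, and submitting it. Here is a rough sketch under stated assumptions: `form_login` is a hypothetical helper, the field names `username` and `password` are placeholders rather than the actual fields on either site, and `br` is assumed to be a mechanize-style browser like the one returned by `BasicNewsRecipe.get_browser`. A paywall that depends on JavaScript or token exchange would not be handled by this.

```python
def form_login(br, login_url, username, password,
               user_field='username', pass_field='password'):
    """Log in through a plain HTML form using a mechanize-style browser.

    The field names are placeholders; a real site will use its own.
    """
    if not (username and password):
        # No credentials supplied: leave the browser untouched.
        return br
    br.open(login_url)
    br.select_form(nr=0)  # assumption: the login form is the first form
    br[user_field] = username
    br[pass_field] = password
    br.submit()
    return br


# Inside a recipe that sets needs_subscription, it could be called as:
#     def get_browser(self):
#         br = BasicNewsRecipe.get_browser(self)
#         return form_login(br, self.LOGIN, self.username, self.password)
```

Keeping the login in a small function that only needs a browser object makes it possible to try it against a fake browser before running a full download.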
|
Tags |
argentina, calibre, problem, recipes, spanish |
|