~facundo/encuentro/trunk

« back to all changes in this revision

Viewing changes to server/scrapers_cda.py

  • Committer: Facundo Batista
  • Date: 2015-12-31 20:58:36 UTC
  • Revision ID: facundo@taniquetil.com.ar-20151231205836-98m9rnza8mh9ugvo
Little detail in CDA scrapping.

Show diffs side-by-side

added added

removed removed

Lines of Context:
28
28
    soup = bs4.BeautifulSoup(data['html'], "html.parser")
29
29
    for article in soup.find_all('article'):
30
30
        image = article.find('img')['src']
31
 
        title = article.find('h3', itemprop='name').text
 
31
        title = article.find('h3', itemprop='name').text.strip()
32
32
 
33
33
        # prepare text
34
34
        info = article.find('div', class_='info closed')