I am trying to parse an html page and select values ββfor currencies and write csv. I have the following code:
#!/usr/bin/env python import urllib2 from BeautifulSoup import BeautifulSoup contenturl = "http://www.bank.gov.ua/control/en/curmetal/detail/currency?period=daily" soup = BeautifulSoup(urllib2.urlopen(contenturl).read()) table = soup.find('div', attrs={'class': 'content'}) rows = table.findAll('tr') for tr in rows: cols = tr.findAll('td') for td in cols: text = td.find(text=True) + ';' print text, print
The problem is that I do not know how to get only the values ββfor the currency. I tried some regex, like '^ [0-9] {3}' - start with 3 digits, but that doesn't work.
user2140323
source share