I want to write a Python script (using 3.4.3) that grabs an HTML page from a URL and can go through the DOM to try to find a specific element.
I currently have this:
import urllib.request

def getSite(url):
    return urllib.request.urlopen(url)

if __name__ == '__main__':
    content = getSite('http://www.google.com').read()
    print(content)
When I print the content, I get the entire HTML page, which is close to what I want. Ideally, though, I'd like to navigate the DOM rather than treat the page as one giant string.
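To make the goal concrete, here is a minimal sketch of the kind of lookup I have in mind. It assumes the standard library's html.parser module is an acceptable starting point; it is event-based rather than a real DOM tree, and the names ElementFinder and find_text, as well as the tag/id arguments, are hypothetical placeholders and not part of my script above.

```python
from html.parser import HTMLParser


class ElementFinder(HTMLParser):
    """Collect the text inside every element with a given tag name and id.

    Hypothetical helper for illustration only.
    """

    def __init__(self, tag, element_id):
        super().__init__()
        self.tag = tag
        self.element_id = element_id
        self.depth = 0      # > 0 while the parser is inside a matching element
        self.matches = []   # accumulated text, one entry per matching element

    def handle_starttag(self, tag, attrs):
        if self.depth:
            # Already inside a match: track nested tags of the same name
            # so the matching end tag can be recognised later.
            if tag == self.tag:
                self.depth += 1
        elif tag == self.tag and dict(attrs).get('id') == self.element_id:
            self.depth = 1
            self.matches.append('')

    def handle_endtag(self, tag):
        if self.depth and tag == self.tag:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.matches[-1] += data


def find_text(html, tag, element_id):
    """Return the text of each <tag id=element_id> element in an HTML string."""
    finder = ElementFinder(tag, element_id)
    finder.feed(html)
    finder.close()
    return finder.matches
```

Since read() returns bytes, the page would need decoding first, for example find_text(content.decode('utf-8', errors='replace'), 'div', 'main'), where UTF-8 is only an assumption about the page's encoding.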
I'm still pretty new to Python, but I have experience with several other languages (mainly Java, C#, C++, C, PHP, JS). I've already done something similar in Java, but wanted to try it in Python.
Any help is appreciated. Hooray!