Beautiful soup can not find tags

Question

Beautiful soup can not find tags

I am currently trying to practice with BeautifulSoup's requests and modules in Python 3.6, and am running into a problem that I cannot find in other questions and answers about.

It seems that at some point the Beuatiful Soup page stops recognizing tags and identifiers. I am trying to pull Play-by-play data from a page as follows:

http://www.pro-football-reference.com/boxscores/201609080den.htm

import requests, bs4

source_url = 'http://www.pro-football-reference.com/boxscores/201609080den.htm'
res = requests.get(source_url)
if '404' in res.url:
    raise Exception('No data found for this link: '+source_url)

soup = bs4.BeautifulSoup(res.text,'html.parser')

#this works
all_pbp = soup.findAll('div', {'id' : 'all_pbp'})
print(len(all_pbp))

#this doesn't
table = soup.findAll('table', {'id' : 'pbp'})
print(len(table))

Using the inspector in Chrome, I see that the table definitely exists. I also tried using it in div and tr in the later half of HTML and it doesn't seem to work. I tried the standard "html.parser" as well as lxml and html5lib, but nothing works.

- , - HTML , BeautifulSoup ? (hockey-reference.com, basketball-reference.com), .

- HTML, - /, ?

, BF

+6

python beautifulsoup

Big Fore 02 . '17 4:21

2

, , . , html. Comment BeautifulSoup, :

import requests
from bs4 import BeautifulSoup,Comment

source_url = 'http://www.pro-football-reference.com/boxscores/201609080den.htm'
res = requests.get(source_url)
if '404' in res.url:
    raise Exception('No data found for this link: '+source_url)

soup = BeautifulSoup(res.content,'html.parser')

comments=soup.find_all(string=lambda text:isinstance(text,Comment))

for comment in comments:
    comment=BeautifulSoup(str(comment), 'html.parser')
    search_play = comment.find('table', {'id':'pbp'})
    if search_play:
        play_to_play=search_play

0

Dmitriy Fialkovskiy 02 . '17 5:54

qwertyuip9 · Accepted Answer · 2017-07-02T15:23:44+0000

BS4 javascript - GET URL-. , , , async javascript.

, javascript , HTML. , !

Beautiful soup can not find tags

More articles: