【发布时间】:2019-08-06 17:47:56
【问题描述】:
我正在尝试将下面的函数从 urllib2 调整为 urllib3。我遇到的问题是 urllib3 没有 openurl 属性。我将如何重写这些函数?
提前谢谢你。
def get_season_URLs(year):
if not year in VALID_YEARS :
print (str(year) + " is an invalid year")
return
base_url_schedule_1 = BASE_URL + "/leagues/NBA_" + str(year) + "_games-"
url_extensions = YEAR_ONE_MONTHS + YEAR_TWO_MONTHS
urls = []
for extension in url_extensions :
schedule_url = get_schedule_URL(year, extension)
doc = BeautifulSoup(urllib2.urlopen(schedule_url).read(), "html.parser")
schedule_table = doc.find_all("tbody")[0]
box_score_els = schedule_table.find_all(attrs={"data-stat": "box_score_text"})
for el in box_score_els :
urls.append(BASE_URL + el.find("a").get('href'))
return urls
def scrape_game(url, f) :
html = urllib2.urlopen(url).read()
doc = BeautifulSoup(html, "html.parser")
if not len(doc.find_all(attrs={"data-label":"All Games in Series"})) == 0 :
return False
line = ""```
【问题讨论】:
标签: python urllib urllib2 urllib3