【发布时间】:2015-12-29 05:34:19
【问题描述】:
我正在尝试使用 python 3.5 和漂亮的汤从蒸汽的每一页上刮掉标题,以便免费播放搜索结果。但是返回的结果只是第一页上的标题,而不是后续页面上的标题:
import requests
from bs4 import BeautifulSoup
titles_list=[]
for i in range(3): # Number of pages plus one
print(i)
url = 'http://store.steampowered.com/genre/Free%20to%20Play/?tab=MostPlayed#p' + str(i)
print(url)
soup = BeautifulSoup(requests.get(url).content)
titles=soup.find_all("div",{"class":"tab_item_name"})
for item in titles:
try:
name=item.text
except:
name='sdfsd'
print(name)
titles_list.append(name)
控制台结果(我知道 0 和 1 相同,但 i=2 应该显示不同的游戏集):
0
http://store.steampowered.com/genre/Free%20to%20Play/?tab=MostPlayed#p0
Dota 2
Team Fortress 2
Warframe
Clicker Heroes
Unturned
Path of Exile
War Thunder
SMITE
Trove
AdVenture Capitalist
1
http://store.steampowered.com/genre/Free%20to%20Play/?tab=MostPlayed#p1
Dota 2
Team Fortress 2
Warframe
Clicker Heroes
Unturned
Path of Exile
War Thunder
SMITE
Trove
AdVenture Capitalist
2
http://store.steampowered.com/genre/Free%20to%20Play/?tab=MostPlayed#p2
Dota 2
Team Fortress 2
Warframe
Clicker Heroes
Unturned
Path of Exile
War Thunder
SMITE
Trove
AdVenture Capitalist
有人知道这里发生了什么吗?
【问题讨论】:
标签: python python-3.x web-scraping beautifulsoup