【发布时间】:2015-07-31 23:56:13
【问题描述】:
我有这个代码
import requests
from bs4 import BeautifulSoup
url = "http://www.rockefeller.edu/research/areas/summary.php?id=1"
r = requests.get(url)
soup = BeautifulSoup(r.content)
a = 'Comments'
for x in (soup.find_all('p')):
if a in x:
print (x)
else:
print ('it is not there')
基本上,我想到了一个词,我想知道它在页面中的位置。可以说我的话是“评论”。我想知道那个词评论在哪里:能够打印出它包含的标签(例如:<a href=#>Comments</a>
更新的代码(对我不起作用)
import requests
from bs4 import BeautifulSoup
import re
url = "http://www.rockefeller.edu/research/areas/summary.php?id=1"
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')
for x in (soup.find_all(string=re.compile('comment', flags=re.I))):
print(x.parent)
print(x.parent.name)
【问题讨论】:
标签: python python-3.x beautifulsoup python-requests