becks

如下图,爬取标题,标题为中文内容,打印出乱码

 

 解决方法,需要对请求后得返回数据重新编码

response = requests.get(url=url2)
response.encoding = \'utf-8\' #处理编码得步骤
print(url2)
wb_data = response.text # 将页面转换成文档树
html = etree.HTML(wb_data)
b = html.xpath(\'//div[@class = "picmainer"]/h1/text()\')
b=(b[0])
print(b)
response.encoding = \'utf-8\' 

 

 内容正常显示了

 

分类:

技术点:

相关文章: