888888CN

import requests
from bs4 import BeautifulSoup

# Fetch the target page
res = requests.get('http://news.sina.com.cn/china/', timeout=10)
res.encoding = 'utf-8'  # set the encoding explicitly; otherwise the Chinese text comes out garbled
# print(res.text)
soup = BeautifulSoup(res.text, 'html.parser')  # parse the HTML
for news in soup.select('.news-item'):
    if len(news.select('h2')) > 0:  # skip items that have no headline
        pub_time = news.select('.time')[0].text  # publication time
        h2 = news.select('h2')[0].text  # headline
        a = news.select('a')[0]['href']  # link to the article
        print(pub_time + "\t\t", h2 + "\t", a)
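
The script prints each `href` exactly as it appears in the page. If some of those links turn out to be relative or protocol-relative (an assumption; the original post does not say), they can be resolved against the page URL with the standard library's `urljoin`. A minimal sketch, where `absolute_link` and `BASE_URL` are hypothetical names and the sample paths are made up for illustration:

```python
from urllib.parse import urljoin

# Same page the script above fetches
BASE_URL = 'http://news.sina.com.cn/china/'


def absolute_link(href, base=BASE_URL):
    """Resolve href against base; already-absolute URLs pass through unchanged."""
    return urljoin(base, href.strip())


# Hypothetical href values, not taken from the real page
print(absolute_link('/c/example.shtml'))
print(absolute_link('//news.sina.com.cn/c/example.shtml'))
print(absolute_link('https://news.sina.com.cn/c/example.shtml'))
```

In the loop above, this would mean printing `absolute_link(a)` instead of `a`.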

 

