friendg

学习爬虫day01:抓取网站信息并且下载视频

2019-07-01 14:35  FriendG  阅读(336)  评论(0编辑  收藏  举报

import requests
import re
import time
import uuid
def save_video(content):
with open(f\'{uuid.uuid4()}.mp4\',\'wb\') as f:
f.write(content)
print(\'OK!\')
def get_page(url):
response = requests.get(url)
return response

def parse_index(html):
detail_urls = re.findall(
\'<div class="items"><a class="imglink" href="(.*?)"\',html,re.S
)
print(detail_urls)
return detail_urls

def parse_detail(html):
movie_url = re.findall(\'<source src="(.*?)">\',html,re.S)
if movie_url:
return movie_url[0]

if __name__ == \'__main__\':
url = \'http://www.xiaohuar.com/v/\'
response = get_page(url)
#print(response)
#print(response.status_code)
# print(response.text)
detail_urls = parse_index(response.text)

for detail_url in detail_urls:
#print(detail_url)
#response = get_page(detail_url)
detail_res = get_page(detail_url)
#print(response.text)

movie_url = parse_detail(detail_res.text)

if movie_url:
print(movie_url)

movie_res = get_page(movie_url)

save_video(movie_res.content)

 

分类:

技术点:

相关文章:

  • 2021-12-19
  • 2021-12-19
  • 2021-07-16
  • 2021-12-19
  • 2021-07-11
  • 2021-11-20
  • 2021-12-15
  • 2022-12-23
猜你喜欢
  • 2021-12-19
  • 2022-01-15
  • 2021-11-23
  • 2022-01-02
  • 2021-11-12
  • 2021-11-18
  • 2021-12-05
相关资源
相似解决方案