【发布时间】:2021-04-29 17:14:22
【问题描述】:
我是 python 世界的新手,想知道如何将数据从 github 抓取到 CSV 文件中,例如 https://gist.github.com/simsketch/1a029a8d7fca1e4c142cbfd043a68f19#file-pokemon-csv
我正在尝试使用此代码,但它不是很成功。绝对应该有一种更简单的方法来做到这一点。
提前谢谢你!
from bs4 import BeautifulSoup
import requests
import csv
url = 'https://gist.github.com/simsketch/1a029a8d7fca1e4c142cbfd043a68f19'
r = requests.get(url)
soup = BeautifulSoup(r.text, 'html.parser')
pokemon_table = soup.find('table', class_= 'highlight tab-size js-file-line-container')
for pokemon in pokemon_table.find_all('tr'):
name = [pokemon.find('td', class_= 'blob-code blob-code-inner js-file-line').text]
with open('output.csv', 'w') as csvfile:
writer = csv.writer(csvfile)
writer.writerows(name)
【问题讨论】:
标签: python csv web-scraping beautifulsoup