【问题标题】:I have a list with urls-pdfs and i want to save them locally to my pc我有一个带有 urls-pdfs 的列表,我想将它们本地保存到我的电脑
【发布时间】:2019-01-29 13:19:39
【问题描述】:

我是 python 新手,所以我有特定的 json,我已经提取了 字典中的 documentElement 值,然后将其设为列表...如何将此 pdf 自动下载到目录文件夹?

import urllib.request
import requests
import json

url = 'https://diavgeia.gov.gr/luminapi/api/search/export?q=decisionType:%22%CE%93%CE%9D%CE%A9%CE%9C%CE%9F%CE%94%CE%9F%CE%A4%CE%97%CE%A3%CE%97%22&OrganizationUid:%2250024%22&status:%22%CE%91%CE%BD%CE%B1%CF%81%CF%84%CE%B7%CE%BC%CE%AD%CE%BD%CE%B7%22&page=1&size=4&wt=json'


#get urls
response = requests.get(url)
with urllib.request.urlopen(url) as u:
    data = json.loads(u.read().decode())
#add links to the list
pdf_links = list()
for key in data:
    for x in data[key]:
       pdf_links.append(x['documentUrl'])
#print
print(pdf_links)

【问题讨论】:

  • 您必须询问具体的编程问题。告诉我们您做了什么以及您如何无法实现您的目标。不要要求其他人为您编写代码。如何提出好问题请参考帮助中心:stackoverflow.com/help/how-to-ask

标签: python pdf download urllib


【解决方案1】:

我们开始吧:

import requests
response = requests.get('https://diavgeia.gov.gr/luminapi/api/search/export?q=decisionType:%22%CE%93%CE%9D%CE%A9%CE%9C%CE%9F%CE%94%CE%9F%CE%A4%CE%97%CE%A3%CE%97%22&OrganizationUid:%2250024%22&status:%22%CE%91%CE%BD%CE%B1%CF%81%CF%84%CE%B7%CE%BC%CE%AD%CE%BD%CE%B7%22&page=1&size=4&wt=json')
for doc in response.json()['decisionResultList']:
    r = requests.get(doc['documentUrl'], stream=True)
    with open('{}.pdf'.format(doc['ada']), 'wb') as f:
        for chunk in r:
            f.write(chunk)

以下文件已下载到我的电脑:

【讨论】:

    猜你喜欢
    • 1970-01-01
    • 1970-01-01
    • 2011-05-25
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多