【发布时间】:2020-07-30 18:42:23
【问题描述】:
我正在按照这里的指南进行操作: https://medium.com/swlh/tutorial-web-scraping-instagrams-most-precious-resource-corgis-235bf0389b0c
我过去可以使用它,但由于某种原因,现在它正在返回如下所示的空数组,而不是永久链接列表
C:\Users\19053\InstagramPublicImageDownloader\venv\Scripts\python.exe C:/Users/19053/InstagramPublicImageDownloader/getpermalinks.py
[]
[]
[]
[]
[]
[]
[]
[]
应该是这样的
['https://www.instagram.com/p/CDRbCxjBakW/','https://www.instagram.com/p/CDMQ9J2Fvl4/','...and so on']
代码如下:
from selenium.webdriver import Chrome
url = "https://www.instagram.com/dairyqueen/"
browser = Chrome()
browser.get(url)
post = 'https://www.instagram.com/p/'
post_links = []
while len(post_links) < 25:
links = [a.get_attribute('href') for a in browser.find_elements_by_tag_name('a')]
for link in links:
if post in link and link not in post_links:
post_links.append(link)
scroll_down = "window.scrollTo(0, document.body.scrollHeight);"
browser.execute_script(scroll_down)
time.sleep(10)
else:
print(post_links[:25])
【问题讨论】:
-
以下答案是否解决了您的问题?如果是,请随时接受/支持 :)
标签: python arrays selenium web-scraping instagram