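【Posted】: 2020-01-19 15:12:18
【Question】:
I'm looking into how to scrape the LinkedIn connections feed (https://www.linkedin.com/mynetwork/invite-connect/connections/), but handling the infinite scroll seems impossible. How can I deal with it? I don't want to use Selenium (I'd like to deploy this as a web service later).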
from bs4 import BeautifulSoup
import requests

def scraping(webpage):
    # Send a browser-like User-Agent header with the request.
    headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36'}
    response = requests.get(str(webpage), headers=headers)
    # Parse the HTML exactly as the server returned it; no JavaScript is executed.
    soup = BeautifulSoup(response.text, "html.parser")
    print(soup)

scraping('https://www.linkedin.com/mynetwork/invite-connect/connections')
【Comments】:
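On a page with infinite scroll, the items that appear as you scroll are loaded by JavaScript through extra background requests, so a single requests.get only ever sees the initial HTML. One common way to avoid Selenium is to find that background request in the browser's network tab and call it directly, page by page. Below is a minimal sketch of that idea, not a confirmed LinkedIn API: the `start` and `count` parameter names, the `elements` response key, and the helper names `make_fetcher` and `iter_items` are all assumptions for illustration, and the real endpoint, parameters, and required login cookies would have to be read from the network tab.

```python
from typing import Any, Callable, Dict, Iterator, List


def make_fetcher(session: Any, url: str) -> Callable[[int, int], List[Dict]]:
    """Build a function that fetches one page of items from a paged JSON endpoint.

    `session` is expected to behave like requests.Session (already logged in).
    The parameter names "start"/"count" and the "elements" key are hypothetical.
    """
    def fetch_page(start: int, count: int) -> List[Dict]:
        response = session.get(url, params={"start": start, "count": count})
        response.raise_for_status()
        return response.json().get("elements", [])

    return fetch_page


def iter_items(fetch_page: Callable[[int, int], List[Dict]],
               page_size: int = 40,
               max_pages: int = 100) -> Iterator[Dict]:
    """Yield items page by page until a page is empty or shorter than page_size."""
    for page in range(max_pages):
        items = fetch_page(page * page_size, page_size)
        if not items:
            break
        yield from items
        if len(items) < page_size:
            # A short page means there is nothing left to load.
            break
```

To use it, you would pass a requests.Session that already carries your login cookies, plus the endpoint URL copied from the network tab, to make_fetcher, and then loop over iter_items(fetch_page). Keeping the paging loop separate from the HTTP call makes it easy to test the loop with a fake fetcher and to swap in a different endpoint later.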
Tags: python beautifulsoup