【发布时间】:2019-02-22 05:34:18
【问题描述】:
我正在尝试查找此页面所有事件的 URL:
https://www.eventshigh.com/delhi/food?src=exp
但我只能看到 JSON 格式的 URL:
{
"@context":"http://schema.org",
"@type":"Event",
"name":"DANDIYA NIGHT 2018",
"image":"https://storage.googleapis.com/ehimages/2018/9/4/img_b719545523ac467c4ad206c3a6e76b65_1536053337882_resized_1000.jpg",
"url":"https://www.eventshigh.com/detail/Delhi/5b30d4b8462a552a5ce4a5ebcbefcf47-dandiya-night-2018",
"eventStatus": "EventScheduled",
"startDate":"2018-10-14T18:30:00+05:30",
"doorTime":"2018-10-14T18:30:00+05:30",
"endDate":"2018-10-14T22:30:00+05:30",
"description" : "Dress code : TRADITIONAL (mandatory)\u00A0 \r\n Dandiya sticks will be available at the venue ( paid)\u00A0 \r\n Lip smacking food, professional dandiya Dj , media coverage , lucky draw \u00A0, Dandiya Garba Raas , Shopping and Games .\u00A0 \r\n \u00A0 \r\n Winners\u00A0 \r\n \u00A0 \r\n Best dress ( all",
"location":{
"@type":"Place",
"name":"K And L Community Hall (senior Citizen Complex )",
"address":"80 TO 49, Pocket K, Sarita Vihar, New Delhi, Delhi 110076, India"
},
这里是:
"url":"https://www.eventshigh.com/detail/Delhi/5b30d4b8462a552a5ce4a5ebcbefcf47-dandiya-night-2018"
但我找不到任何其他包含链接的 HTML/XML 标记。我也找不到包含链接的相应 JSON 文件。你能帮我把这个页面所有事件的链接刮下来吗:
https://www.eventshigh.com/delhi/food?src=exp
【问题讨论】:
-
检查任何
sitemap.xml存在吗?见api.hackertarget.com/pagelinks/?q=https://www.eventshigh.com/…
标签: python json scrapy web-crawler