【发布时间】:2021-01-19 00:21:10
【问题描述】:
所以我正在尝试使用 cheerio 库从 https://hordes.io/clans 和 Node.js 抓取数据,当我检查网站时它是这样的
但是当我将 HTML 记录到控制台时,它是这样的。
<!DOCTYPE html><html lang="en"><head><title>Clans • Hordes.io</title><meta charset="UTF-8"><meta name="Description" content="Roam the lands in gigantic battlegroups, level up with your party, trade rare items and upgrade your gear!"><meta name="viewport" content="width=device-width, initial-scale=1"><link rel="icon" type="image/png" sizes="32x32" href="/assets/ui/favicon32.png?v=44431023"><link rel="icon" type="image/png" sizes="16x16" href="/assets/ui/favicon16.png?v=44431023"><link rel="stylesheet" href="/clans/style.css?v=44431023"></head><body><script async src="/clans/script.js?v=44431023"></script><script async src="https://www.googletagmanager.com/gtag/js?id=UA-83865980-3"></script><script>window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-83865980-3')</script></body></html>
里面没有<div>标签;我该如何解决?
这是我的代码
var rp = require('request-promise');
var cheerio = require('cheerio');
var drago = [];
rp('https://hordes.io/clans', function(err, resp, html) {
if (!err && resp.statusCode == 200) {
var $ = cheerio.load(html);
var parsedResults = [];
$('span.comhead').each(function(i, element) {
var a = $(this).prev();
});
console.log(html)
}
})
【问题讨论】:
-
这能回答你的问题吗? How to parse DOM (REACT)
-
大部分页面由
https://hordes.io/clans/script.js?v=44431023渲染
标签: javascript html css