【Posted】: 2016-04-04 11:52:28
【Question】:
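I am trying to read an HTML table stored in Dropbox using the XML package, but the XML::readHTMLTable function does not work on the HTML file in Dropbox and I don't know why. Can anyone help me?
My code:
# Packages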
require(httr)
require(XML)
# Open the HTML table file in Dropbox
FILE <- GET(url="https://www.dropbox.com/s/mb316ghr4irxipr/TALHOES_AGENTES.htm?dl=0")
# Read the table
tables <- getNodeSet(htmlParse(FILE), "//table")
FE_tab <- readHTMLTable(tables[2],
                        header = c("empresa","desc_projeto","desc_regiao",
                                   "cadastrador_por","cod_talhao","descricao",
                                   "formiga_area","qtd_destruido","latitude",
                                   "longitude","data_cadastro"),
                        colClasses = c("character","character","character",
                                       "character","character","character",
                                       "character","character","character",
                                       "character","character"),
                        trim = TRUE, stringsAsFactors = FALSE
)
head(FE_tab) ### Doesn’t work
【Comments】:
- Change your URL to
https://www.dropbox.com/s/mb316ghr4irxipr/TALHOES_AGENTES.htm?dl=0&raw=1
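A minimal sketch of how the suggested URL could be used, assuming the `raw=1` link returns the HTML file itself rather than the Dropbox preview page. The helpers `parse_nth_table` and `read_remote_table` are hypothetical names introduced here for illustration, not part of httr or XML, and the UTF-8 encoding is an assumption about the file. The sketch passes the response body to `htmlParse()` as text and uses `[[n]]` so that `readHTMLTable()` receives a single table node.

```r
library(httr)
library(XML)

# Hypothetical helper (not part of XML): parse the n-th <table> in an HTML string.
parse_nth_table <- function(html_text, n, ...) {
  doc <- htmlParse(html_text, asText = TRUE)
  tables <- getNodeSet(doc, "//table")
  if (n > length(tables)) {
    stop("requested table ", n, " but the page has only ", length(tables))
  }
  # [[n]] extracts the single table node to hand to readHTMLTable()
  readHTMLTable(tables[[n]], trim = TRUE, stringsAsFactors = FALSE, ...)
}

# Hypothetical helper: download a page and parse its n-th table.
read_remote_table <- function(url, n, ...) {
  resp <- GET(url)
  stop_for_status(resp)  # fail early on an HTTP error
  # Assumption: the file is UTF-8; change `encoding` if accents come out garbled.
  html_text <- content(resp, as = "text", encoding = "UTF-8")
  parse_nth_table(html_text, n, ...)
}

# Usage (needs network access; not run here):
# url <- "https://www.dropbox.com/s/mb316ghr4irxipr/TALHOES_AGENTES.htm?dl=0&raw=1"
# FE_tab <- read_remote_table(url, 2)
# head(FE_tab)
```

The `header` and `colClasses` vectors from the question can be passed through `...`, provided their length matches the number of columns in the table.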
Tags: xml r html-table httr rvest