【发布时间】:2016-05-06 18:06:40
【问题描述】:
我正在尝试使用 pandas read_html 函数从网页 (Link) 加载数据。当我尝试这个时:
import pandas as pd
url = 'https://en.wikipedia.org/wiki/List_of_countries_by_GDP_%28nominal%29'
mD = pd.read_html(url)
我收到了这个错误:
Traceback (most recent call last):
File "<ipython-input-14-1dee4ebbc8f2>", line 1, in <module>
md = pd.read_html(url)
File "C:\Anaconda3\lib\site-packages\pandas\io\html.py", line 866, in read_html
parse_dates, tupleize_cols, thousands, attrs, encoding)
File "C:\Anaconda3\lib\site-packages\pandas\io\html.py", line 718, in _parse
parser = _parser_dispatch(flav)
File "C:\Anaconda3\lib\site-packages\pandas\io\html.py", line 663, in _parser_dispatch
raise ImportError("html5lib not found, please install it")
ImportError: html5lib not found, please install it
但我应该已经安装了html5lib 模块,如下图所示:
这是另一个问题,因为当我尝试加载 html5lib 时出现此错误:
Traceback (most recent call last):
File "<ipython-input-2-69f09277bafa>", line 1, in <module>
import html5lib
File "C:\Anaconda3\lib\site-packages\html5lib-0.99999999.dev0-py3.5.egg\html5lib\__init__.py", line 16, in <module>
from .html5parser import HTMLParser, parse, parseFragment
File "C:\Anaconda3\lib\site-packages\html5lib-0.99999999.dev0-py3.5.egg\html5lib\html5parser.py", line 6, in <module>
from . import inputstream
File "C:\Anaconda3\lib\site-packages\html5lib-0.99999999.dev0-py3.5.egg\html5lib\inputstream.py", line 9, in <module>
import webencodings
ImportError: No module named 'webencodings'
这就是我被卡住的地方,因为找不到任何方法来安装这个模块。有人知道出了什么问题吗?
顺便说一句,我使用的是 Python 3.5 和 Spyder IDE。
【问题讨论】:
-
你是如何安装
html5lib的?是用pip吗?