【发布时间】:2020-05-05 11:00:07
【问题描述】:
我正在 jupyter notebook 中做一个 NLP 项目,其数据集涉及 160000 行。在运行给定的代码时,我遇到了内存错误。
messages = list(zip(processed, Y))
# defined a seed for reproducibility
seed = 1
np.random.seed = seed
np.random.shuffle(messages)
# calling find_features function for each comments
featuresets = [(find_features(text), label) for (text, label) in messages]
显示的错误是 -
<ipython-input-18-faca481e94f7> in find_features(message)
3 features = {}
4 for word in word_features:
----> 5 features[word] = (word in words)
6
7 return features
MemoryError:
有什么办法可以解决这个问题。 我正在运行 Windows 64bit 4gb RAM core i5 8th Gen 笔记本电脑。
【问题讨论】:
标签: python machine-learning deep-learning nlp jupyter-notebook