【发布时间】:2017-04-11 23:56:19
【问题描述】:
当我尝试运行以下代码的最后一点时,我收到一个错误,我无法弄清楚原因。
import random
combined_list = h_sub_text + s_sub_text
print(len(combined_list))
random.shuffle(combined_list)
training_part = int(len(combined_list) * .7)
print(len(combined_list))
training_set = combined_list[:training_part]
test_set = combined_list[training_part:]
print (len(train_set))
print (len(test_set))
import nltk.classify.util
from nltk.classify import NaiveBayesClassifier
classifier = NaiveBayesClassifier.train(train_set)
accuracy = nltk.classify.util.accuracy(classifier, test_set)
print("Accuracy is: ", accuracy * 100)
我得到这个错误:
ValueError Traceback (most recent call last)
<ipython-input-57-151936e75238> in <module>()
2 from nltk.classify import NaiveBayesClassifier
----> 4 classifier = NaiveBayesClassifier.train(training_set)
C:\Program Files (x86)\Anaconda3\lib\site-packages\nltk\classify\naivebayes.py in train(cls, labeled_featuresets, estimator)
--> 194 for featureset, label in labeled_featuresets:
195 label_freqdist[label] += 1
196 for fname, fval in featureset.items():
ValueError: too many values to unpack (expected 2)
提前致谢。
【问题讨论】:
-
用
training_set替换train_set?train_set未在您提供的代码中的任何位置定义。 -
对不起,它的 "NaiveBayesClassifier.train(training_set)" 。在错误中它显示了正确的对象。
标签: python python-3.x machine-learning anaconda training-data