【Posted】: 2018-12-10 01:46:39
【Question】:
I want to predict a response variable that has 700 classes.
Deep learning model parameters:
from h2o.estimators import deeplearning

dl_model = deeplearning.H2ODeepLearningEstimator(
    hidden=[200, 200],
    epochs=10,
    missing_values_handling='MeanImputation',
    max_categorical_features=4,
    distribution='multinomial'
)

# Train the model
dl_model.train(
    x=Content_vecs.names,
    y='tags',
    training_frame=data_split[0],
    validation_frame=data_split[1]
)
Original response variable (tags):
apps, email, mail
finance,freelancers,contractors,zen99
genomes
gogovan
brazil,china,cloudflare
hauling,service,moving
ferguson,crowdfunding,beacon
cms,naytev
y,combinator
in,store,
conversion,logic,ad,attribution
Response variable tags:
[74]
[156, 89]
[153, 13, 133, 40]
[150]
[474, 277, 113]
[181, 117]
[15, 87, 8, 11]
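As a rough illustration of what "number of classes" can mean for a response like the one above, the sketch below counts both the distinct tag ids and the distinct tag combinations in the sample rows. It uses plain Python only; `class_counts` is a hypothetical helper, not part of h2o. It assumes that each row's whole list might be treated as a single categorical value, in which case the class count would be the number of distinct combinations rather than the size of the tag vocabulary. Whether that is what happens in the actual frame is not confirmed by the post.

```python
# Sample rows copied from the encoded tag lists shown above.
rows = [
    [74],
    [156, 89],
    [153, 13, 133, 40],
    [150],
    [474, 277, 113],
    [181, 117],
    [15, 87, 8, 11],
]


def class_counts(rows):
    """Return (number of distinct tag ids, number of distinct tag combinations).

    Hypothetical helper for illustration only; it does not call h2o.
    """
    # Every individual tag id that appears anywhere in the data.
    tag_ids = {tag for row in rows for tag in row}
    # Each row's full tag set, order-insensitive, treated as one label.
    combos = {tuple(sorted(row)) for row in rows}
    return len(tag_ids), len(combos)


n_tags, n_combos = class_counts(rows)
print(n_tags, n_combos)
```

Running the same count over the full dataset would show which of the two numbers corresponds to the 700 classes mentioned in the question.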
Error:
OSError: Job with key $03017f00000132d4ffffffff$_8355bcac0e9e98a86257f45c180e4898 failed with an exception: java.lang.UnsupportedOperationException: error cannot be computed: too many classes
stacktrace: java.lang.UnsupportedOperationException: error cannot be computed: too many classes at hex.ConfusionMatrix.err(ConfusionMatrix.java:92)
However, h2o-core/src/main/java/hex/ConfusionMatrix.java states that it can handle 1000 classes.
【Discussion】:
Tags: python-3.x deep-learning h2o multilabel-classification