【发布时间】:2020-07-17 11:30:27
【问题描述】:
我正在尝试使用 DeepLearning4j 将 32x32 图像分类为 0-9 的数字。 我查阅了许多示例和教程,但在将数据集拟合到网络时总是遇到一些异常。
我目前正在尝试将 ImageRecordReader 与 ParentPathLabelGenerator 和 RecordReaderDataSetIterator 一起使用。
图像似乎可以正常加载,但我在拟合时总是遇到 DL4JInvalidInputException。
File parentDir = new File(dataPath);
FileSplit filesInDir = new FileSplit(parentDir, NativeImageLoader.ALLOWED_FORMATS);
ParentPathLabelGenerator labelMaker = new ParentPathLabelGenerator();
BalancedPathFilter pathFilter = new BalancedPathFilter(new Random(), labelMaker, 100);
InputSplit[] filesInDirSplit = filesInDir.sample(pathFilter, 80, 20);
InputSplit trainData = filesInDirSplit[0];
InputSplit testData = filesInDirSplit[1];
ImageRecordReader recordReader = new ImageRecordReader(numRows, numColumns, 3, labelMaker);
recordReader.initialize(trainData);
DataSetIterator dataIter = new RecordReaderDataSetIterator(recordReader, 1, 1, outputNum);
使用 DenseLayer 时:
Exception in thread "main" org.deeplearning4j.exception.DL4JInvalidInputException: Input that is not a matrix; expected matrix (rank 2), got rank 4 array with shape [1, 3, 32, 32]. Missing preprocessor or wrong input type? (layer name: layer0, layer index: 0, layer type: DenseLayer)
使用 ConvolutionLayer 时,OutputLayer 会出现错误:
Exception in thread "main" org.deeplearning4j.exception.DL4JInvalidInputException: Input that is not a matrix; expected matrix (rank 2), got rank 4 array with shape [1, 1000, 28, 28]. Missing preprocessor or wrong input type? (layer name: layer1, layer index: 1, layer type: OutputLayer)
是我加载图像的尝试不正确还是我的网络配置错误?
配置:
MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
.list()
.layer(0, new ConvolutionLayer.Builder()
.nIn(3) // Number of input datapoints.
.nOut(1000) // Number of output datapoints.
.activation(Activation.RELU) // Activation function.
.weightInit(WeightInit.XAVIER) // Weight initialization.
.build())
.layer(1, new OutputLayer.Builder(LossFunctions.LossFunction.NEGATIVELOGLIKELIHOOD)
.nIn(1000)
.nOut(outputNum)
.activation(Activation.SOFTMAX)
.weightInit(WeightInit.XAVIER)
.build())
.build();
【问题讨论】:
标签: java deeplearning4j dl4j