如何为本地图像配置 DL4j答案

【问题标题】：How to configure DL4j for local images如何为本地图像配置 DL4j
【发布时间】：2020-07-17 11:30:27
【问题描述】：

我正在尝试使用 DeepLearning4j 将 32x32 图像分类为 0-9 的数字。我查阅了许多示例和教程，但在将数据集拟合到网络时总是遇到一些异常。

我目前正在尝试将 ImageRecordReader 与 ParentPathLabelGenerator 和 RecordReaderDataSetIterator 一起使用。

图像似乎可以正常加载，但我在拟合时总是遇到 DL4JInvalidInputException。

        File parentDir = new File(dataPath);
        FileSplit filesInDir = new FileSplit(parentDir, NativeImageLoader.ALLOWED_FORMATS);
        ParentPathLabelGenerator labelMaker = new ParentPathLabelGenerator();

        BalancedPathFilter pathFilter = new BalancedPathFilter(new Random(), labelMaker, 100);
        InputSplit[] filesInDirSplit = filesInDir.sample(pathFilter, 80, 20);
        InputSplit trainData = filesInDirSplit[0];
        InputSplit testData = filesInDirSplit[1];

        ImageRecordReader recordReader = new ImageRecordReader(numRows, numColumns, 3, labelMaker);
        recordReader.initialize(trainData);

        DataSetIterator dataIter = new RecordReaderDataSetIterator(recordReader, 1, 1, outputNum);

使用 DenseLayer 时：

Exception in thread "main" org.deeplearning4j.exception.DL4JInvalidInputException: Input that is not a matrix; expected matrix (rank 2), got rank 4 array with shape [1, 3, 32, 32]. Missing preprocessor or wrong input type? (layer name: layer0, layer index: 0, layer type: DenseLayer)

使用 ConvolutionLayer 时，OutputLayer 会出现错误：

Exception in thread "main" org.deeplearning4j.exception.DL4JInvalidInputException: Input that is not a matrix; expected matrix (rank 2), got rank 4 array with shape [1, 1000, 28, 28]. Missing preprocessor or wrong input type? (layer name: layer1, layer index: 1, layer type: OutputLayer)

是我加载图像的尝试不正确还是我的网络配置错误？

配置：

MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
                .list()
                .layer(0, new ConvolutionLayer.Builder()
                        .nIn(3) // Number of input datapoints.
                        .nOut(1000) // Number of output datapoints.
                        .activation(Activation.RELU) // Activation function.
                        .weightInit(WeightInit.XAVIER) // Weight initialization.
                        .build())
                .layer(1, new OutputLayer.Builder(LossFunctions.LossFunction.NEGATIVELOGLIKELIHOOD)
                        .nIn(1000)
                        .nOut(outputNum)
                        .activation(Activation.SOFTMAX)
                        .weightInit(WeightInit.XAVIER)
                        .build())
                .build();

【问题讨论】：

标签： java deeplearning4j dl4j

【解决方案1】：

最简单的方法是在定义网络时使用.setInputType 配置选项。它会为你设置所有必要的预处理器，它也会计算出所有正确的.nIn 值。

再看看这个例子https://github.com/eclipse/deeplearning4j-examples/blob/master/dl4j-examples/src/main/java/org/deeplearning4j/examples/convolution/mnist/MnistClassifier.java#L156

当您使用 .setInputType 设置网络的方式时，您根本不需要设置任何 .nIn 值 - 您仍然可以，正如我链接的示例中所示，但通常存在没有很好的理由这样做。

【讨论】：