Answers: Binarisation of image for OCR

【Question title】: Binarisation of image for OCR
【Posted】: 2013-05-10 14:55:09
【Question description】:

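I want to convert scanned images to black and white. The goal is to reduce the file size before the images are sent over the internet for OCR.

The normal binarisation (black-and-white conversion) done by the scanner or by general-purpose image editing software gives poor results.

It leaves a lot of random black pixels that are really just noise from the binarisation. As a result, the OCR tries to recognise characters where there are none, or inserts full stops, colons and the like after characters.

What can I use in OpenCV to binarise the image so that lines, characters and dark areas stay solid while pixel noise in the white areas is reduced?

I have played with cvThreshold and cvAdaptiveThreshold, but the results are not very good yet.

For example, see the original image and the desired result.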

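【Question comments】:

Your example appears to be ternary: I see at least one shade of grey besides black and white.
@MarkRansom When I went back and looked at the image in IrfanView, I thought you were right and that I must have saved the wrong black-and-white image. However, when I view the image in Gimp, the pixels are only black and white. What did you use to view the image? For my part, I trust Gimp over IrfanView.
I viewed it in Chrome. It looks fine in Firefox today, so I don't know what happened.

Tags: opencv image-processing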

【Solution 1】:

You can try this, but you will still need to tune some of the parameters.

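#include <opencv2/opencv.hpp>
using namespace cv;

#define ALPHA_SCALE 2
#define THRESHOLD_VAL 40
#define MAX_VAL_FOR_THRESHOLD 250
#define PIXEL_MISMATCH_COUNT 10 // 9, 7 (not used in this snippet)

int main()
{
    // Load the scan and show it.
    Mat current_frame_t1 = imread("Original.tiff", IMREAD_UNCHANGED);
    if (current_frame_t1.empty())
        return 1; // file missing or unreadable
    namedWindow("My_Win", WINDOW_AUTOSIZE);
    imshow("My_Win", current_frame_t1);
    waitKey(10);

    // Convert to greyscale (imread returns colour images in BGR order).
    Mat current_frame_t2;
    if (current_frame_t1.channels() == 1)
        current_frame_t2 = current_frame_t1.clone();
    else
        cvtColor(current_frame_t1, current_frame_t2,
                 current_frame_t1.channels() == 4 ? COLOR_BGRA2GRAY : COLOR_BGR2GRAY);
    imshow("My_Win", current_frame_t2);
    waitKey(10);

    // Stretch the contrast: equalise the histogram, then scale the intensities.
    // equalizeHist expects an 8-bit single-channel image.
    equalizeHist(current_frame_t2, current_frame_t1);
    convertScaleAbs(current_frame_t1, current_frame_t2, ALPHA_SCALE);

    // Global threshold: pixels above THRESHOLD_VAL become MAX_VAL_FOR_THRESHOLD, the rest 0.
    threshold(current_frame_t2, current_frame_t1, THRESHOLD_VAL, MAX_VAL_FOR_THRESHOLD, THRESH_BINARY);

    // Median blur to remove isolated specks. A kernel size of 1 does no smoothing;
    // try 3 or 5 to actually filter.
    medianBlur(current_frame_t1, current_frame_t2, 1);
    imshow("My_Win", current_frame_t2);
    imwrite("outimg.tiff", current_frame_t2);
    waitKey(0);
    return 0;
}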
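
The last processing step in the snippet above is a median blur. To illustrate why a median filter can suppress the isolated black pixels described in the question, here is a minimal, library-free sketch. It is not part of the original answer: the `Image` alias and the `median3x3` helper are hypothetical names, and the 3x3 window is an assumption (the snippet above passes a kernel size of 1).

```cpp
#include <algorithm>
#include <array>
#include <cstdint>
#include <vector>

// Greyscale image stored as rows of 8-bit pixels (0 = black, 255 = white).
using Image = std::vector<std::vector<std::uint8_t>>;

// Hypothetical helper: 3x3 median filter.
// Border pixels are copied unchanged to keep the sketch short.
Image median3x3(const Image& src)
{
    Image dst = src;
    const int rows = static_cast<int>(src.size());
    const int cols = rows > 0 ? static_cast<int>(src[0].size()) : 0;

    for (int y = 1; y + 1 < rows; ++y) {
        for (int x = 1; x + 1 < cols; ++x) {
            // Collect the 3x3 neighbourhood around (y, x).
            std::array<std::uint8_t, 9> window{};
            int k = 0;
            for (int dy = -1; dy <= 1; ++dy)
                for (int dx = -1; dx <= 1; ++dx)
                    window[k++] = src[y + dy][x + dx];

            // The median is the 5th smallest of the 9 values.
            std::nth_element(window.begin(), window.begin() + 4, window.end());
            dst[y][x] = window[4];
        }
    }
    return dst;
}
```

A single black pixel surrounded by white is outvoted by its eight white neighbours and disappears, while a stroke that is three pixels wide keeps its interior. Thin strokes of one or two pixels can be eroded as well, so the window size is one of the parameters that needs tuning.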

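【Comments】:

【Solution 2】:

You can use a connected-components labeling algorithm and remove the components that do not fill a reasonable number of pixels in the image.

A very simple way to implement this in OpenCV is to use contours: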

1. Do a preliminary binarization of the image, which will give you a very noisy output.
2. Find all contours in that noisy image.
3. For each contour found:
  3.1. Fill the contour with a color different from the two colors of the binarized image.
  3.2. Count the number of pixels filled with that color.
  3.3. If the number of pixels is smaller than a given threshold, fill the contour with the background color of the binary image.

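For reference: cv::findContours and cv::drawContours.

The loop can be optimized by labeling several contours at once in step 3.1 and then counting the pixels of all those colors in a single pass in step 3.2. I did not give the optimized version because you may have more than 253 distinct groups (255 colors minus the 2 default colors of the binary image), and handling that is not so simple.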
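
The idea behind the steps above (find each connected group of ink pixels, count its pixels, erase it if it is too small) can be sketched without any library. This is not the answerer's code and it does not use cv::findContours. `Image` and `removeSmallComponents` are hypothetical names, and the sketch assumes ink is 0, paper is 255, and pixels are 8-connected. Because it collects each group in a list instead of painting it with a color, the 253-group limit mentioned above does not apply.

```cpp
#include <cstddef>
#include <cstdint>
#include <queue>
#include <utility>
#include <vector>

// Binary image stored as rows of 8-bit pixels (0 = ink, 255 = paper).
using Image = std::vector<std::vector<std::uint8_t>>;

// Hypothetical helper (not an OpenCV function): erase every 8-connected
// group of ink pixels that contains fewer than minPixels pixels.
Image removeSmallComponents(const Image& binary, std::size_t minPixels)
{
    Image out = binary;
    const int rows = static_cast<int>(binary.size());
    const int cols = rows > 0 ? static_cast<int>(binary[0].size()) : 0;
    std::vector<std::vector<bool>> visited(rows, std::vector<bool>(cols, false));

    for (int y0 = 0; y0 < rows; ++y0) {
        for (int x0 = 0; x0 < cols; ++x0) {
            if (binary[y0][x0] != 0 || visited[y0][x0])
                continue; // paper, or ink that already belongs to a group

            // Breadth-first flood fill collects one connected component.
            std::vector<std::pair<int, int>> component;
            std::queue<std::pair<int, int>> pending;
            pending.push(std::make_pair(y0, x0));
            visited[y0][x0] = true;

            while (!pending.empty()) {
                const std::pair<int, int> p = pending.front();
                pending.pop();
                component.push_back(p);

                for (int dy = -1; dy <= 1; ++dy) {
                    for (int dx = -1; dx <= 1; ++dx) {
                        const int ny = p.first + dy;
                        const int nx = p.second + dx;
                        if (ny < 0 || ny >= rows || nx < 0 || nx >= cols)
                            continue;
                        if (binary[ny][nx] != 0 || visited[ny][nx])
                            continue;
                        visited[ny][nx] = true;
                        pending.push(std::make_pair(ny, nx));
                    }
                }
            }

            // Too few pixels: treat the group as noise and paint it as paper.
            if (component.size() < minPixels)
                for (const auto& q : component)
                    out[q.first][q.second] = 255;
        }
    }
    return out;
}
```

For example, calling `removeSmallComponents(image, 4)` on a page containing a lone speck and a 2x3 block of ink erases the speck and keeps the block. The threshold has to stay below the size of the smallest real mark (the dot of an "i", a full stop), otherwise punctuation is erased too. With OpenCV 3.0 or later, cv::connectedComponentsWithStats reports a label and an area per component and may be a more direct route than contours.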

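【Comments】:

As a newcomer to computer vision and OpenCV, I would have to spend a lot of time studying your answer to come up with my own implementation. Could you provide a small code snippet?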