【发布时间】:2015-07-27 05:13:01
【问题描述】:
我想画最好的png文件的前100个字符,但是如果不能全部画出来。
文件在那里:http://abatis.org.uk/projects/txt2fig.png
File fff = new File("C:\\Users\\lll\\Desktop\\txt2fig.png");
OCRScanner scanner = new OCRScanner();
TrainingImageLoader loader = new TrainingImageLoader();
HashMap<Character, ArrayList<TrainingImage>> trainingImageMap = new HashMap<Character, ArrayList<TrainingImage>>();
loader.load(fff.getAbsolutePath(), new CharacterRange('A', 'Z'), trainingImageMap);
scanner.addTrainingImages(trainingImageMap);
Image image = ImageIO.read(fff);
PixelImage pixelImage = new PixelImage(image);
pixelImage.toGrayScale(true);
pixelImage.filter();
String text = scanner.scan(image, 0, 0, 0, 0, null);
System.out.println(text);
例外:
java.io.IOException: Expected to decode 26 characters but actually decoded 911 characters in training: C:\Users\lll\Desktop\txt2fig.png
at net.sourceforge.javaocr.ocrPlugins.mseOCR.TrainingImageLoader.load(TrainingImageLoader.java:107)
at net.sourceforge.javaocr.ocrPlugins.mseOCR.TrainingImageLoader.load(TrainingImageLoader.java:83)
我在 pom 中的库:
<dependency>
<groupId>net.sourceforge.javaocr</groupId>
<artifactId>javaocr-core</artifactId>
<version>1.0</version>
</dependency>
<dependency>
<groupId>net.sourceforge.javaocr.plugins</groupId>
<artifactId>javaocr-plugin-awt</artifactId>
<version>1.0</version>
</dependency>
我知道:
new CharacterRange ('A', 'Z')
应该包括文件中的第一个和最后一个字符,它可以以某种方式绕过?
【问题讨论】: