【发布时间】:2012-04-09 12:02:42
【问题描述】:
谁能告诉我如何在带有 Netbeans 的 windows 上使用锅炉管?如果你能给我一些 java 代码,我将不胜感激。
【问题讨论】:
标签: java netbeans web-scraping web-mining boilerpipe
谁能告诉我如何在带有 Netbeans 的 windows 上使用锅炉管?如果你能给我一些 java 代码,我将不胜感激。
【问题讨论】:
标签: java netbeans web-scraping web-mining boilerpipe
尝试查看他们的Wiki 和QuickStart。下面的示例代码...
public static void main(final String[] args) throws Exception {
URL url;
url = new URL("http://www.example.com/some-location/index.html");
// NOTE We ignore HTTP-based character encoding in this demo...
final InputStream urlStream = url.openStream();
final InputSource is = new InputSource(urlStream);
final BoilerpipeSAXInput in = new BoilerpipeSAXInput(is);
final TextDocument doc = in.getTextDocument();
urlStream.close();
// You have the choice between different Extractors
// System.out.println(DefaultExtractor.INSTANCE.getText(doc));
System.out.println(ArticleExtractor.INSTANCE.getText(doc));
}
【讨论】: