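【Posted】: 2014-07-18 12:17:29
【Problem description】:
I am trying to start my first crawl. I have configured the database settings and run the following command: bin/nutch inject urls
It fails with the following error: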
InjectorJob: starting at 2014-07-18 08:13:34
InjectorJob: Injecting urlDir: urls
InjectorJob: Using class org.apache.gora.sql.store.SqlStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=inject urls, jobid=job_local1172062909_0001
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
Can anyone help me?
【Discussion】:
Tags: java apache nutch web-crawler gora