
[Nutch 2.2.1 Basic Tutorial, Part 1] Nutch-related exceptions

2015-06-16 15:59
1. The following error appears as soon as the job starts, while the URLs are being injected:

InjectorJob: Injecting urlDir: urls
InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=[20140000]inject urls, jobid=job_local1629320149_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
The cause is a misconfigured regex-urlfilter.txt.
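The post does not say which rule was wrong, so the following is only a rough sketch of how such a file could be sanity-checked before running the injector. It assumes the usual rule layout of regex-urlfilter.txt: blank lines and lines starting with `#` are ignored, and every other line is a `+` (accept) or `-` (reject) sign followed by a regular expression. The function name `check_urlfilter_rules` and the sample rules are hypothetical, and Python's `re` module only approximates the Java regex syntax that Nutch actually uses, so a clean result here is not a guarantee.

```python
import re


def check_urlfilter_rules(text):
    """Return (line_number, line, problem) tuples for rules that look malformed."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Blank lines and '#' comments carry no rule, so skip them.
        if not line.strip() or line.startswith("#"):
            continue
        # Assumption: every rule starts with '+' (accept) or '-' (reject).
        if line[0] not in "+-":
            problems.append((number, line, "rule must start with '+' or '-'"))
            continue
        # Python's regex dialect is close to Java's but not identical.
        try:
            re.compile(line[1:])
        except re.error as exc:
            problems.append((number, line, f"invalid regex: {exc}"))
    return problems


# Hypothetical sample content, not taken from the original post.
SAMPLE = "\n".join([
    "# skip file:, ftp: and mailto: urls",
    "",
    "-^(file|ftp|mailto):",
    "+^http://([a-z0-9]*\\.)*example.com/",
    "http://example.com/",       # missing the leading '+' or '-'
    "+^http://example.com/(",    # unbalanced parenthesis
    "+.",
])

if __name__ == "__main__":
    for number, line, problem in check_urlfilter_rules(SAMPLE):
        print(f"line {number}: {problem}: {line}")
```

To check a real installation, read the contents of conf/regex-urlfilter.txt into a string and pass it to the function. Any reported line is a candidate for the misconfiguration described above.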