首页 > 代码库 > nutch相关异常

nutch相关异常


1、在任务一开始运行,注入Url时即出现以下错误。

InjectorJob: Injecting urlDir: urls 
InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class. 
InjectorJob: java.lang.RuntimeException: job failed: name=[20140000]inject urls, jobid=job_local1629320149_0001 
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54) 
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233) 
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251) 
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

原因是regex-urlfilter.txt配置错误