Windows上的Cygwin上的Nutch 1.12

时间:2017-03-10 11:53:58

标签: nutch

英语不是我的母语;请原谅我的任何错误。

我尝试在Windows上运行Cygwin上的nutch 1.12,然后我跟着Nutch Turorial。但是当我尝试执行命令行" bin / nutch注入crawl / crawldb urls"时,我在Cygwin中得到了这个:

$ bin/nutch inject crawl/crawldb urls
Injector: starting at 2017-03-10 19:29:00
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Injector: java.lang.NullPointerException
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1012)
    at org.apache.hadoop.util.Shell.runCommand(Shell.java:445)
    at org.apache.hadoop.util.Shell.run(Shell.java:418)
    at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
    at org.apache.hadoop.util.Shell.execCommand(Shell.java:739)
    at org.apache.hadoop.util.Shell.execCommand(Shell.java:722)
    at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:633)
    at org.apache.hadoop.fs.FilterFileSystem.setPermission(FilterFileSystem.java:467)
    at org.apache.hadoop.fs.ChecksumFileSystem.create(ChecksumFileSystem.java:456)
    at org.apache.hadoop.fs.ChecksumFileSystem.create(ChecksumFileSystem.java:424)
    at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:906)
    at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:887)
    at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:849)
    at org.apache.hadoop.fs.FileSystem.createNewFile(FileSystem.java:1149)
    at org.apache.nutch.util.LockUtil.createLockFile(LockUtil.java:58)
    at org.apache.nutch.crawl.Injector.inject(Injector.java:357)
    at org.apache.nutch.crawl.Injector.run(Injector.java:467)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.crawl.Injector.main(Injector.java:441)

我尝试了herehere中提到的方法,但要么对我有效。

如何解决此问题并在Windows上运行Nutch?谢谢!

1 个答案:

答案 0 :(得分:0)

我有同样的问题。通过设置Hadoop并在const express = require('express'); const cors = require('cors'); let app = express(); app.use(cors({ origin: true })); 文件夹中包含winutils.exe来解决该问题。如果仍然存在相同的错误,请尝试在提到的文件夹中包含hadoop.dll。