How to create an input file for a MapReduce wordcount program

Time: 2015-03-04 07:24:33

Tags: bigdata

What do the following lines mean?

/user/joe/wordcount/input - input directory in HDFS
/user/joe/wordcount/output - output directory in HDFS

1 answer:

Answer 0 (score: 0):

You can submit the MapReduce application jar to the cluster with the following command, where both paths are HDFS paths:

hadoop jar yourapplication.jar driver-class <input-path> <output-path>

First, create a simple text file with some content, then copy it to HDFS with the following command:

hadoop fs -put your-local-path/input hdfs-path/
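
As a minimal sketch of that step: the local file name `input.txt` and its contents are assumptions chosen for illustration, and the HDFS directory is the one from the question. The `-p` flag on `-mkdir` assumes Hadoop 2.x or later. The HDFS commands run only if `hadoop` is on the PATH.

```shell
#!/bin/sh
# Create a small local text file to use as word-count input.
# The name input.txt and the sample words are illustrative assumptions.
printf 'hello world\nhello hadoop\n' > input.txt

# Upload only when a Hadoop client is available on this machine.
if command -v hadoop >/dev/null 2>&1; then
    # Create the HDFS input directory (-p also creates missing parents).
    hadoop fs -mkdir -p /user/joe/wordcount/input
    # Copy the local file into the HDFS input directory.
    hadoop fs -put input.txt /user/joe/wordcount/input/
    # List the directory to confirm the upload.
    hadoop fs -ls /user/joe/wordcount/input
fi
```

Putting the file inside a directory lets you add more input files later without changing the job arguments.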

/user/joe/wordcount/input is the input path in HDFS (a single file, or a directory of files).

/user/joe/wordcount/output is the HDFS output directory where the MapReduce result is stored.

In your case, you can run it like this:

hadoop jar yourapplication.jar driver-class /user/joe/wordcount/input /user/joe/wordcount/output
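
A hedged sketch of the full run, keeping the placeholder jar and driver class names used in this answer (substitute your own). It assumes Hadoop 2.x or later for the `-rm -r -f` flags, and that the results land in files whose names start with `part-`, which is the default for file-based output formats. A job using the standard file output format fails if the output directory already exists, so the sketch removes it first. The commands run only if `hadoop` is on the PATH.

```shell
#!/bin/sh
# HDFS paths taken from the question.
IN=/user/joe/wordcount/input
OUT=/user/joe/wordcount/output

if command -v hadoop >/dev/null 2>&1; then
    # Remove any previous output directory (this deletes earlier results).
    hadoop fs -rm -r -f "$OUT"
    # Submit the job; yourapplication.jar and driver-class are placeholders.
    hadoop jar yourapplication.jar driver-class "$IN" "$OUT"
    # Print the word counts; the glob is quoted so HDFS expands it, not the local shell.
    hadoop fs -cat "$OUT/part-*"
fi
```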