将Json Flat文件从本地复制到HDFS

时间:2017-10-28 12:52:41

标签: java hadoop hdfs

package com.Main;

import java.io.BufferedInputStream;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.URI;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;



public class Main {

    public static void main(String[] args) throws IOException  {

        //Source file in the local file system
        String localSrc = args[0];

        //Destination file in HDFS
        String dst = args[1];

        //Input stream for the file in local file system to be written to HDFS
        InputStream in = new BufferedInputStream(new FileInputStream(localSrc));

        //Get configimport org.apache.commons.configuration.Configuration;uration of Hadoop system
        Configuration conf = new Configuration();
        System.out.println("Connecting to -- "+conf.get("fs.defaultFS"));

        //Destination file in HDFS
        FileSystem fs = FileSystem.get(URI.create(dst), conf);
        OutputStream out = fs.create(new Path(dst)); 

        //Copy file from local to HDFS
        IOUtils.copyBytes(in, out, 4096, true);

        System.out.println(dst + " copied to HDFS");
    }

}

AM收到以下错误消息" 线程中的异常" main" java.lang.ArrayIndexOutOfBoundsException:0     在com.Main.Main.main(Main.java:22)"

我的本​​地有Json文件,必须在HDFS中移动它 的实施例 {"德尔":" Ef77xvP""时间":1509073785106} {"德尔":" 2YXsF7r""时间":1509073795109}

1 个答案:

答案 0 :(得分:0)

指定程序的命令行参数。您的代码片段要求第一个参数是源,下一个参数是目标。 有关详细信息,请参阅What is "String args[]"? parameter in main method Java