在ORC文件中的行中使用逗号(,)作为分隔符

时间:2016-04-15 06:01:12

标签: hadoop orc

我正在用Java创建一个ORC文件。对于每一行,我希望字段以逗号分隔。这是我的java代码:

    ObjectInspector inspector = ObjectInspectorFactory.getReflectionObjectInspector(String.class,ObjectInspectorFactory.ObjectInspectorOptions.JAVA);
this.mWriter = OrcFile.createWriter(fs, fsPath, config, inspector, stripSize, CompressionKind.ZLIB, bufferSize,0);
this.mWriter.addRow(new Text("shekhar,saha")); 
this.mWriter.addRow(new Text("ram,shyam"));
this.mWriter.addRow(new Text("jhon,cena"));
this.mWriter.close();

这是创造它的正确方法吗?

我正在尝试在Hive表中加载数据。这就是我创建表格的方式:

create table demo ( name1 STRING,name2 STRING ) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' STORED AS ORC tblproperties ("orc.compress"="ZLIB");

但我无法加载数据。当我从表中读取数据时,其抛出类等级异常org.apache.hadoop.hive.ql.io.orc.OrcStruct cannot be cast to org.apache.hadoop.io.Text

0 个答案:

没有答案