无法以正确的格式在Hive表中加载数据

时间:2020-07-23 01:23:30

标签: amazon-web-services hadoop hive hdfs hiveql

嗨,请问我在Hive的外部表中有此create语句,但是我的数据不一致-所以当我运行它时,我得到Null吗?

create external table sampleartistdata(
    artistid int,
    artistname string
) 
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe' 
WITH SERDEPROPERTIES ("field.delim"="# ") 
STORED AS TEXTFILE
    location '/user/users/sampledata/';
select * from sampleartistdata limit 3;

这是数据的样子:

1134999 06Crazy Life
10113088    Terfel, Bartoli- Mozart: Don
6826647 Bodenstandig 3000
10186265    Jota Quest e Ivete Sangalo
6828986 Toto_XX (1977
10236364    U.S Bombs -
1135000 artist formaly know as Mat
10299728    Kassierer - Musik für beide Ohren
10299744    Rahzel, RZA

结果:

sampleartistdata.artistid       sampleartistdata.artistname
NULL    NULL
NULL    NULL
NULL    NULL

1 个答案:

答案 0 :(得分:1)

我能够通过更改行定界符的值来解决它,而不是使用我使用的ROW FORMAT SERDE

drop table sampleartistdata;
create external table sampleartistdata(
    artistid int,
    artistname string
) ROW format DELIMITED
FIELDS TERMINATED BY '\t'
LINES TERMINATED BY '\n'
STORED AS TEXTFILE
    location '/user/jovyan/sampledata/';
相关问题