AWS Athena是否支持序列文件

时间:2018-04-02 22:04:06

标签: amazon-athena

是否有人尝试在Sequence Files上创建AWS Athena Table。根据{{​​3}}看来它是可能的。我能够在create table statement下面执行。

   create external table if not exists sample_sequence (
      account_id string,
      receiver_id string,
      session_index smallint, 
      start_epoch bigint)
STORED AS sequencefile
location 's3://bucket/sequencefile/';

语句执行成功但当我尝试从表中读取数据时,它会抛出错误

Your query has the following error(s):

HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split s3://viewershipforneo4j/2017-09-26/000030_0 (offset=372128055, length=62021342) using org.apache.hadoop.mapred.SequenceFileInputFormat: s3://viewershipforneo4j/2017-09-26/000030_0 not a SequenceFile

This query ran against the "default" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Query Id: 9f0983b0-33da-4686-84a3-91b14a39cd09.

1 个答案:

答案 0 :(得分:0)

序列文件是有效的。这里的问题是没有定义分隔符。

行格式分隔的结尾的字段缺失

如果在您的情况下,如果tab是列分隔符,则行数据在下一行中,它将是

head