我正在zeppelin
中使用带有火花解释器的查询运行到创建表。
spark.sql("create table person as select * from TABLE_NAME")
查询运行没有任何错误。
但是当我在该表上运行查询时,出现此错误
spark.sql("select count(*) from person")
: org.apache.spark.SparkException: Job aborted due to stage failure:
Task 30 in stage 80.0 failed 4 times, most recent failure: Lost task 30.3 in stage 80.0 (TID 7772, ip-10-226-34-88.ec2.internal, executor 530):
org.apache.hadoop.hdfs.BlockMissingException: Could not obtain block:
BP-97775991-10.226.34.113-1532420342496:blk_1073799259_58435 file=/user/hive/warehouse/person/part-0001
我该如何解决?
答案 0 :(得分:0)
问题在于表格的格式-默认为“实木复合地板” 尝试将其更改为“ orc”:
spark.sql("select * from TABLE_NAME").write.format("orc").saveAsTable("person")