在此比例因子上查询耗尽的资源

时间:2019-08-09 05:37:09

标签: amazon-athena

我正在尝试将一个非常大的表(52个百万行)连接到一个具有11,553,668,111个观测值但只有两列的大型表中

简单的左联接命令出现错误,“以此比例系数查询耗尽的资源”。

-- create smaller table to save $$
CREATE TABLE targetsmart_idl_data_mi_pa_maid AS 
SELECT targetsmart_idl_data_pa_mi_pa.idl,  targetsmart_idl_data_pa_mi_pa.grouping_indicator, targetsmart_idl_data_pa_mi_pa.vb_voterbase_dob, targetsmart_idl_data_pa_mi_pa.vb_voterbase_gender, targetsmart_idl_data_pa_mi_pa.ts_tsmart_urbanicity, targetsmart_idl_data_pa_mi_pa.ts_tsmart_high_school_only_score,
targetsmart_idl_data_pa_mi_pa.ts_tsmart_college_graduate_score, targetsmart_idl_data_pa_mi_pa.ts_tsmart_partisan_score, targetsmart_idl_data_pa_mi_pa.ts_tsmart_presidential_general_turnout_score, targetsmart_idl_data_pa_mi_pa.vb_voterbase_marital_status, targetsmart_idl_data_pa_mi_pa.vb_tsmart_census_id, 
targetsmart_idl_data_pa_mi_pa.vb_voterbase_deceased_flag,  idl_maid_base.maid
FROM targetsmart_idl_data_pa_mi_pa
LEFT JOIN idl_maid_base
    ON targetsmart_idl_data_pa_mi_pa.idl = idl_maid_base.idl

1 个答案:

答案 0 :(得分:0)

我可以通过将大桌子作为驾驶台来解决这个问题

例如。

select col1, col2 from table a join table b on a.col1 =b.col1

表a很小,少于1000条记录,而表b则有数百万条记录。上面的查询错误出来了

将查询重写为

select col1, col2 from table b join table a on a.col1 =b.col1