Question

我正在执行下面的查询，该查询针对具有数百万条记录的数据库表运行。

用户将文本文件中的数据上传到表格。使用该数据，我需要针对具有数百万条记录的表（company_classification）进行like匹配。

完成大小为1K的输入文件的执行需要将近16分钟。我也试图明确加入，但同样的成本。需要有关优化查询的建议，以便它可以执行2-3分钟。该查询是从前端聚合物网页触发的。

以下是解释计划，并附有svl_query_summary输出。

查询：

select
  count(*)
from company_classification c
join dev.input_data_load d on (c.industry_cls_code like d.industry_code||'%')
where
      d.id = '51993d75f7cf2647647b58250222d954'
  and d.company_indicator = 1

----- Nested Loop Join in the query plan - review the join predicates to avoid Cartesian products -----

AWS Redshift查询优化

0 个答案: