我有一个实例化视图,该视图聚合了多个表中的数据,这些表存储了有关应用程序中统计数据的信息。目前,该视图包含约800.000条记录。问题在于此查询运行得很慢(大约1.5秒),无法满足客户的需求。有什么方法可以改善其性能?我正在使用PostgreSQL 9.6。
我尝试创建索引。但这无济于事。
CREATE INDEX t1 ON statistic_basic_view (active, visible, removed, draft, id, name, object_type, is_paid, company_name);
CREATE INDEX t2 ON statistic_basic_view (company_name, is_paid, object_type, name, id, draft, removed, visible, active);
CREATE INDEX t3 ON statistic_basic_view (company_name, is_paid, object_type, name, id);
CREATE INDEX t4 ON statistic_basic_view (draft, removed, visible, active);
CREATE INDEX t5 ON statistic_basic_view (active, visible, removed, draft);
CREATE INDEX t6 ON statistic_basic_view (id, name, object_type, is_paid, company_name);
CREATE INDEX t8 ON statistic_basic_view (active, visible, removed, draft, id, name, object_type, is_paid, company_name);
CREATE INDEX t9 ON statistic_basic_view ((active AND visible AND (NOT removed) AND (NOT draft)));
CREATE INDEX t10 ON statistic_basic_view (((NOT draft) AND (NOT removed) AND active = true AND visible = true));
查询:
SELECT id,
name,
object_type,
is_paid,
company_name,
SUM(CASE
WHEN type = 'COMPARE'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS compare_count,
SUM(CASE
WHEN type = 'EXPORT'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS export_count,
SUM(CASE
WHEN type = 'VIEW'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS view_count,
SUM(CASE
WHEN type = 'REMEMBER'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS remember_count,
SUM(CASE
WHEN type = 'SEARCH'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS search_count,
SUM(CASE
WHEN type = 'MAIL'
AND service_type IN ('GG_WEB') THEN 1
ELSE 0
END) AS mail_count
FROM statistic_basic_view
WHERE active = TRUE
AND visible = TRUE
AND removed = FALSE
AND draft = FALSE
GROUP BY id,
name,
object_type,
is_paid,
company_name
ORDER BY view_count DESC,
id ASC
limit 15;
解释分析:
Limit (cost=74204.47..74204.50 rows=15 width=130) (actual time=1420.542..1420.545 rows=15 loops=1)
-> Sort (cost=74204.47..74600.55 rows=158432 width=130) (actual time=1420.540..1420.542 rows=15 loops=1)
Sort Key: (sum(CASE WHEN ((type = 'VIEW'::text) AND ((service_type)::text = 'GG_WEB'::text)) THEN 1 ELSE 0 END)) DESC, id
Sort Method: top-N heapsort Memory: 28kB
-> HashAggregate (cost=68733.10..70317.43 rows=158432 width=130) (actual time=1420.539..1420.542 rows=8988 loops=1)
Group Key: id, name, object_type, is_paid, company_name
-> Seq Scan on statistic_basic_view (cost=0.00..24950.65 rows=761434 width=94) (actual time=0.023..249.851 rows=762118 loops=1)
Filter: (active AND visible AND (NOT removed) AND (NOT draft))
Rows Removed by Filter: 30047
Planning time: 0.665 ms
Execution time: 1420.545 ms
答案 0 :(得分:1)
没有索引可以帮助您进行此查询。
WHERE
的条件都不是选择性的,不能使用索引来加速具有这么多组的GROUP BY
,并且不能使用索引进行排序(因为存在分组根据之前的不同标准。
您应该做的是在物化视图的顶部(或直接在基表的顶部)创建另一个materialized view,其结果已预先计算并定期刷新。这会为您提供一些过时的数据,但速度很快。