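Merging repeated identical subqueries

Time: 2018-12-28 03:54:46

Tags: sql subquery query-optimization greenplum

How can I remove or merge the three identical subqueries in the query below?

Use case details: I'm working with a Projects table that lists several million projects. Each project record has an owner, a creator, and an editor, each stored as a system ID. I want to replace these system IDs with the corresponding names held in the Employees table. Cross-referencing system IDs with names requires another table, Users, through its hr_id field.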

Projects:
--prj-- --name-- -owner- -creator- -editor- --many more columns...
 001     alpha    001Z     300Z     304Z       ...
 002     beta     020Z     350Z     600Z       ...
 003     charlie  600Z     020Z     001Z       ...


Employees:                       Users:               
--hr_id-- --name--                 -hr_id- -sys_id-
 A01    john                      A01     001Z
 A02    susan                     A02     020Z
 A03    ryan                      A03     300Z
 A04    kelly                     A04     304Z
 A05    matt                      A05     350Z
 A06    bert                      A06     600Z

Desired output:
--prj-- --name-- -owner- -creator- -editor- --adt'l cols...
 001     alpha    john     ryan     kelly    ...
 002     beta     susan    matt     bert     ...
 003     charlie  bert     susan    john     ...

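Below is my code, including some unrelated joins that have to stay. The query returns the right results but is inefficient, so any advice is welcome. Also (from my Googling, I gather this matters), I'm working in an environment that isn't CTE-friendly.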

SELECT projects.prj As project_id,
       projects.name As project_name,
       owner.name As owner_name,
       creator.name As creator_name,
       editor.name As editor_name,
       stats.stat1 As stat_1,
       actuals.stat2 As stat_2
FROM "dbconnect"."projects" As projects
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS owner ON projects.owner = owner.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS creator ON projects.creator = creator.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS editor ON projects.editor = editor.sys_id
  LEFT JOIN "dbconnect"."prjstats" As stats ON projects.prj = stats.prj_id
  LEFT JOIN "dbconnect"."prjactuals" As actuals ON projects.prj = actuals.prj_id

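2 answers:

Answer 0 (score: 1)

You can create a scalar-valued function for the subquery and then rewrite the query like this. The example uses T-SQL (SQL Server) syntax.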

CREATE FUNCTION dbo.getName (@id varchar(30))  
RETURNS varchar(128) 
AS  
BEGIN  
     DECLARE @v_name varchar(128) 
     SELECT @v_name=emps.name 
     FROM "dbconnect"."employees" AS emps
     RIGHT JOIN "dbconnect"."users" AS users ON emps.hr_id = users.hr_id
     WHERE users.sys_id=@id

     RETURN @v_name
END

--Query
SELECT projects.prj As project_id,
       projects.name As project_name,
       /*
       owner.name As owner_name,
       creator.name As creator_name,
       editor.name As editor_name,
       */
       dbo.getName(projects.owner) as owner_name,
       dbo.getName(projects.creator) as creator_name,
       dbo.getName(projects.editor) as editor_name,
       stats.stat1 As stat_1,
       actuals.stat2 As stat_2
FROM "dbconnect"."projects" As projects
/*
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS owner ON projects.owner = owner.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS creator ON projects.creator = creator.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS editor ON projects.editor = editor.sys_id
*/
  LEFT JOIN "dbconnect"."prjstats" As stats ON projects.prj = stats.prj_id
  LEFT JOIN "dbconnect"."prjactuals" As actuals ON projects.prj = actuals.prj_id
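
The function above is written in T-SQL, while the question is tagged greenplum, which follows PostgreSQL syntax. A rough PostgreSQL-style equivalent is sketched here. The function name get_name is hypothetical, the name column is assumed to be a varchar-compatible type, and the parameter is referenced as $1 so the body also parses on older PostgreSQL-based engines. Whether Greenplum allows a table-reading function to be called once per row of a distributed table depends on the version and on how the tables are distributed, so this is something to test rather than a guaranteed fix.

```sql
-- Hypothetical helper: resolves a system ID to an employee name.
-- Returns NULL when the system ID is unknown or has no employee row.
CREATE FUNCTION get_name(varchar) RETURNS varchar AS $$
    SELECT e.name::varchar
    FROM "dbconnect"."users" AS u
    LEFT JOIN "dbconnect"."employees" AS e ON e.hr_id = u.hr_id
    WHERE u.sys_id = $1
    LIMIT 1;
$$ LANGUAGE sql STABLE;

-- The three lookups now share one definition.
SELECT p.prj               AS project_id,
       p.name              AS project_name,
       get_name(p.owner)   AS owner_name,
       get_name(p.creator) AS creator_name,
       get_name(p.editor)  AS editor_name
FROM "dbconnect"."projects" AS p;
```

Per-row function calls tend to be slower than set-based joins on a table with millions of rows, so this mainly removes the repetition in the query text.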

Alternatively, you can create a table-valued function and join to it with the APPLY operator.

CREATE FUNCTION dbo.getName (@id varchar(30))  
RETURNS TABLE
AS  
RETURN
( 
     SELECT emps.name
     FROM "dbconnect"."employees" AS emps
     RIGHT JOIN "dbconnect"."users" AS users ON emps.hr_id = users.hr_id
     WHERE users.sys_id=@id
)

--Query
SELECT projects.prj As project_id,
       projects.name As project_name,
       owner.name As owner_name,
       creator.name As creator_name,
       editor.name As editor_name,
       stats.stat1 As stat_1,
       actuals.stat2 As stat_2
FROM "dbconnect"."projects" As projects
OUTER APPLY dbo.getName(projects.owner) as owner
OUTER APPLY dbo.getName(projects.creator) as creator
OUTER APPLY dbo.getName(projects.editor) as editor
/*
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS owner ON projects.owner = owner.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS creator ON projects.creator = creator.sys_id
  LEFT JOIN (
             SELECT emps.name,
                    users.hr_id,
                    users.sys_id
             FROM "dbconnect"."employees" AS emps
             RIGHT JOIN "dbconnect"."users" AS users
               ON emps.hr_id = users.hr_id
            ) AS editor ON projects.editor = editor.sys_id
*/
  LEFT JOIN "dbconnect"."prjstats" As stats ON projects.prj = stats.prj_id
  LEFT JOIN "dbconnect"."prjactuals" As actuals ON projects.prj = actuals.prj_id
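
OUTER APPLY is SQL Server syntax. In PostgreSQL 9.3 and later, the closest counterpart is LEFT JOIN LATERAL ... ON true. The sketch below assumes the engine supports LATERAL (support in Greenplum may vary by version) and uses a hypothetical set-returning function, get_name_rows, that yields zero or one row per system ID.

```sql
-- Hypothetical set-returning helper: zero or one row per system ID.
CREATE FUNCTION get_name_rows(varchar)
RETURNS TABLE (name varchar) AS $$
    SELECT e.name::varchar
    FROM "dbconnect"."users" AS u
    LEFT JOIN "dbconnect"."employees" AS e ON e.hr_id = u.hr_id
    WHERE u.sys_id = $1;
$$ LANGUAGE sql STABLE;

-- LEFT JOIN LATERAL ... ON true keeps projects whose lookup returns no row,
-- which mirrors the behavior of OUTER APPLY.
SELECT p.prj   AS project_id,
       p.name  AS project_name,
       o.name  AS owner_name,
       c.name  AS creator_name,
       ed.name AS editor_name
FROM "dbconnect"."projects" AS p
LEFT JOIN LATERAL get_name_rows(p.owner)   AS o(name)  ON true
LEFT JOIN LATERAL get_name_rows(p.creator) AS c(name)  ON true
LEFT JOIN LATERAL get_name_rows(p.editor)  AS ed(name) ON true;
```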

Answer 1 (score: 1)

Just use a CTE. I prefer left joins, so I would write it like this:

WITH eu as (
      SELECT e.name, u.hr_id, u.sys_id
      FROM "dbconnect"."users" u LEFT JOIN
           "dbconnect"."employees" e
           ON e.hr_id = u.hr_id
     )       
SELECT p.prj As project_id, p.name As project_name,
       euo.name As owner_name, euc.name As creator_name,
       eue.name As editor_name,
       ps.stat1 As stat_1,
       pa.stat2 As stat_2
FROM "dbconnect"."projects" p LEFT JOIN
     eu euo 
     ON p.owner = euo.sys_id LEFT JOIN
     eu euc
     ON p.creator = euc.sys_id LEFT JOIN
     eu eue
     ON p.editor = eue.sys_id LEFT JOIN
     "dbconnect"."prjstats" ps
     ON p.prj = ps.prj_id LEFT JOIN
     "dbconnect"."prjactuals" pa
     ON p.prj = pa.prj_id;
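
If CTEs really are unavailable, as the question suggests, a view can hold the same definition once and be joined three times. The sketch below assumes you are allowed to create a view in the dbconnect schema; the view name emp_users is hypothetical, and the remaining joins follow the query above.

```sql
-- Hypothetical view: one row per user, with the employee name if one exists.
CREATE VIEW "dbconnect"."emp_users" AS
SELECT e.name, u.hr_id, u.sys_id
FROM "dbconnect"."users" AS u
LEFT JOIN "dbconnect"."employees" AS e ON e.hr_id = u.hr_id;

-- Same shape as the CTE version, with the view standing in for eu.
SELECT p.prj    AS project_id,
       p.name   AS project_name,
       euo.name AS owner_name,
       euc.name AS creator_name,
       eue.name AS editor_name,
       ps.stat1 AS stat_1,
       pa.stat2 AS stat_2
FROM "dbconnect"."projects" AS p
LEFT JOIN "dbconnect"."emp_users"  AS euo ON p.owner   = euo.sys_id
LEFT JOIN "dbconnect"."emp_users"  AS euc ON p.creator = euc.sys_id
LEFT JOIN "dbconnect"."emp_users"  AS eue ON p.editor  = eue.sys_id
LEFT JOIN "dbconnect"."prjstats"   AS ps  ON p.prj = ps.prj_id
LEFT JOIN "dbconnect"."prjactuals" AS pa  ON p.prj = pa.prj_id;
```

Like the CTE, the view removes the textual duplication only. The query still needs three separate lookups, one for each of the owner, creator, and editor columns, so any speedup would have to come from elsewhere, for example from how the tables are indexed or distributed.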