SQL - Self Join查找两个标签,其中包含与第三个标签

时间:2018-06-03 13:46:00

标签: sql postgresql pgadmin self-join musicbrainz

我正在使用PostgreSQL和Pg​​Admin 4,我正在使用MusicBrainz数据库。 我需要找到从未发布过共同版本的标签对,但他们都发布了带有第三个标签的版本(两者都是相同的标签)。

在数据库中有以下表格: label(id,name ..)id是主键。 release_label(id,release,label)id是主键并标记外键。

我尝试过自我加入,但它无效:

SELECT l1.name as label_1 , l2.name as label_2
FROM release_label as r1 INNER JOIN label as l1 ON r1.label=l1.id, label as l2
INNER JOIN (release_label as r2 LEFT JOIN release_label as r3
ON r3.label=r2.label)ON r2.label=l2.id WHERE r1.release != r2.release 
AND r1.label!= r3.label AND r1.release=r3.release
GROUP BY label_1,label_2 ORDER BY label_1,label_2

感谢您的建议。

1 个答案:

答案 0 :(得分:0)

此查询获取从未发布任何共同点的标签对:

select l1.id as id1, l2.id as id2
from label l1 cross join
     label l2 left join
     release_label rl1 
     on l1.id = rl1.label left join
     release_label rl2
     on l2.id = rl2.label and rl2.release = rl1.release
where rl1.label is null and l1.id < l2.id;

现在,您需要一个已同时发布的第三个标签。 。 。

select ll.*, rl3_1.label as in_common
from (select l1.id as id1, l2.id as id2
      from label l1 cross join
           label l2 left join
           release_label rl1 
           on l1.id = rl1.label left join
           release_label rl2
           on l2.id = rl2.label and rl2.release = rl1.release
      where rl1.label is null and l1.id < l2.id
     ) ll join
     release_label rl1
     on rl1.label = ll.id1 join
     release_label rl2
     on rl2.label = ll.id2 join
     release_label rl3_1
     on rl3_1.release = rl1.release join 
     release_label rl3_2
     on rl3_2.release = rl2.release and
        rl3_2.label = rl3_1.label;

编辑:

另一种方法可能更简单:

select l1.id, l2.id, l3.id as in_common_id
from label l1 join
     label l2
     on l1.id < l2.id join
     label l3
     on l1.id <> l3.id and l2.id <> l3.id
where -- have no releases in common
      not exists (select 1
                  from release_label rl1 join
                       release_label rl2
                       on rl1.release = rl2.release
                  where rl1.label = l1.id and rl2.label = l2.id
                 ) and
      -- l1 has a release with l3
      exists (select 1
              from release_label rl1 join
                   release_label rl3
                   on rl1.release = rl3.release
                  where rl1.label = l1.id and rl3.label = l3.id
             ) and
      -- l2 has a release with l3
      exists (select 1
              from release_label rl2 join
                   release_label rl3
                   on rl2.release = rl3.release
                  where rl2.label = l2.id and rl3.label = l3.id
             );

from子句生成标签的所有候选行程。 exists检查您要检查的三个条件。这是我将使用的版本,因为我认为逻辑更容易理解。

在其中任何一个查询中,您(当然)可以在前两个ID上使用select distinct来获取您要查找的对。