表1:曲目
表2:Wordlist
表3:N:M Track有单词(trackwords)
查找包含所有单词的所有曲目。
目前查询如下:
SELECT DISTINCT t.id FROM track as t
Left Join trackwords as tw ON t.id=tw.trackid
Left Join wordlist as wl on wl.id=tw.wordid
WHERE
wl.trackusecount>0
group by t.id
HAVING SUM(IF(wl.word IN ('folsom','prison','blues'),1,0)) = 3;
根据EXPLAIN的说法,正在使用所有索引:
+----+-------------+-------+--------+-----------------------+---------+---------+----------------+---------+-------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+-------+--------+-----------------------+---------+---------+----------------+---------+-------------+
| 1 | SIMPLE | t | index | PRIMARY | PRIMARY | 4 | NULL | 8194507 | Using index |
| 1 | SIMPLE | tw | ref | wordid,trackid | trackid | 4 | mbdb.t.id | 3 | Using where |
| 1 | SIMPLE | wl | eq_ref | PRIMARY,trackusecount | PRIMARY | 4 | mbdb.tw.wordid | 1 | Using where |
+----+-------------+-------+--------+-----------------------+---------+---------+----------------+---------+-------------+
但查询需要很长时间。 有什么建议可以加快查询速度吗?
答案 0 :(得分:5)
如果您只是在寻找包含所有单词的曲目,那么左连接就没有意义了。我假设(trackid
,wordid
)组合在trackwords
中是唯一的。
SELECT t.id
FROM track as t, trackwords as tw, wordlist as wl
WHERE t.id=tw.trackid
AND wl.id=tw.wordid
AND wl.trackusecount>0 /* not sure what that is - you have it in your query */
AND wl.word in ('folsom','prison','blues')
GROUP by t.id
HAVING count(*) = 3
此查询将受益于wordlist(word),trackwords(trackid,wordid)和track(id)上的索引。
答案 1 :(得分:3)
您的问题集与存储StackOverflow或Del.icio.us等项目的标签非常相似。
文章Tags: Database schemas提出了几个解决方案,其中包括@ ChssPly76的想法。
答案 2 :(得分:0)
如果你把它分成两个查询,可能会更快。首先,单词和跟踪词的连接将为您提供所需的所有跟踪。然后返回轨道表并执行:
WHERE t.id IN(...trackids here...)
但基于上面的查询,你所返回的只是t.id,你已经从tw.trackid获得了。