Optimizing a SQL query that is too slow even on small data

Time: 2013-05-10 13:08:57

Tags: java mysql sql

Basically, I am trying to get the total count of the matched words for each URL. I have this SQL query:

select w.url, w.word, w.count, (
select sum(w2.count)
from wordcounts w2 where w2.url = w.url and w2.word in ('search', 'more')
) as totalcount
from wordcounts w
where w.word in ('search', 'more')

I am using this query to get results like these:

URL                              |  word  | count | Total Count

http://haacked.com/              | more   | 61    | 62
http://haacked.com/              | search | 1     | 62
http://feeds.haacked.com/haacked | more   | 58    | 59
http://feeds.haacked.com/haacked | search | 1     | 59
http://www.asp.net/privacy       | more   | 7     | 13
http://www.asp.net/privacy       | search | 6     | 13

My original table structure is:

ID | URL  |  word  | count
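For anyone who wants to reproduce the problem, here is a minimal sketch of a table with that structure, filled with the rows from the sample output above. The column types are assumptions; the question only gives the column names.

```sql
-- Column types and sizes are assumptions; only the column names
-- (ID, URL, word, count) come from the question.
create table wordcounts (
    id      int unsigned not null auto_increment primary key,
    url     varchar(255) not null,
    word    varchar(64)  not null,
    `count` int          not null
);

-- Sample rows taken from the result set shown in the question.
insert into wordcounts (url, word, `count`) values
    ('http://haacked.com/',              'more',   61),
    ('http://haacked.com/',              'search',  1),
    ('http://feeds.haacked.com/haacked', 'more',   58),
    ('http://feeds.haacked.com/haacked', 'search',  1),
    ('http://www.asp.net/privacy',       'more',    7),
    ('http://www.asp.net/privacy',       'search',  6);
```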

The problem is that this small query takes far too long: more than 7 seconds on a few thousand rows. How can I optimize it?

I got this syntax from another website, but it gives an error:

select id, url, word, count, 
sum(count) over(partition by url) as count_sum
from wordcounts where word in ('search', 'more') order by url

Error code 1064, SQL state 42000: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near '(partition by url) as count_sum
from wordcounts where word in ('search', 'more')' at line 2
Line 1, column 1

Execution finished after 0 s, 1 error(s) occurred.

3 Answers:

Answer 0: (score: 3)

Pre-aggregate:

select w.url, w.word, w.`count`, w3.totalcount
from wordcounts w
join (
     select w2.url, sum(w2.`count`) totalcount
     from wordcounts w2
     where w2.word in ('search', 'more')
     group by w2.url) w3 on w3.url = w.url
where w.word in ('search', 'more')

Answer 1: (score: 1)

Use a JOIN instead of a subquery:

select w.url, w.word, w.count, sum(w2.count) as totalcount 
from wordcounts w
left join wordcounts w2  
  on w2.url = w.url and w2.word in ('search', 'more')
where w.word in ('search', 'more')
group by w.url, w.word, w.count

Answer 2: (score: 1)

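Your original query is slow in MySQL because MySQL executes the subquery once for every row of the result set. You can fix this by doing the aggregation once and joining the result back in: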

select w.url, w.word, w.count, wsum.sumcount
from wordcounts w join
     (select w2.url, SUM(w2.count) as sumcount
      from wordcounts w2
      where w2.word in ('search', 'more')
      group by w2.url
     ) wsum
     on wsum.url = w.url
where w.word in ('search', 'more')
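One way to check the per-row execution described above is to run the original query through MySQL's EXPLAIN. This is only a sketch: the exact plan depends on the server version and on which indexes exist, neither of which the question states.

```sql
-- EXPLAIN reports the plan without running the query. For a correlated
-- subquery like this one, the select_type column typically shows
-- DEPENDENT SUBQUERY, meaning the inner select is re-evaluated for
-- each row of the outer query.
explain
select w.url, w.word, w.`count`, (
    select sum(w2.`count`)
    from wordcounts w2
    where w2.url = w.url and w2.word in ('search', 'more')
) as totalcount
from wordcounts w
where w.word in ('search', 'more');
```

Running EXPLAIN on the join version as well makes it possible to compare the two plans side by side.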

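Other databases support a class of functions called window functions, which make this easier. MySQL does not support them at the time of writing.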
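For readers on a newer server: window functions were added in MySQL 8.0, so the syntax the question tried should be accepted there. A minimal sketch, assuming MySQL 8.0 or later and the same wordcounts table:

```sql
-- Assumes MySQL 8.0 or later; older versions reject OVER (...) with
-- error 1064, as shown in the question.
select w.id,
       w.url,
       w.word,
       w.`count`,
       -- The WHERE clause is applied before the window is computed, so the
       -- sum only covers the 'search' and 'more' rows of each URL.
       sum(w.`count`) over (partition by w.url) as count_sum
from wordcounts w
where w.word in ('search', 'more')
order by w.url;
```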