这是一场长达19个小时的噩梦。
我有一个非常大的查询,实际上需要在几个表中连接大型数据集。在我进行连接之后,我需要使用select语句中的数据更新原始表。 SELECT语句超快,UPDATE语句超级慢。
这是select语句。
SELECT l.col1,
l.col2,
l.col3,
p.personid
FROM table1 p
LEFT JOIN table2 l ON (l.col1 = p.col1)
LEFT JOIN
(SELECT name,
col AS 'col2'
FROM tbl3 f
WHERE f.col LIKE '%-F') pcf ON (pcf.col1 = p.col1)
LEFT JOIN
(SELECT name,
col AS 'col3'
FROM tbl4 f
WHERE f.col LIKE '%-M') pcm ON (pcm.col1 = p.col1)
WHERE p.requestid = '1928'
现在,如果我使用EXACT SAME系列JOIN并将其置于UPDATE上下文中,则查询将永远存在。
UPDATE table1 p
LEFT JOIN table2 l ON (l.col1 = p.col1)
LEFT JOIN
(SELECT name,
col AS 'col2'
FROM tbl3 f
WHERE f.col LIKE '%-F') pcf ON (pcf.col1 = p.col1)
LEFT JOIN
(SELECT name,
col AS 'col3'
FROM tbl4 f
WHERE f.col LIKE '%-M') pcm ON (pcm.col1 = p.col1)
SET p.col1 = l.col1,
p.col2 = l.col2,
p.col3 = l.col3
WHERE p.requestid = '1928'
那么......为什么UPDATE JOIN语句比SELECT JOIN语句花费的时间长得多? Oodles的时间更长。我已经尝试过临时表,但它没有用。
仅供参考,我正在使用50k或更多记录的表格。
如果您对EXPLAIN结果感到好奇,那么当我解析select查询时会发生这种情况(虽然显然你不能使用explain进行更新?)
id select_type table type possible_keys key key_len ref rows Extra
1 PRIMARY p ALL NULL NULL NULL NULL 613246 Using where
1 PRIMARY l eq_ref PRIMARY,name_3,name,name_2 PRIMARY 257 drilldev_db.p.lastname 1
1 PRIMARY <derived2> ALL NULL NULL NULL NULL 23435
1 PRIMARY <derived3> ALL NULL NULL NULL NULL 13610
1 PRIMARY <derived4> ALL NULL NULL NULL NULL 13053
1 PRIMARY <derived5> ALL NULL NULL NULL NULL 8273
1 PRIMARY <derived6> ALL NULL NULL NULL NULL 11481
1 PRIMARY <derived7> ALL NULL NULL NULL NULL 6708
1 PRIMARY <derived8> ALL NULL NULL NULL NULL 9588
1 PRIMARY <derived9> ALL NULL NULL NULL NULL 5494
1 PRIMARY <derived10> ALL NULL NULL NULL NULL 6981
1 PRIMARY <derived11> ALL NULL NULL NULL NULL 4107
1 PRIMARY <derived12> ALL NULL NULL NULL NULL 5973
1 PRIMARY <derived13> ALL NULL NULL NULL NULL 3851
1 PRIMARY <derived14> ALL NULL NULL NULL NULL 4935
1 PRIMARY <derived15> ALL NULL NULL NULL NULL 3574
1 PRIMARY <derived16> ALL NULL NULL NULL NULL 5793
1 PRIMARY <derived17> ALL NULL NULL NULL NULL 4706
17 DERIVED f ref year,gender gender 257 364263 Using where; Using temporary; Using filesort
16 DERIVED f ref year,gender gender 257 397322 Using where; Using temporary; Using filesort
15 DERIVED f range year,gender year 4 NULL 54092 Using where; Using temporary; Using filesort
14 DERIVED f range year,gender year 4 NULL 54092 Using where; Using temporary; Using filesort
13 DERIVED f range year,gender year 4 NULL 62494 Using where; Using temporary; Using filesort
12 DERIVED f range year,gender year 4 NULL 62494 Using where; Using temporary; Using filesort
11 DERIVED f range year,gender year 4 NULL 69317 Using where; Using temporary; Using filesort
10 DERIVED f range year,gender year 4 NULL 69317 Using where; Using temporary; Using filesort
9 DERIVED f ref year,gender gender 257 364263 Using where; Using temporary; Using filesort
8 DERIVED f range year,gender year 4 NULL 94949 Using where; Using temporary; Using filesort
7 DERIVED f ref year,gender gender 257 364263 Using where; Using temporary; Using filesort
6 DERIVED f ref year,gender gender 257 397322 Using where; Using temporary; Using filesort
5 DERIVED f ref year,gender gender 257 364263 Using where; Using temporary; Using filesort
4 DERIVED f ref year,gender gender 257 397322 Using where; Using temporary; Using filesort
3 DERIVED f ALL NULL NULL NULL NULL 37045 Using where
2 DERIVED f ALL NULL NULL NULL NULL 37045 Using where
谢谢!
-b
答案 0 :(得分:0)
让我们考虑一下这一点。如果你正在选择一个表的行(只是抓住它们)而不是更新每个...单行......当你经历它们时,需要更长的时间?读n行或修改(更新)n行数?
将它与阅读书籍的10行与在一张纸上书写相同的10行进行比较。哪一个需要更长时间?
我还要补充一点,您阅读和更新的行越多,差异就越大。正如读书与写作书的线条会有所不同,你阅读/写作的线条就越多。
答案 1 :(得分:0)
如果这是您的实际陈述,则不需要第2和第3 LEFT JOIN,因为它们不会更改结果。
顺便说一下,MySQL不知道有效处理“复杂”查询:-) 如果你在临时表中实现SELECT的结果并使用它,那么它应该更快。