Question

我有以下学生表：

Student_Name Student_RollNo Student_Marks1 Student_Marks2
Andrew       10             90.9           9.2
Blake        2              78.2           100
Fisher       1              69.2           89
Alex         1              98.8           82.1

我的查询：

select * 
from Student
order by (Student_Marks1+Student_Marks2) limit 1;

即。我想检索具有最高分的学生的学生详细信息。这里最高分是一个函数，它是Student_Marks1和Student_Marks2的总和。现在我很想知道postgresql（或mysql）使用哪种算法来执行此排名？

即postgresql首先应用ORDER BY然后LIMIT;或者它首先应用LIMIT，然后是ORDER BY;或者它是否使用将ORDER BY和LIMIT一起应用的算法

Answer 1

PostgreSQL使用默认的B树运算符类来确定ORDER BY表达式中的排序顺序（ASC作为默认值使用＆lt;首先排序较小，DESC使用＆gt;来变大）。

对于有关多个属性的评论问题，它会根据您对排序字段的输入顺序使用ORDER BY。

Answer 2

通常算法非常简单：
1.从Student表中删除行 2.使用ORDER BY表达式从＃1对整个结果集进行排序 3.应用LIMIT子句（+ offset）以从＃2

获得的有序结果集中获取一部分行您可以在此处阅读有关LIMIT的更多信息：http://www.postgresql.org/docs/9.4/static/queries-limit.html

在某些情况下，在ORDER BY操作（排序）期间会考虑LIMIT以加快查询速度，尤其是在某些索引可用于消除排序操作时。

您可以查看这项工作如何查看解释计划假设在此表上创建了一个索引：

create index student_mark1 on student( student_marks1 );

此查询提供以下解释计划：

select * From student
order by student_marks2 
limit 1;

Limit  (cost=5.06..5.06 rows=1 width=178) (actual time=0.088..0.089 rows=1 loops=1)
  Output: student_name, student_rollno, student_marks1, student_marks2
  ->  Sort  (cost=5.06..5.57 rows=204 width=178) (actual time=0.088..0.088 rows=1 loops=1)
        Output: student_name, student_rollno, student_marks1, student_marks2
        Sort Key: student.student_marks2
        Sort Method: top-N heapsort  Memory: 25kB
        ->  Seq Scan on public.student  (cost=0.00..4.04 rows=204 width=178) (actual time=0.007..0.021 rows=204 loops=1)
              Output: student_name, student_rollno, student_marks1, student_marks2

您需要从下到上阅读此计划第一个操作是Seq scan - 这意味着从磁盘读取所有行（整个表 - 请参阅actual rows = 204）。
然后排序操作被pefrormed（ORDER BY）最后一个操作是LIMIT 1（在计划的顶部）

将上述计划与此查询进行比较：

select * From student
order by student_marks1 
limit 1;

Limit  (cost=0.14..0.24 rows=1 width=178) (actual time=0.010..0.010 rows=1 loops=1)
  Output: student_name, student_rollno, student_marks1, student_marks2
  ->  Index Scan using student_mark1 on public.student  (cost=0.14..19.20 rows=204 width=178) (actual time=0.009..0.009 rows=1 loops=1)
        Output: student_name, student_rollno, student_marks1, student_marks2

这里跳过了排序阶段，因为我们可以使用索引以所需的顺序（ORDER BY student_marks1 => INDEX ON Student( student_marks1 )）检索行。
请注意最底层操作中的Actual rows = 1：`索引扫描＆＃39;。
这意味着，PostgreSQL不扫描整个索引，但是从索引中只检索1（第一）行，因为它知道，而不是查询具有LIMIT 1子句。（人们有时会说PostgreSQL＆＃34;将＃1;限制1子句限制为索引扫描操作并使用它来减少索引中的扫描条目数量。

有关使用索引加速ORDER BY的更多信息，请访问：http://www.postgresql.org/docs/8.3/static/indexes-ordering.html

如果您的问题中的查询，ORDER BY clasue包含表达式Student_Marks1+Student_Marks2，而不是简单的列。此查询的解释计划如下所示：

select *
From student
order by student_marks1 + student_marks2 
limit 2;

Limit  (cost=7.10..7.11 rows=2 width=178) (actual time=0.207..0.207 rows=2 loops=1)
  Output: student_name, student_rollno, student_marks1, student_marks2, (((student_marks1)::numeric + student_marks2))
  ->  Sort  (cost=7.10..7.61 rows=204 width=178) (actual time=0.205..0.205 rows=2 loops=1)
        Output: student_name, student_rollno, student_marks1, student_marks2, (((student_marks1)::numeric + student_marks2))
        Sort Key: (((student.student_marks1)::numeric + student.student_marks2))
        Sort Method: top-N heapsort  Memory: 25kB
        ->  Seq Scan on public.student  (cost=0.00..5.06 rows=204 width=178) (actual time=0.019..0.107 rows=204 loops=1)
              Output: student_name, student_rollno, student_marks1, student_marks2, ((student_marks1)::numeric + student_marks2)

但您仍然可以通过以下方式加快此查询创建function based index：

create index student_mark12 on student( ( student_marks1 + student_marks2) );

创建索引后，我们现在有：

Limit  (cost=0.14..0.34 rows=2 width=178) (actual time=0.044..0.047 rows=2 loops=1)
  Output: student_name, student_rollno, student_marks1, student_marks2, (((student_marks1)::numeric + student_marks2))
  ->  Index Scan using student_mark12 on public.student  (cost=0.14..20.22 rows=204 width=178) (actual time=0.043..0.046 rows=2 loops=1)
        Output: student_name, student_rollno, student_marks1, student_marks2, ((student_marks1)::numeric + student_marks2)

请注意，Postgre在这种情况下使用索引，并根据LIMIT 2子句从中检索仅2个条目（实际行= 2）。

postgresql选择最高值

2 个答案: