标签: apache-spark apache-spark-sql
答案 0 :(得分:0)
这里有两件事:
通常,Spark使用sort作为orderBy-What is the difference between sort and orderBy functions in Spark
sort
orderBy
配置单元具有SORT BY子句which sorts data locally per partition-在Spark中这种操作称为sortWithinPartitions。
SORT BY
sortWithinPartitions