Efficient way to pivot a table

Date: 2019-12-13 13:30:32

Tags: sql hive hiveql

I have a table named monthly_agg that contains monthly aggregated data.

+------------+-----+----------+-----------+---------------+--------------+-------------+----------+---------+
| yyyy_mm_dd | id  | app      | ex_status | active_status | active_count | active_base | ex_count | ex_base |
+------------+-----+----------+-----------+---------------+--------------+-------------+----------+---------+
| 2019-01-31 | 123 | content  | impl      | impl          | 390          | 321         | 344      | 340     |
+------------+-----+----------+-----------+---------------+--------------+-------------+----------+---------+
| 2019-01-31 | 333 | messages | impl      | impl          | 541          | 210         | 788      | 610     |
+------------+-----+----------+-----------+---------------+--------------+-------------+----------+---------+
| 2019-01-31 | 832 | photos   | no        | no            | null         | 430         | null     | 100     |
+------------+-----+----------+-----------+---------------+--------------+-------------+----------+---------+

I want each app to become a column. Each app column should contain a percentage, calculated as follows:

SELECT
    yyyy_mm_dd,
    id,
    app,
    SUM(CASE
        WHEN (app = 'content' AND ex_status = 'impl') THEN ex_count/ex_base
        WHEN (active_status = 'impl') THEN active_count/active_base
    END) AS percentage
FROM
    monthly_agg
GROUP BY
    yyyy_mm_dd, id, app

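I need each app value to become a column, with the result of the calculation above as that column's value. How can I pivot the table this way? Ideally, my output would look like this: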

+------------+-----+--------------------+---------------------+
| yyyy_mm_dd | id  | content_percentage | messages_percentage |
+------------+-----+--------------------+---------------------+
| 2019-01-31 | 123 | 1.2                | null                |
+------------+-----+--------------------+---------------------+
| 2019-01-31 | 333 | null               | 2.57                |
+------------+-----+--------------------+---------------------+

I have about 20 apps, so a dynamic solution would be great.

1 answer:

Answer 0 (score: 0)

IIUC, you could try:

SELECT
    yyyy_mm_dd,
    id,
    SUM(CASE
        WHEN (app = 'content' AND ex_status = 'impl') THEN ex_count/ex_base
        WHEN (app = 'content' AND active_status = 'impl') THEN active_count/active_base
        ELSE 0
    END) AS content_percentage,
    SUM(CASE
        WHEN (app = 'messages' AND active_status = 'impl') THEN active_count/active_base
        ELSE 0
    END) AS messages_percentage
FROM
    monthly_agg
GROUP BY
    yyyy_mm_dd, id
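
If you want NULL rather than 0 where an app has no value for an id, as in your desired output, one possible variant is to compute the percentage once in a subquery and then pivot with one conditional SUM per app. This is only a sketch, not tested against your data: the alias t and the subquery layout are my own choices, and it reuses the CASE logic from your question as-is.

```sql
-- Sketch: compute the percentage once per row, then pivot with one
-- conditional aggregate per app. With no ELSE branch, non-matching rows
-- contribute NULL, and SUM over only NULLs returns NULL instead of 0.
SELECT
    yyyy_mm_dd,
    id,
    SUM(CASE WHEN app = 'content'  THEN percentage END) AS content_percentage,
    SUM(CASE WHEN app = 'messages' THEN percentage END) AS messages_percentage
    -- add one more SUM(CASE ...) line per remaining app value
FROM (
    SELECT
        yyyy_mm_dd,
        id,
        app,
        CASE
            WHEN app = 'content' AND ex_status = 'impl' THEN ex_count / ex_base
            WHEN active_status = 'impl' THEN active_count / active_base
        END AS percentage
    FROM
        monthly_agg
) t  -- t is an arbitrary alias for the subquery
GROUP BY
    yyyy_mm_dd, id
```

As far as I know, Hive has no built-in PIVOT clause, so the roughly 20 columns either have to be written out like this or the SELECT list has to be generated by a script. Also note that an id such as 832, whose statuses are not 'impl', would still come back as a row with NULL in every app column; filter it out if you don't want that.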