如何使用BigQuery模拟数据透视表?

时间:2013-10-16 21:53:23

标签: google-bigquery

我需要在列中组织查询结果,就像它是一个数据透视表一样。我怎么能这样做?

2 个答案:

答案 0 :(得分:10)

使用条件语句将查询结果组织成行和列。在下面的示例中,搜索大多数以“Google”值开头的维基百科文章的结果将被组织到列中,如果它们符合各种条件,则会显示修订计数。

SELECT
  page_title,
  /* Populate these columns as True or False, depending on the condition */
  IF(page_title CONTAINS 'search', INTEGER(total), 0) AS search,
  IF(page_title CONTAINS 'Earth' OR page_title CONTAINS 'Maps', INTEGER(total), 0) AS geo,
FROM
  /* Subselect to return top revised Wikipedia articles containing 'Google'
   * followed by additional text.
   */
  (SELECT
    TOP(title, 5) as page_title,
    COUNT(*) as total
   FROM
     [publicdata:samples.wikipedia]
   WHERE
     REGEXP_MATCH (title, r'^Google.+') AND wp_namespace = 0
  );

结果:

+---------------+--------+------+
|  page_title   | search | geo  |
+---------------+--------+------+
| Google search |   4261 |    0 |
| Google Earth  |      0 | 3874 |
| Google Chrome |      0 |    0 |
| Google Maps   |      0 | 2617 |
| Google bomb   |      0 |    0 |
+---------------+--------+------+

一个类似的例子,不使用子查询:

SELECT SensorType, DATE(DTimestamp), AVG(data) avg, 
FROM [data-sensing-lab:io_sensor_data.moscone_io13]
WHERE DATE(DTimestamp) IN ('2013-05-16', '2013-05-17')
GROUP BY 1, 2
ORDER BY 2, 3 DESC;

生成3列表:传感器类型,日期和平均数据。要“转动”并将日期作为列:

SELECT
  SensorType,
  AVG(IF(DATE(DTimestamp) = '2013-05-16', data, null)) d16,
  AVG(IF(DATE(DTimestamp) = '2013-05-17', data, null)) d17
FROM [data-sensing-lab:io_sensor_data.moscone_io13]
GROUP BY 1
ORDER BY 2 DESC;

答案 1 :(得分:0)

相同的方法/结果,但使用BigQuery Standard SQL:

-- top revised Wikipedia articles containing 'Google'
WITH articles AS (
  SELECT title AS page_title,
         COUNT(*) AS total
    FROM `publicdata.samples.wikipedia`
   WHERE REGEXP_CONTAINS(title, r'^Google.+') AND wp_namespace = 0
   GROUP BY title
   ORDER BY total DESC
   LIMIT 5
)

SELECT page_title,
       -- Populate these columns as True or False, depending on the condition
       IF(page_title LIKE '%search%', total, 0) AS search,
       IF(page_title LIKE '%Earth%' OR page_title LIKE '%Maps%', total, 0) AS geo
  FROM articles
;