目标:
在第1周中获取ID登录,然后在第2周中登录多少个ID。 重新开始第2周至第3周的逻辑。 然后是第3周和第4周,依此类推...该练习需要每周进行。 需要按队列(即他们订阅的月份和年份)对ID进行细分。
故事:
第一个表(成员)具有电子邮件及其创建日期。第二个表(登录表)是登录活动。首先,我需要按创建日期(月-年)对电子邮件进行分组以创建群组。 然后,每个群组的每周登录活动比较。可能吗 要使该查询每周动态?
输出: 结果应如下所示:
+--------+--------+--------+--------+---------+
| Cohort | 2019-1 | 2019-2 | 2019-3 | 2019-4 |...
+--------+--------+--------+--------+---------+
| 2018-3 | 7000 | 6800 | 7400| 7100 |...
| 2018-4 | 6800 | 6500 | 8400| 8000 |...
| 2018-5 | 9500 | 8000 | 6400| 6200 |...
| 2018-6 | 9100 | 8500 | 8000| 7800 |...
| 2018-7 | 10000 | 8000 | 7000| 6800 |...
+--------+--------+--------+--------+---------+
我尝试过的事情:
SELECT CONCAT(DATEPART(YEAR,m.date_created),'-',DATEPART(MONTH,m.date_created)) AS Cohort
,CONCAT(subquery.[YYYY],'-',subquery.[ISO]) AS YYYY_ISO
,m.email
FROM member as m
INNER JOIN (SELECT DATEPART(YEAR,log.login_time) AS [YYYY]
,DATEPART(ISO_WEEK,log.login_time) AS [ISO]
,log.email
,ROW_NUMBER()
OVER(PARTITION BY
DATEPART(YEAR,log.login_time),
DATEPART(ISO_WEEK,log.login_time),
log.email
ORDER BY log.login_time ASC) AS Log_Rank
FROM login AS log
WHERE CAST(log.login_time AS DATE) >= '2019-01-01'
) AS subquery ON m.email=subquery.email AND Log_Rank = 1
ORDER BY cohort
样本数据:
CREATE TABLE member
([email] varchar(50), [date_created] Datetime)
CREATE TABLE login
([email] varchar(50), [login_time] Datetime)
INSERT INTO member
VALUES
('player123@google.com', '2018-03-01 05:00:00'),
('player999@google.com', '2018-04-12 12:00:00'),
('player555@google.com', '2018-04-25 20:15:00')
INSERT INTO login
VALUES
('player123@google.com', '2019-01-07 05:30:00'),
('player123@google.com', '2019-01-08 08:30:00'),
('player123@google.com', '2019-01-15 06:30:00'),
('player999@google.com', '2019-01-08 11:30:00'),
('player999@google.com', '2019-01-10 07:30:00'),
('player555@google.com', '2019-01-08 04:30:00')