格式化数据框行而不是列

时间:2020-03-24 15:49:36

标签: r

我有一个趋势数据表。

ga_sessions_combined <-
structure(list(Metric = structure(1:7, .Label = c("Users", "Engaged Users", 
"Transactions", "Revenue", "ConversionRate", "Bounce Rate", "$/User"
), class = "factor"), ym_201904 = c(157664, 79295, 5764, 609172.887628, 
0.0365587578648265, 0.497063375279075, 3.86374116873858), ym_201905 = c(199340, 
103879, 5744, 673063.435872, 0.0288150897963279, 0.478885321561152, 
3.3764594956958), ym_201906 = c(169971, 90557, 4899, 566247.290325, 
0.0288225638491272, 0.467220878855805, 3.33143471724588), ym_201907 = c(161346, 
88059, 4223, 580408.759911, 0.0261735648854016, 0.454222602357666, 
3.5972925260682), ym_201908 = c(132702, 70701, 3106, 424807.71545, 
0.0234058265888984, 0.467219785685219, 3.20121562184443), ym_201909 = c(164160, 
96124, 3841, 724958.93068, 0.0233979044834308, 0.414449317738791, 
4.41617282334308), ym_201910 = c(217227, 118041, 4448, 798116.2282, 
0.0204762759693777, 0.456600698808159, 3.67411154322437), ym_201911 = c(970864, 
604606, 27713, 4859788.602792, 0.0285446777303515, 0.37724954267539, 
5.00563271765355), ym_201912 = c(1180689, 671162, 59536, 9447240.17602, 
0.0504247943361884, 0.431550560731912, 8.00146370129645), ym_202001 = c(216816, 
109637, 5057, 738079.024166, 0.0233239244336211, 0.494331599143975, 
3.40417231277212), ym_202002 = c(204113, 145975, 4847, 720506.474953, 
0.0237466501398735, 0.284832421256853, 3.52993917561841), ym_202003 = c(324266, 
229438, 8341, 1196234.593648, 0.0257227091338592, 0.292438923599761, 
3.68905341185323)), class = c("tbl_df", "tbl", "data.frame"), row.names = c(NA, 
-7L), .Names = c("Metric", "ym_201904", "ym_201905", "ym_201906", 
"ym_201907", "ym_201908", "ym_201909", "ym_201910", "ym_201911", 
"ym_201912", "ym_202001", "ym_202002", "ym_202003"))

当我在一个闪亮的应用程序中运行此数据框时,它看起来像这样:

enter image description here

我想基于“ Metric”列来设置表格格式,这几乎与dplyr动词类似,但列名是第一行。

对于前三行(用户,参与的用户和交易),我想使用scales::comma_format()进行格式化,以使用逗号表示成千上万的数字,例如1,000。

对于收入和“ $ /用户”行,我想使用scales::dollar_format()

进行格式化

对于“转化率”和“跳出率”行,我想将其格式化为scales::percent_format()

我该怎么做?

1 个答案:

答案 0 :(得分:1)

也许不是您要寻找的答案,但转置数据框更容易。这是一种tidyr方法:

library(tidyr)
ga_sessions_combined %>% 
  gather(key = period, value = value, 2:ncol(ga_sessions_combined)) %>% 
  spread(key = names(ga_sessions_combined)[1], value = "value")

编辑:

如果您想将其保留为宽格式,我认为这可行,但是所有内容都将转换为字符:

ga_sessions_combined %>% 
  gather(key = period, value = value, 2:ncol(ga_sessions_combined)) %>% 
  spread(key = names(ga_sessions_combined)[1], value = "value") %>% 
  mutate_at(vars(matches("Users|Engaged Users|Transactions")), funs(prettyNum(., big.mark=","))) %>% 
  mutate_at(vars(matches("Rate")), funs(scales::percent(., accuracy = 0.01))) %>% 
  mutate_at(vars(contains("$/User"), contains("Revenue")), funs(scales::dollar(.))) %>% t()

如果可以接受长格式,则只需将t()放在最后。