Question

这是我数据框的前10行：

head(test.data,10)
# A tibble: 10 x 5
       date             o2.permeg co2.ppm  apo        o2.spike
      <time>              <dbl>   <dbl>    <dbl>       <chr>
1  2015-01-01 00:00:00   -685.09 413.023 -354.1816        N
2  2015-01-01 00:02:00   -695.10 412.894 -364.8690        N
3  2015-01-01 00:04:00   -687.84 412.979 -357.1627        N
4  2015-01-01 00:06:00   -683.23 412.866 -353.1460        N
5  2015-01-01 00:08:00   -683.28 412.755 -353.7788        N
6  2015-01-01 00:10:00   -685.40 412.647 -356.4659        N
7  2015-01-01 00:12:00   -687.80 412.659 -358.8029        N
8  2015-01-01 00:14:00   -662.79 412.665        NA        Y
9  2015-01-01 00:16:00   -684.17 412.762 -354.6321        N
10 2015-01-01 00:18:00   -680.37 412.720 -351.0526        N

正如您所看到的，最后一列名为o2.spike，其中包含字符N和Y. N表示数据点不是尖峰，Y表示它是尖峰。在此示例中，只有1 Y，但在实际帧中，有负载，并随机放置。

我希望绘制绘图中的所有数据点，标记为Y的那些数据点将以不同的颜色绘制。

为了您的信息，这是我用来绘制所有内容的当前代码。前3个变量以红色，绿色和蓝色绘制，我希望将“Y”行绘制为例如粉红色。

library(openair)
test.data$yr_day <- format(as.Date(test.data$date), "%Y-%m-%d")
dir.create(daily) # where "daily" is the path of the folder I want to save the plots into
for (d in unique(test.data$yr_day)) {
mypath <- file.path(daily, paste(name, d, ".png", sep = "" )) 
png(filename = mypath, width = 963, height = 690) 
timePlot(subset(test.data, yr_day == d), 
       plot.type = "p",
       pollutant = c("co2.ppm", "o2.permeg", "apo"), 
       y.relation = "free",
       date.pad = TRUE,
       pch = c(19,19,19),
       cex = 0.2,
       xlab = paste("Time of day in hours on", d),
       ylab = "CO2, O2, and APO concentrations",
       name.pol = c("CO2 (ppm)", "O2 (per meg)", "APO (per meg)"),
       date.breaks = 24,
       date.format = "%H:%M"
  )
 dev.off()
}

示例图（包含与非尖峰颜色相同颜色的所有尖峰）如下：

那么如何用与众不同的颜色绘制尖峰？非常感谢你！

编辑：正如塞巴斯蒂安所问，我已经添加了这个（不确定你们将如何从中提取数据）

dput(head(test.data,20))

structure(list(date = structure(c(1420070400, 1420070520, 1420070640, 
1420070760, 1420070880, 1420071000, 1420071120, 1420071240, 1420071360, 
1420071480, 1420071600, 1420071720, 1420071840, 1420071960, 1420072080, 
1420072200, 1420072320, 1420072440, 1420072560, 1420072680), class =    c("POSIXct", 
"POSIXt"), tzone = "GMT"), o2.permeg = c(-685.09, -695.1, -687.84, 
-683.23, -683.28, -685.4, -687.8, -662.79, -684.17, -680.37,     
-684.66, -686.13, -683.27, -680.77, -682.16, -692.54, NA, NA, 
NA, NA), co2.ppm = c(413.023, 412.894, 412.979, 412.866, 412.755, 
412.647, 412.659, 412.665, 412.762, 412.72, 412.692, 412.71, 
412.757, 412.838, 412.922, 413.019, NA, NA, NA, NA), apo =   c(-354.181646778043, 
-364.868973747017, -357.162673031026, -353.145990453461, -353.778806682578, 
-356.465871121718, -358.802863961814, NA, -354.632052505966, 
-351.052577565632, -355.489594272076, -356.86508353222, -353.75830548926, 
-350.833007159904, -351.781957040573, -361.652649164678, NA, 
NA, NA, NA), o2.spike = c("N", "N", "N", "N", "N", "N", "N", 
"Y", "N", "N", "N", "N", "N", "N", "N", "N", "N", "N", "N", "N"
)), .Names = c("date", "o2.permeg", "co2.ppm", "apo", "o2.spike"
), row.names = c(NA, -20L), class = c("tbl_df", "tbl", "data.frame"
))

Answer 1

不幸的是，如果没有数据，那么回答这个问题并不容易。 ggplot2解决方案可能是：

g1 <- ggplot(data=test.data, aes(x=date, y=o2.permeg, col=o2.spike)) + geom_point()
g1

将数据帧的一列传递给参数＆＃34; col＆＃34; in＆＃34; aes＆＃34;使您可以使用不同的颜色映射该列中的每个不同值。它甚至可以创建一个图例，其名称与不同的颜色相关联。

我尝试使用另一个数据框（＆＃34; iris＆＃34;，包含在基础R中）并且它有效，希望它会有所帮助。

编辑：

要获得并排图，您可以使用ggplot创建3个图并使用＆＃34; cowplot＆＃34;提供的函数plot_grid（）。封装

library(cowplot)
g1 <- ggplot(data=test.data, aes(x=date, y=o2.permeg, col=o2.spike)) + geom_point()
g2 <- ggplot(data=test.data, aes(x=date, y=co2.ppm, col=o2.spike)) + geom_point()
g3 <- ggplot(data=test.data, aes(x=date, y=apo, col=o2.spike)) + geom_point()
plot_grid(g1, g2, g3, nrow=3, ncol=1)

根据条件绘制不同的颜色

1 个答案: