Question

我正在尝试进行逻辑回归，并且达到了每次观察的可能性。现在我想在给定阈值的情况下将概率分类为0或1

例如，如果我有两个数字0.65和0.87且我的阈值是0.7，我想将0.65变为0和0.87变为1.

为了达到这个目的，我尝试了以下代码，我认为这对于这么简单的任务来说太过分了，我想知道是否有任何专用于执行此操作的函数。

library(tidyverse)

# create a table of probabilities and predictions (0 or 1)
df <- tibble(
  prob = runif(20),
  pred = round(prob) # threshold = 0.5
)

# threshold function for length = 1
threshold_1 <- function(p,t) {
  if (p > t) 1 else 0
}

# threshold function for length = p
threshold_p <- function(ps, t) {
  map2_dbl(ps, t, threshold_1)
}

# below works.
df %>% mutate(
  pred = threshold_p(df$prob, 0.7)
)

我也试过这个

# threshold = 0.7
df %>%
  mutate(
  pred = round(prob - 0.2) # threshold = 0.7
)

上面的工作非常好，因为没有概率可能正好是0或1（只要我们处理分布函数），所以即使我+/- 0.5的数字（改变阈值），他们将从来没有圆到-1或2.但它只是它不是很优雅。

我想知道是否有任何功能以更简单的方式执行此操作？

Answer 1

听起来像ifelse可能会做你想要的吗？

library(dplyr)
df %>% 
  mutate(pred = ifelse(prob < 0.7, 0, 1))

R中的阈值处的舍入数字

1 个答案: