Question

我对R来说相对较新，甚至对Shiny来说更新（字面意思是第一天）。

我希望用户输入由逗号分隔的多个短语，例如female, aged, diabetes mellitus.我有一个数据框，其中一个变量MH2包含文本字。我想输出一个数据帧，其中只包含所有输入短语都存在的行。有时用户可能只输入一个短语，其他时间为5.

这是我的ui.R

library(shiny)
library(stringr)

# load dataset
load(file = "./data/all_cardiovascular_case_reports.Rdata")

ui <- fluidPage(
  sidebarLayout(
    sidebarPanel(
      textInput(inputId = "phrases", 
                label = "Please enter all the MeSH terms that you would like to search, each separated by a comma:",
                value = ""),

      helpText("Example: female, aged, diabetes mellitus")

    ),
    mainPanel(DT::dataTableOutput("dataframe"))
  )
)

这是我的服务器.R

library(shiny)

server <- function(input, output)
{
  # where all the code will go
    df <- reactive({

      # counts how many phrases there are
      num_phrases <- str_count(input$phrases, pattern = ", ") + 1

      a <- numeric(num_phrases) # initialize vector to hold all phrases

      # create vector of all entered phrases
      for (i in 1:num_phrases)
      {
        a[i] <- noquote(strsplit(input$phrases, ", ")[[i]][1])
      }

      # make all phrases lowercase
      a <- tolower(a)

      # do exact case match so that each phrase is bound by "\\b"
      a <- paste0("\\b", a, sep = "")
      exact <- "\\b"
      a <- paste0(a, exact, sep = "")

      # subset dataframe over and over again until all phrases used
      for (i in 1:num_phrases)
      {
        final <- final[grepl(pattern = a, x = final$MH2, ignore.case = TRUE), ]
      }

      return(final)
    })

    output$dataframe <- DT::renderDataTable({df()})
}

当我尝试运行renderText({num_phrases})时，即使我输入用逗号分隔的多个短语，我也始终得到1。从那时起，每当我尝试输入多个短语时，我都会遇到错误：下标超出范围。＆＃34;但是，当我输入用逗号分隔的单词而不是逗号和空格（输入＆＃34;女性，年龄＆＃34;而不是＆＃34;女性，年龄＆＃34;）然后该问题消失，但我的数据框没有正确的子集。它只能是一个短语的子集。

请告知。

感谢。

Answer 1

我认为你的Shiny逻辑看起来不错，但是对数据帧进行子集化的功能有一些小问题。特别是：

a[i] <- noquote(strsplit(input$phrases, ", ")[[i]][1])

指数[[i]]和1位于错误的位置，应为[[1]][i]

final <- final[grepl(pattern = a, x = final$MH2, ignore.case = TRUE), ]

你不能像这样匹配多个模式，只会使用a的第一个元素，这也是R给出的警告。

示例工作代码

我已将input$phrases更改为inp_phrases。如果此脚本执行您想要的操作，我认为您可以轻松地将其复制到您的反应中，进行必要的更改（即更改inp_phrases并添加return(result)语句。）。我还不完全清楚你是否希望所有模式在一行内匹配，或者返回所有模式匹配的所有行，所以我添加了它们，你可以取消注释你需要的那些：

library(stringr)

# some example data
inp_phrases = "ab, cd"
final = data.frame(index = c(1,2,3,4),MH2 = c("ab cd ef","ab ef","cd ef ab","ef gx"),stringsAsFactors = F)

# this could become just two lines:
a <- sapply(strsplit(inp_phrases, ", ")[[1]],  function(x) tolower(noquote(x)))
a <- paste0("\\b", a, "\\b") 

# Two options here, uncomment the one you need.
# Top one: match any pattern in a. Bottom: match all patterns in a
# indices = grepl(pattern = paste(a,collapse="|"), x = final$MH2, ignore.case = TRUE)
indices = colSums(do.call(rbind,lapply(a, function(x) grepl(pattern = x, x = final$MH2, ignore.case = TRUE))))==length(a)

result <- final[indices,]

返回：

  index      MH2
1     1 ab cd ef
3     3 cd ef ab

...使用第二版索引（全部匹配）或

  index      MH2
1     1 ab cd ef
2     2    ab ef
3     3 cd ef ab

...使用索引的第一个版本（匹配任何）

希望这有帮助！

操纵R Shiny中的textInput

1 个答案: