从dataframe中的单个列获取文本数据

时间:2018-06-12 15:34:24

标签: r readline quanteda

我想只读取我的数据框的一个特定列,即第3列C,并创建一个文字云。让df=

A B C
1 2 sheep
2 2 sheep
3 4 goat
4 5 camel
5 2 camel
6 1 camel

我尝试从readLines(df$C)读取线条但是我收到以下错误:

 Error in readLines(df$C) : 
  'con' is not a connection

1 个答案:

答案 0 :(得分:2)

df <- read.table(textConnection("A B C
1 2 sheep
2 2 sheep
3 4 goat
4 5 camel
5 2 camel
6 1 camel"), header = TRUE, stringsAsFactors = FALSE)

library("quanteda")
## Package version: 1.3.0

corpus(df, text_field = "C") %>%
    dfm() %>%
    textplot_wordcloud(min_count = 1)

enter image description here