对象[seq_len(ile)]中的错误:“符号”类型的对象不是可子集化的

时间:2017-03-06 21:18:10

标签: r knitr

我正在尝试使用render语句将rmd文件转换为pdf。

render("MiningReport.Rmd", "pdf_document",output_dir = "C:/ProjectSocial/Reports/Twitter/Maybelline")

我收到如下错误

Quitting from lines 109-113 (MiningReport.Rmd) 
Error in object[seq_len(ile)] : 
  object of type 'symbol' is not subsettable

这对我来说非常奇怪,因为当我编写Rmd文件时,没有这样的错误并且pdf报告生成成功,当我尝试使用render语句执行相同操作时,它会给出错误。任何人都可以解释一下发生了什么?下面是错误蔓延的代码块

```{r assoc ,echo=F,message=FALSE}
library(tm)
findAssocs(myTdm,df$term[1:10],0.5)
```

当我删除上面的块时,下一个代码块会出现同样的错误。下面是我的Rmd文件。我正在阅读存储在指定目录中的文件中的推文。

```{r computedate,echo = FALSE}
date1 <-format(Sys.Date() - 7,"%B %d")
date2 <-format(Sys.Date() - 1,"%B %d, %Y")

```

# This report has been created on twitter data from `r date1` to `r date2`.
# Analysis of Tweets
## Below we can see the most frequent words.

```{r frequent,echo=FALSE,message=FALSE,warning=FALSE,cache=TRUE}

setwd("C:/ProjectSocial/Data/TwitterData/Maybelline")

library(devtools)
library(twitteR)
library(tm)
library(ggplot2)
library(graph)
library(Rgraphviz)
library(wordcloud)
library(topicmodels)
library(data.table)
library(fpc)
library(igraph)
library(xlsx)
library(stringr)

tweets.df<-data.frame(text=character(),favorited=character(),favoriteCount=numeric(),replyToSN=character(),
                created=as.POSIXct(character()),truncated=character(),replyToSID=character(),id=character(),replyToUID=character(),statusSource=character(),screenName=character(),retweetCount=numeric(),
isRetweet=character(),retweeted=character(),longitude=character(),latitude=character(),stringsAsFactors =F)
i<-1
while(i<=7){
  since<-Sys.Date()-i
  file<-read.xlsx2(file=paste("Maybelline",since,".xlsx",sep=""), 1,colClasses = c(rep("character",2),
  "numeric","character","POSIXct",rep("character",6),"numeric",rep("character",4)),   stringsAsFactors=F)

  tweets.df<-rbind(tweets.df,file)
  i<-i+1
}

j<-1
HashTagsList<-c()
HashTags<-str_extract_all(tweets.df$text,"#\\S+")
HashTags<-HashTags[!HashTags %in% c("character(0)")]

while (j<=length(HashTags)){

  HashTagsList<-c(HashTagsList,HashTags[[j]])
  j<-j+1
}
HashTagsList<- gsub("#", "", HashTagsList) 
HashTagsList<-unique(HashTagsList)
HashTagsList<-gsub("[^[:alnum:] ]", "", HashTagsList)

k<-1
HandleTagsList<-c()
HandleTags<-str_extract_all(tweets.df$text,"@\\S+")
HandleTags<-HandleTags[!HandleTags %in% c("character(0)")]
while (k<=length(HandleTags)){

  HandleTagsList<-c(HandleTagsList,HandleTags[[k]])
  k<-k+1
}

HandleTagsList<- gsub("@", "", HandleTagsList) 
HandleTagsList<-unique(HandleTagsList)
HandleTagsList<-gsub("[^[:alnum:] ]", "", HandleTagsList)

tweets.df$text<-gsub("#\\S+", "", tweets.df$text)
tweets.df$text<-gsub("@\\S+", "", tweets.df$text)

Tweets.df<-subset(tweets.df,isRetweet=="FALSE")
Tweets.df$text<-gsub("[^[:alpha:] ]", " ", Tweets.df$text)
Tweets.df$text<-tolower(Tweets.df$text)

myCorpus <-Corpus(VectorSource(Tweets.df$text))
myStopwords<-c(stopwords("english"),"maybelline","https","like","bring","make","thought","please","maybe",
               "know","just","want","wearing","really","last","better","best","first")
myCorpus<-tm_map(myCorpus,removeWords,myStopwords)
myCorpus<-tm_map(myCorpus,removeWords,HashTagsList)
myCorpus<-tm_map(myCorpus,removeWords,HandleTagsList)

myCorpus <- tm_map(myCorpus, PlainTextDocument)
myTdm<-TermDocumentMatrix(myCorpus,control=list(wordLengths=c(4,13)))
freq.Terms<- findFreqTerms(myTdm,lowfreq=20)
termFrequency <- rowSums(as.matrix(myTdm))
termFrequency <- subset(termFrequency, termFrequency>=20)
df <- data.frame(term=names(termFrequency), freq=termFrequency,stringsAsFactors = F)
df <- df[order(-df$freq),] 
rownames(df) <- NULL
print(head(df,50), row.names = FALSE)
df<-head(df,40)
ggplot(df,aes(x=term,y=freq)) + geom_bar(stat="identity") + xlab("Terms") +ylab("Count") +coord_flip()

```

## Below we can find all the words which are associated with the top 10 most frequent words and having correlation > 0.5.

```{r assoc ,echo=F,message=FALSE}

library(tm)
findAssocs(myTdm,df$term[1:10],0.5)

```

任何帮助表示赞赏 感谢

1 个答案:

答案 0 :(得分:22)

错误即将发生,因为我使用了echo = F而不是echo = FALSE。 F或T被视为符号,因此会产生问题。 这就是为什么F(或T)是一个符号(参见?is.symbol来了解符号是什么):

> str(alist(warning = F)) 
List of 1 $ warning: symbol F > str(alist(warning = FALSE)) List of 1 $ warning: logi FALSE