Question

I have a dataset named "rating.csv". It looks like this:

rating: 2.97, 3, 3, 3.41, 3.5, 3.5, 4, NULL, NULL, NULL, 3.67

I am trying to specify colClasses in read.csv to speed up reading the csv file. First, I create the colClasses like this:

classes <- c("double")

Then I read the dataset with:

data <- read.csv('rating.csv', colClasses=classes,
  comment.char="", na.strings='NULL')

But this read.csv call fails with an error:

Error in scan(file, what, nmax, sep, dec, quote, skip, 
 nlines, na.strings,  : scan() expected 'a real', got '"3.67"'

Do you know what is wrong with this code? Or is there another way to change the column class of the dataset in read.csv?

Answer 1

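I think this means your data is being supplied as strings rather than as numeric/double or any of the other types read.csv can read. The error message shows that scan() received '"3.67"', quotes included, so the value in the file is a quoted string and cannot be parsed directly as a real number.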
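
One possible workaround, as a minimal sketch: read the column as character and convert it to numeric afterwards. This assumes the values in the file are wrapped in double quotes, as the '"3.67"' in the error suggests. The sample file contents and the temporary path below are made up for illustration and are not the asker's actual data.

```r
# Hypothetical sample file: every value is quoted, matching the '"3.67"' in the error.
path <- tempfile(fileext = ".csv")
writeLines(c('rating', '"2.97"', '"3"', '"NULL"', '"3.67"'), path)

# Read the column as character so the surrounding quotes are stripped
# the way they are for any string field.
data <- read.csv(path, colClasses = "character", comment.char = "")

# Replace the NULL marker with NA, then convert the column to double.
data$rating[data$rating == "NULL"] <- NA
data$rating <- as.numeric(data$rating)

str(data)
```

If the file is small enough, simply omitting colClasses and letting read.csv infer the type may also work, at the cost of the speed-up the question was after.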