Question

a）我试图从R中的cbc.read.table包中读取一个功能为colbycol的相对较大的.txt文件。根据我读过这个包的内容当我们有大文件（在R中读取超过1 GB）时我们会更容易工作，并且我们不需要所有的列/变量来进行分析。另外，我读到函数cbc.read.table可以支持相同的read.table参数。但是，如果我传递参数nrows（为了在R中预览我的文件），我会收到以下错误：

#My line code. I'm just reading columns 5,6,7,8 out of 27
i.can <- cbc.read.table( "xxx.txt", header = T, sep = "\t",just.read=5:8, nrows=20)
#error message
Error in read.table(file, nrows = 50, sep = sep, header = header, ...) : 
formal argument "nrows" matched by multiple actual arguments

所以，我的问题是：你能告诉我怎样才能解决这个问题？

b）之后，我尝试使用以下代码读取所有实例：

i.can.b <- cbc.read.table( "xxx.txt", header = T, sep = "\t",just.read=4:8) #done perfectly
my.df <- as.data.frame(i.can.b) #getting error in this line
Error in readSingleKey(con, map, key) : unable to obtain value for key 'Company' #Company is a string column in my data set

所以，我的问题又是：我该如何解决这个问题？

c）您是否知道在阅读文件时我可以过滤（按实例条件）的方式？

Answer 1

如果你想要预览，那么只需使用nrow = 20的read.table并设置colClasses参数来读取你的列。

尝试在读取文件时使用过滤器在R +中读取带有cbc.read.table函数的文件的一些问题

1 个答案: