Question

我正在尝试将大型数据集（更多150列，csv文件）导入到RStudio，但是我仍然无法为所有列手动选择“数字”，因为它们似乎是以“double”或“double”导入的字符“默认模式。这需要大量文件的大量时间。这是代码：

> Test_example <- read_csv("~/Desktop/Test_example.csv", 
+     col_types = cols(c_perc = col_number(), 
+         n_perc = col_number(), treatment = col_integer()), 
+     na = "0")

我尝试过使用“stringsAsFactors”命令，但它不起作用。另外，如果我在导入时没有为列添加格式，我会收到错误：

"Parsed with column specification:
cols(
  .default = col_integer()
)
See spec(...) for full column specifications."

Answer 1

检查有关功能的帮助。如果您指定AWSCredentials cred = new BasicAWSCredentials(<accessKey>,<secretKey>); AmazonS3 s3client = AmazonS3ClientBuilder.standard() .withCredentials(new AWSStaticCredentialsProvider(cred)) .withClientConfiguration(<your configuration>) .withRegion(Region.getRegion(Regions.AP_SOUTH_1));，则需要所有150列的规范或使用col_types 你也可以使用像“cid”这样的紧凑字符串来表示字符，整数，双精度您也可以尝试让函数猜测。

readr :: read_csv问题：数字变为双/字符

1 个答案: