如何阅读" .TAB"文件

时间:2018-05-13 09:30:22

标签: r web-scraping download

我正在尝试通过R找到从Harvard Dataverse网站检索数据的方法。我正在使用" dataverse"和" dvn"包等等。许多数据文件以" .tab"结尾,尽管它们没有格式化为普通的制表符分隔文本。

我做到了:

library(dataverse)   

## 01. Using the dataverse server and making a search
Sys.setenv("DATAVERSE_SERVER" ="dataverse.harvard.edu")

## 02. Loading the dataset that I chose, by url
doi_url <- "https://doi.org/10.7910/DVN/ZTCWYQ"
my_dataset <- get_dataset(doi_url)

## 03. Grabbing the first file of the dataset
## which is named "001_AppendixC.tab"
my_files <- my_dataset$files$label
my_file <- get_file(my_files[1], doi_url)
AppendixC <- tempfile()
writeBin(my_file, AppendixC)

read.table(AppendixC)
> Error in scan(file = file, what = what, sep = sep, quote = quote, dec = dec,  : 
> line 1 did not have 2 elements
> In addition: Warning message:
> In read.table(AppendixC) :
> line 1 appears to contain embedded nulls

任何提示?

0 个答案:

没有答案