R Read.table,列中包含多个单词

时间:2016-02-24 09:59:04

标签: r data-mining logfile

我有一个日志文件,这个类型要在R:

中处理
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Start entimICE Application Command Line Parameters ******
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Config-File: E:/Program Files (x86)/conf/storages.dsconfig
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Datasource: datasource
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Application: App
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Ignore : false
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Plugin: com.plug
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Logging: E:/Program Files (x86)/conf/log4j.properties
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** End Application Command Line Parameters ******
2015-11-23 11:51:02,129  INFO               BaseRuntime - Runtime created in mode: RichClient

我试着将它放在带有读取表的数据框中,但它将每个单词放在一列中,我希望有一个包含5列的数据框:

date        time           type  element              text
2015-11-23  11:25::02,082  info  FrameworkAplication  - ****** Start entimICE Application Command Line Parameters ******

问题是我的字段分隔符是一个空格以及我在不同字段中不需要的单词分隔符

是否可以通过read.table或scan,或者我应该自己做什么?

谢谢,

1 个答案:

答案 0 :(得分:0)

@ ma33kael你甚至尝试过重复的解决方案吗? 因为它按预期工作

library(readr)
a <- read_fwf(text, fwf_widths(c(10,13,6,1)))

给你:

          X1           X2   X3                                                                                       X4
1 2015-11-23 11:51:02,082 INFO  FrameworkApplication - ****** Start entimICE Application Command Line Parameters ******
2 2015-11-23 11:51:02,082 INFO FrameworkApplication - ****** Config-File: E:/Program Files (x86)/conf/storages.dsconfig
3 2015-11-23 11:51:02,082 INFO                                     FrameworkApplication - ****** Datasource: datasource
4 2015-11-23 11:51:02,082 INFO                                           FrameworkApplication - ****** Application: App
5 2015-11-23 11:51:02,082 INFO                                             FrameworkApplication - ****** Ignore : false
6 2015-11-23 11:51:02,082 INFO                                           FrameworkApplication - ****** Plugin: com.plug
7 2015-11-23 11:51:02,082 INFO      FrameworkApplication - ****** Logging: E:/Program Files (x86)/conf/log4j.properties
8 2015-11-23 11:51:02,082 INFO             FrameworkApplication - ****** End Application Command Line Parameters ******
9 2015-11-23 11:51:02,129 INFO                                        BaseRuntime - Runtime created in mode: RichClient

数据:

text <- "2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Start entimICE Application Command Line Parameters ******
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Config-File: E:/Program Files (x86)/conf/storages.dsconfig
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Datasource: datasource
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Application: App
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Ignore : false
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Plugin: com.plug
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** Logging: E:/Program Files (x86)/conf/log4j.properties
2015-11-23 11:51:02,082  INFO      FrameworkApplication - ****** End Application Command Line Parameters ******
2015-11-23 11:51:02,129  INFO               BaseRuntime - Runtime created in mode: RichClient"
相关问题