从SPELL数据创建序列对象

时间:2013-04-19 16:35:41

标签: r traminer

我正在尝试使用SPELL格式创建一个seqdef的序列对象。以下是我的数据示例:

spell <- structure(list(ID = c(1, 3, 3, 4, 5, 5, 6, 8, 9, 10, 11, 11, 
12, 13, 13, 13, 13, 14, 14, 14, 14, 14, 14, 14, 14, 14, 15, 15, 
15, 15, 15, 15, 15, 16, 16, 16, 16, 17, 17, 17, 18, 18, 18, 19, 
19), status = c(1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
1, 1, 2, 3, 1, 2, 3, 2, 3, 1, 1, 1, 3, 1, 3, 3, 1, 3, 1, 1, 1, 
1, 1, 3, 3, 1, 3, 1, 1, 1), time1 = c(1, 1, 57, 1, 1, 91, 1, 
1, 1, 1, 1, 104, 1, 1, 60, 109, 121, 1, 42, 47, 54, 64, 72, 78, 
85, 116, 1, 29, 39, 69, 74, 78, 88, 1, 16, 40, 68, 1, 30, 123, 
1, 39, 51, 1, 61), time2 = c(125, 57, 125, 125, 91, 125, 125, 
125, 125, 125, 104, 125, 125, 60, 109, 121, 125, 42, 47, 54, 
64, 72, 78, 85, 116, 125, 29, 39, 69, 74, 78, 88, 125, 16, 40, 
68, 125, 30, 123, 125, 39, 51, 125, 61, 125)), .Names = c("ID", 
"status", "time1", "time2"), row.names = c(NA, 45L), class = "data.frame")

当我尝试定义序列对象时,会抛出一个奇怪的错误:

spell.seq <- seqdef(data=spell, informat="SPELL", id="ID", begin="time1", end="time2", 
                    status="status", limit=125,process=FALSE)

 [>] time axis: 1 -> 125
 [>] SPELL data converted into 17 STS sequences
 [>] 3 distinct states appear in the data: 
     1 = 1
     2 = 2
     3 = 3
 [>] state coding:
       [alphabet]  [label]  [long label] 
     1  1           1        1
     2  2           2        2
     3  3           3        3
 [>] 17 sequences in the data set
 [>] min/max sequence length: 125/125
Error in `row.names<-.data.frame`(`*tmp*`, value = value) : 
  invalid 'row.names' length

但是,如果我通过seqformat间接地执行相同的操作,保留相同的参数,则不会抛出任何错误:

sts <- seqformat(data=spell,from="SPELL",to="STS",
                 id="ID",begin="time1",end="time2",status="status",
                 limit=125,process=FALSE)

seqs <- seqdef(sts,right="DEL")

将TramineR 1.8-5与R 3.0.0 Windows 7 64位一起使用。这是一个错误还是我做错了什么?提前致谢。

1 个答案:

答案 0 :(得分:4)

快速查看seqdef()的来源row.names如何设置显示它们是根据id参数的值设置的。

?seqdef中查看id节目

  

ID
  用于设置序列对象的rownames的可选参数。 如果为NULL(默认值),则从输入数据中获取rownames。如果设置为“auto”,则序列的编号从1到序列数。也可以指定长度等于序列数的rownames向量。

从问题中的示例中,您传递的id="ID"不符合这些条件。将此更改为id=NULL可以使命令按预期完成,并使用identical( spell.seq, seqs)检查相等性会产生true