R tm包。在哪里可以找到TermDocumentMatrix组件的详细描述?我,j,v

时间:2013-04-12 14:28:12

标签: r tm

作为一个例子,这是一个tdm:

str(AssociatedPress)
List of 6
$ i       : int [1:302031] 1 1 1 1 1 1 1 1 1 1 ...

$ j       : int [1:302031] 116 153 218 272 299 302 447 455 548 597 ...
$ v       : int [1:302031] 1 2 1 1 1 1 2 1 1 1 ...
$ nrow    : int 2246
$ ncol    : int 10473
$ dimnames:List of 2
..$ Docs : NULL
..$ Terms: chr [1:10473] "aaron" "abandon" "abandoned" "abandoning" ...
- attr(*, "Weighting")= chr [1:2] "term frequency" "tf"
- attr(*, "class")= chr [1:2] "DocumentTermMatrix" "simple_triplet_matrix"

我一直试图找到这些列的描述$ i,$ j,$ v ... 非常感谢,

1 个答案:

答案 0 :(得分:3)

看看这个:http://www.inside-r.org/packages/cran/slam/docs/as.simple_triplet_matrix

?TermDocumentMatrix

我们看到:

Value

An object of class TermDocumentMatrix or class DocumentTermMatrix
(both inheriting from a simple triplet matrix in package slam)
containing a sparse term-document matrix or document-term matrix. The
attribute Weighting contains the weighting applied to the matrix.

当您点击声明中的链接时,都会继承simple triplet matrix

Arguments

i, j    
Integer vectors of row and column indices, respectively.

v   
Vector of values.

和...

Details
simple_triplet_matrix is a generator for a class of
“lightweight” sparse matrices, “simply” represented by triplets (i,
j, v) of row indices i, column indices j, and values v, respectively.
simple_triplet_zero_matrix and simple_triplet_diag_matrix are
convenience functions for the creation of empty and diagonal
matrices.