DataStax 5.1 Graph Loader - loading uniform files from multiple subdirectories

Time: 2017-05-03 00:53:26

Tags: groovy datastax datastax-enterprise datastax-enterprise-graph

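The graph loader can load files from explicitly defined directories, but there is currently no built-in way to automatically and recursively load uniform files from multiple subdirectories.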

1 answer:

Answer 0: (score: 1)

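I couldn't find any examples of loading uniform files from multiple subdirectories, so after figuring it out, I thought it would be helpful to post this for anyone who needs it in the future. Does anyone have a groovier way?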

//configure graphloader
config dryrun: false, load_vertex_threads: 2, load_edge_threads: 3,
read_threads: 1, preparation: true, create_schema: false,
abort_on_prep_errors: true

import java.io.File as javaFile; //this must be aliased so as to not conflict with graphloader's File.directory()

inputBaseDir = '/path/to/base/dir' //must be a quoted string
//base directory has many subdirectories that have many uniform files to load

//create a list of the subdirectory paths
def list = []
new javaFile(inputBaseDir).eachDir() { dir ->
    list << dir.getAbsolutePath()
}

//loop through the list of subdirectory paths and load each one
for (item in list) {
    def fileBuilder = File.directory(item)
    def theData = fileBuilder.map{
        it["specificDataLabel"] = it["data"]["specificData"][0];
        it["otherSpecificDataLabel"] = it["data"]["otherSpecificData"][0];
        it.remove("data")
        it
    }

    load(theData).asVertices {
        label "theLabel"
        key "specificDataLabel"
        vertexProperty "otherSpecificDataLabel",{
            value "metaPropertyLabel"
            value "otherMetaPropertyLabel"
        }
    }
} //end of the loop over subdirectory paths
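
As one possible answer to the "groovier way" question, here is a minimal sketch in plain Groovy, run outside the graph loader, of the two generic pieces of the script above: collecting the subdirectory paths and flattening each record. The helper names `subdirectoryPaths` and `flattenRecord`, the temp directory, and the sample record are hypothetical and only for illustration. Inside an actual mapping script, `java.io.File` would presumably still need the `javaFile` alias shown above, and the paths would still be passed to `File.directory(...)` one at a time.

```groovy
import java.nio.file.Files

// Hypothetical helper (not part of the graph loader): returns the absolute
// paths of the immediate subdirectories of baseDir, sorted for a stable order.
List<String> subdirectoryPaths(File baseDir) {
    File[] children = baseDir.listFiles()
    if (children == null) return []   // baseDir is missing or not a directory
    children.findAll { it.isDirectory() }
            .collect { it.absolutePath }
            .sort()
}

// Hypothetical helper mirroring the map closure above, applied to a plain Map:
// it lifts the first element of two nested lists to top-level keys and
// removes the nested "data" entry.
Map flattenRecord(Map record) {
    record["specificDataLabel"] = record["data"]["specificData"][0]
    record["otherSpecificDataLabel"] = record["data"]["otherSpecificData"][0]
    record.remove("data")
    record
}

// Demo on a throwaway temp directory with two subdirectories and one stray file
File base = Files.createTempDirectory("graphloader-demo").toFile()
["b", "a"].each { new File(base, it).mkdir() }
new File(base, "stray.json").text = "{}"   // plain files are skipped

List<String> dirs = subdirectoryPaths(base)
Map flat = flattenRecord([data: [specificData: ["x"], otherSpecificData: ["y"]], id: 1])
println dirs
println flat
```

Using `findAll`/`collect` replaces the explicit accumulator list, and sorting makes the load order repeatable between runs. If nested subdirectories also need to be picked up, Groovy's `eachDirRecurse` on the base directory is one option to consider.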