Orientdb将csv导入文档模型

时间:2017-06-29 14:59:17

标签: import model etl orientdb document

我正在尝试使用ETL导入csv文件以在Orientdb中记录Model 我不知道这是不是一个新手,而不是很多关于文档模型的文档,但我尝试的是:

{
  "config": {
    "log": "debug"
  },
  "begin": [],
  "source": {
    "file": {
      "path": "C:/Users/M/Desktop/files/lact.csv"
    }
  },
  "extractor": 
{ "csv": 
      {  "separator": ",", 
         "nullValue": "NULL"
      }
  },
  "transformers": [
    {
      "log": {}
    }
  ],
  "loader": {
    "orientdb": {
      "dbURL": "plocal:../databases/Model_doc",



       "dbType": "document",
      "classes": [
        {
          "name": "Annotations"
        },


      ]
    }
  },
  "end": []
}

我在显示文件内容的解析后得到这个说法: [orientdb] DEBUG orientdb:在类'null'中找到0个文档

Csv文件

"Entry","Entry_name","Status","Protein_names","Gene_names","Organism","Length","Cross_reference(STRING)"
"Q29836","1B67_HUMAN","reviewed","HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67)","HLA-B HLAB","Homo sapiens (Human)","362","9606.ENSP00000399168;"
"P30501","1C02_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2)","HLA-C HLAC","Homo sapiens (Human)","366",""
"P30508","1C12_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12)","HLA-C HLAC","Homo sapiens (Human)","366",""
"Q29960","1C16_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16)","HLA-C HLAC","Homo sapiens (Human)","366",""
"Q29865","1C18_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18)","HLA-C HLAC","Homo sapiens (Human)","366",""

2 个答案:

答案 0 :(得分:1)

您需要为文档指定一个类,在日志

之后将字段转换器添加到链中
"transformers": [
{
  "log": {}
},
{
  "field": {
    "fieldName": "@class",
    "value": "Annotations"
  }
}
],

答案 1 :(得分:1)

我尝试了你的代码,我有同样的信息:

[orientdb] DEBUG orientdb: found 0 documents in class 'null'

但我已经能够导入所有数据,正如您从我的屏幕截图中看到的那样。

enter image description here

要做到这一点,正如@RobertoFranchini所说,你必须加上这个:

 "transformers": [
{
  "log": {}
},
{
  "field": {
    "fieldName": "@class",
    "value": "Annotations"
  }
}
],

我对你的csv文件做了一点改动:

Entry,Entry_name,Status,Protein_names,Gene_names,Organism,Length,Cross_reference(STRING)
Q29836,1B67_HUMAN,reviewed,HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67),HLA-B HLAB,Homo sapiens (Human),362,9606.ENSP00000399168
P30501,1C02_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2),HLA-C HLAC,Homo sapiens (Human),366,
P30508,1C12_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12),HLA-C HLAC,Homo sapiens (Human),366,
Q29960,1C16_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16),HLA-C HLAC,Homo sapiens (Human),366,
Q29865,1C18_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18),HLA-C HLAC,Homo sapiens (Human),366,

并且已导入所有数据。

希望它有所帮助。

问候。