Question

我跑了ElasticSearch 1.x（幸福地）超过一年。现在是时候进行一些升级 - 到2.1.x.应该关闭节点，然后再次（逐个）打开。看起来很容易。
但后来我遇到了麻烦。主要的问题是我自己创建的字段_uid，以便我从随机的其他文件（通过散列值）中知道文档的确切位置。这样我知道只返回确切的一个。升级期间我得到了

MapperParsingException[Field [_uid] is a metadata field and cannot be added inside a document. Use the index API request parameters.]

但是当我尝试将我的前_uid映射到_id（这也应该足够好）时，我会得到类似的东西。

我使用_uid参数的原因是因为查找时间比使用termsQuery（或类似函数）低很多。
如何在每个文档中使用_uid或_id字段来快速（和准确）查找某些确切的文档？请注意，我当时必须调用数千个精确的，所以我需要一个像查询一样的ID。此外，文档的_uid或_id可能不存在（在这种情况下，我想像现在一样，假设＆＃39;结果）

注意：从1.x升级到2.x非常大（过滤器消失，名称中没有点，没有_xxx的默认访问权限）

更新（无效）：
使用：

更新_uid或_id的映射

final XContentBuilder mappingBuilder = XContentFactory.jsonBuilder().startObject().startObject(type).startObject("_id").field("enabled", "true").field("default", "xxxx").endObject()
            .endObject().endObject();
 CLIENT.admin().indices().prepareCreate(index).addMapping(type, mappingBuilder)
                .setSettings(Settings.settingsBuilder().put("number_of_shards", nShards).put("number_of_replicas", nReplicas)).execute().actionGet();

结果：

MapperParsingException[Failed to parse mapping [XXXX]: _id is not configurable]; nested: MapperParsingException[_id is not configurable];

更新：将名称更改为_id而不是_uid，因为后者是由_type＃_id构建的。因此，我需要能够写入_id。

Answer 1

由于似乎无法设置_uid和_id我发布解决方案。我将所有具有_uid的文档映射到uid（用于内部引用）。在某些时候它来找我，你可以设置相关的id

要使用id批量插入文档，您可以：

final BulkRequestBuilder builder = client.prepareBulk();
for (final Doc doc : docs) {
    builder.add(client.prepareIndex(index, type, doc.getId()).setSource(doc.toJson()));
}
final BulkResponse bulkResponse = builder.execute().actionGet();

注意第三个参数，这个参数可能是null（或者是一个双值参数，然后ES将生成id）。
然后通过id获取一些文件，你可以：

final List<String> uids = getUidsFromSomeMethod(); // ids for documents to get
final MultiGetRequestBuilder builder = CLIENT.prepareMultiGet();
builder.add(index_name, type, uids);
final MultiGetResponse multiResponse = builder.execute().actionGet();
// in this case I simply want to know whether the doc exists
if (only_want_to_know_whether_it_exists){
    for (final MultiGetItemResponse response : multiResponse.getResponses()) {
        final boolean exists = response.getResponse().isExists();
        exist.add(exists);
    }
} else {
    // retrieve the doc as json
    final String string = builder.getSourceAsString();
    // handle JSON
}

如果你只想要1：

client.prepareGet().setIndex(index).setType(type).setId(id);

执行 - 使用curl的单次更新是mapping-id-field（注意：完全复制）：

# Example documents
PUT my_index/my_type/1
{
  "text": "Document with ID 1"
}

PUT my_index/my_type/2
{
  "text": "Document with ID 2"
}

GET my_index/_search
{
  "query": {
    "terms": {
      "_id": [ "1", "2" ] 
    }
  },
  "script_fields": {
    "UID": {
      "script": "doc['_id']" 
    }
  }
}

Elasticsearch 2.x索引映射_id

1 个答案: