到目前为止我做了什么

Question

我正在尝试使用elasticsearch_dsl python库设置ElasticSearch。我已经可以设置索引，并且可以使用.filter()方法进行搜索，但是我无法使用.suggest方法。

我正在尝试使用completion映射类型和suggest查询方法，因为该方法将用于自动完成字段（在Elastic的文档中推荐）。

我是弹性的新手，所以我想我缺少了一些东西。任何指导将不胜感激！

到目前为止我做了什么

我没有找到完全符合我想要的教程，但是我通读了ElasticSearch.com和elasticsearch_dsl上的文档，并看了一些示例 here和here

PS：我在Heroku上使用Searchbox Elasticsearch

索引/映射设置：

# imports [...]

edge_ngram_analyzer = analyzer(
    'edge_ngram_analyzer',
    type='custom',
    tokenizer='standard',
    filter=[
        'lowercase',
        token_filter(
            'edge_ngram_filter', type='edgeNGram',
            min_gram=1, max_gram=20
        )
    ]
)

class DocumentIndex(ElasticDocument):
    title = Text()
    title_suggest = Completion(
        analyzer=edge_ngram_analyzer,
        )
    class Index:
        name = 'documents-index'

# [...] Initialize index
# [...] Upload Documents (5,000 documents)
# DocumentIndex.init()
# [DocumentIndex(**doc).save() for doc in mydocs]

映射输出：

这是Web控制台中显示的映射：

 {
  "documents-index": {
    "mappings": {
      "doc": {
        "properties": {
          "title": {
            "type": "text"
          },
          "title_suggest": {
            "type": "completion",
            "analyzer": "edge_ngram_analyzer",
            "search_analyzer": "standard",
            "preserve_separators": true,
            "preserve_position_increments": true,
            "max_input_length": 50
          }
        }
      }
    }
  }
}

尝试搜索

验证索引存在：

>>> search = Search(index='documents-index')
>>> search.count()  # Returns correct amount of documents
5000
>>> [doc for doc in search.scan()][:3]
>>> [<Hit(documents-index/doc/1): ...} ...

测试搜索-作品：

>>> query = search.filter('match', title='class')
>>> query.execute()
>>> result.hits 
<Response: [<Hit(documents-in [ ... ]
>>> len(result.hits)
10
>>> query.to_dict()  # see query payload
{ 
  "query":{
    "bool":{
      "filter":[
        {
          "fuzzy":{
            "title":"class"
          }
        }
      ]
    }
  }
}

失败的部分

我无法使用任何.suggest()方法。注意： *我正在关注官方library docs

测试建议：

>>> query = search.suggest(
        'title-suggestions',
        'class',
        completion={
        'field': 'title_suggest',
        'fuzzy': True
        })
>>> query.execute()
<Response: {}>
>>> query.to_dict() # see query payload
{
  "suggest": {
    "title-suggestions": {
      "text": "class",
      "completion": { "field": "title_suggest" }
    }
  }
}

我也尝试了下面的代码，显然还有许多不同类型的查询和值，但结果却相似。（请注意，.filter()我总是会得到预期的结果。）

>>> query = search.suggest(
        'title-suggestions',
        'class',
         term=dict(field='title'))
>>> query.to_dict() # see query payload
{
  "suggest": {
    "title-suggestions": { 
        "text": "class", 
        "term": { 
            "field": "title" 
        } 
    }
  }
}
>>> query.execute()
<Response: {}>

更新

根据Honza的建议，我将title_suggest映射更新为仅Completion，没有自定义分析器。我还删除了索引并从头开始重新建立索引

class DocumentIndex(ElasticDocument):
    title = Text()
    title_suggest = Completion()
    class Index:
        name = 'documents-index'

不幸的是，问题仍然存在。这里还有更多测试：

验证`title_suggest`是否已正确编入索引

>>> search = Search(index='documents-index)
>>> search.index('documents-index').count()
23369
>>> [d for d in search.scan()][0].title
'AnalyticalGrid Property'
>>> [d for d in search.scan()][0].title_suggest
'AnalyticalGrid Property'

再次尝试搜索：

>>> len(search.filter('term', title='class').execute().hits)
10
>>> search.filter('term', title_suggest='Class').execute().hits
[]
>>> search.suggest('suggestions', 'class', completion={'field': 
'title_suggest'}).execute().hits
[]

验证映射：

>>> pprint(index.get_mapping())
{
  "documents-index": {
    "mappings": {
      "doc": {
        "properties": {
          "title": { "type": "text" },
          "title_suggest": {
            "analyzer": "simple",
            "max_input_length": 50,
            "preserve_position_increments": True,
            "preserve_separators": True,
            "type": "completion"
          }
        }
      }
    }
  }
}

Answer 1

对于完成字段，您不想使用ngram分析器。 completion字段将自动为所有前缀编制索引并针对前缀查询进行优化，因此您要进行两次工作并使系统混乱。从空的completion字段开始，然后从那里开始。

Answer 2

我想形式化Honza在其中一个评论中提供的解决方案，以得到另一个答案。

问题不在于映射，而仅仅是由 .suggest()下没有返回hits方法。

建议现在可以在返回的字典中看到：

>>> response = query.execute()
>>> print(response)
<Response: {}>
>>> response.to_dict()
# output is
# {'query': {},
# 'suggest': {'title-suggestions': {'completion': {'field': 'title_suggest'},
# [...]

我还发现了有关此github issue的其他详细信息：

HonzaKral在27天前评论了

Response对象提供对具有以下内容的任何字段的访问：   由elasticsearch返回。为了方便起见，有一个捷径   可以迭代点击，这是最常见的   也很容易做到。 对于响应的其他部分，例如汇总或   建议，您需要像这样明确访问它们   response.suggest.foo.options。

弹性搜索建议返回零结果

到目前为止我做了什么

索引/映射设置：

映射输出：

尝试搜索

失败的部分

更新

验证`title_suggest`是否已正确编入索引

再次尝试搜索：

验证映射：

2 个答案:

弹性搜索建议返回零结果

到目前为止我做了什么

索引/映射设置：

映射输出：

尝试搜索

失败的部分

更新

验证title_suggest是否已正确编入索引

再次尝试搜索：

验证映射：

2 个答案:

验证`title_suggest`是否已正确编入索引