Elasticsearch按自定义项目权重排序

时间:2019-10-23 12:24:19

标签: javascript elasticsearch

我已经存储了包含status属性的文档。我想按状态优先级(而不是按字母顺序排列的状态)对文档进行排序。我遵循了先前的答案,并组成了以下功能,但仍无法按预期工作;文档按状态名称(按字母顺序)排序:

function getESSortingByStatusQuery(query, order) {
        let statusOrder = ['BLUE', 'RED', 'BLACK', 'YELLOW', 'GREEN'];
        if(order == 'desc'){
            statusOrder.reverse();
        }
        const functions = statusOrder.map((item) => {
            const idx = statusOrder.indexOf(item);
            return {filter: {match: {statusColor: item}},
                weight: (idx + 1) * 50}
        });
        const queryModified = {
            "function_score": {
                "query": {"match_all": {}}, // this is for testing purposes and should be replaced with original query
                "boost": "5",
                "functions": functions,
                "score_mode": "multiply",
                "boost_mode": "replace"
            }
        }
        return queryModified;
    }

如果有人建议根据属性的预定义优先级(在这种情况下为状态)对项目进行排序的方式,我将不胜感激。

2 个答案:

答案 0 :(得分:1)

下面是一个示例custom sort script,我认为这是您正在寻找的示例。我添加了示例映射,文档,查询和响应(如其显示的那样)。

映射:

PUT color_index
{
  "mappings": {
    "properties": {
      "color":{
        "type": "keyword"
      },
      "product":{
        "type": "text"
      }
    }
  }
}

样本文档:

POST color_index/_doc/1
{
  "color": "BLUE",
  "product": "adidas and nike"
}

POST color_index/_doc/2
{
  "color": "GREEN",
  "product": "adidas and nike and puma"
}

POST color_index/_doc/3
{
  "color": "GREEN",
  "product": "adidas and nike"
}

POST color_index/_doc/4
{
  "color": "RED",
  "product": "nike"
}

POST color_index/_doc/5
{
  "color": "RED",
  "product": "adidas and nike"
}

查询:

POST color_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "query_string": {
            "default_field": "*",
            "query": "adidas OR nike"
          }
        }
      ]
    }
  },
  "sort": [
    { "_score": { "order": "desc"} },          <---- First sort by score
    { "_script": {                             <---- Second sort by Colors
            "type": "number",
            "script": {
                "lang": "painless",
                "source": "if(params.scores.containsKey(doc['color'].value)) { return params.scores[doc['color'].value];} return 100000;",
                "params": {
                    "scores": {
                        "BLUE": 0,
                        "RED": 1,
                        "BLACK": 2,
                        "YELLOW": 3,
                        "GREEN": 4
                    }
                }
            },
            "order": "asc"
        }

    }
  ]
}

首先它将返回按分数排序的文档,然后将第二种排序逻辑应用于该结果。

对于第二种排序,即使用脚本排序,请注意我如何将数字值添加到scores部分的颜色中。您需要相应地构造查询。

工作原理的逻辑在source部分中,我认为这是不言而喻的,我在其中使用了 doc ['color']。value ,因为这是我的专长我正在应用自定义排序逻辑。

响应:

{
  "took" : 5,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 5,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "BLUE",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,                     <--- This value is score(desc by nature)
          0.0                            <--- This value comes from script sort as its BLUE and I've used value 0 in the script which is in 'asc' order
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "5",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "RED",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,
          1.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "3",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "GREEN",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,
          4.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "2",
        "_score" : 0.40538198,
        "_source" : {
          "color" : "GREEN",
          "product" : "adidas and nike and puma"
        },
        "sort" : [
          0.40538198,
          4.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "4",
        "_score" : 0.10189847,
        "_source" : {
          "color" : "RED",
          "product" : "nike"
        },
        "sort" : [
          0.10189847,
          1.0
        ]
      }
    ]
  }
}

请注意前三个文档,它的确切值是product但有不同的color,您可以看到它们在我们首先按_score进行排序然后按以下方式进行排序时被分组在一起color

让我知道这是否有帮助!

答案 1 :(得分:1)

Here's the code sample of sorting result. I think this will helps you. If you don't want to get entire documents as result you can filter results using includes. 

GET testindex/_search
{
  "_source": {
"includes": [
  "filed1"
]
},
  "aggs": {
    "emp_figures": {
      "terms": {
        "field": "status"
      }
    }
}
}

This is the sample result you should retrieve 
{
  "took": 11,
  "timed_out": false,
  "_shards": {
    "total": 2,
    "successful": 2,
    "failed": 0
  },
  "hits": {
    "total": 84968,
    "max_score": 1,
    "hits": [
      {
        "_index": "test",
        "_type": "type",
        "_id": "0001",
        "_score": 1,
        "_source": {
          "filed1": "color1,
          }
        },
         {
        "_index": "test",
        "_type": "type",
        "_id": "0002",
        "_score": 1,
        "_source": {
          "filed1": "color2,
          }
        }
      }
    }
}