Question

我已经开始学习ElasticSearch的过程，我想知道是否有人可以通过提供一些如何构建几个查询的示例来帮助我简化过程。

这是我的示例架构......

PUT /sales/_mapping
{   
   "sale": {
      "properties": {
         "productCode: {"type":"string"},
         "productTitle": {"type": "string"},
         "quantity" : {"type": "integer"},
         "unitPrice" : {"type": double}
       }
   }
}

POST /sales/1
{"productCode": "A", "productTitle": "Widget", "quantity" : 5, "unitPrice":
5.50}
POST /sales/2
{"productCode": "B", "productTitle": "Gizmo", "quantity" : 10, "unitPrice": 1.10}
POST /sales/3
{"productCode": "C", "productTitle": "Spanner", "quantity" : 5, "unitPrice": 
9.00}
POST /sales/4
{"productCode": "A", "productTitle": "Widget", "quantity" : 15, "unitPrice": 
5.40}
POST /sales/5
{"productCode": "B", "productTitle": "Gizmo", "quantity" : 20, "unitPrice": 
1.00}
POST /sales/6
{"productCode": "B", "productTitle": "Gizmo", "quantity" : 30, "unitPrice": 
0.90}
POST /sales/7
{"productCode": "B", "productTitle": "Gizmo", "quantity" : 40, "unitPrice": 
0.80}
POST /sales/8
{"productCode": "C", "productTitle": "Spanner", "quantity" : 100, 
"unitPrice": 7.50}
POST /sales/9
{"productCode": "C", "productTitle": "Spanner", "quantity" : 200, 
"unitPrice": 5.50}

我需要什么查询来生成以下结果？

A）。显示按产品代码分组的文档数

Product code  Title    Count
A             Widget   2
B             Gizmo    4
C             Spanner  3

B）。按产品代码显示销售的总单位，即

Product code  Title    Total units sold
A             Widget   20
B             Gizmo    100
C             Spanner  305

TIA

Answer 1

您可以使用aggregations完成此操作，尤其是Terms Aggregations。它可以在一次运行中完成，方法是将它们包含在查询结构中;为了指示ES根据聚合生成分析数据，您需要包含aggregations对象（或aggs），并在其中指定您希望ES在您的数据上运行的聚合类型

{
  "query": {
    "match_all": {}
  },
  "aggs": {
    "group_by_product": {
      "terms": {
        "field": "productCode"
      },
      "aggs": {
        "units_sold": {
          "sum": {
            "field": "quantity"
          }
        }
      }
    }
  }
}

通过运行该查询，除了搜索产生的命中之外（在这种情况下，我们正在进行全部匹配），并且将在响应对象中包含其他对象，并保留相应的结果聚合。例如

{
  ...
  "hits": {
    "total": 6,
    "max_score": 1,
    "hits": [ ... ]
  },
  "aggregations": {
    "group_by_product": {
      "doc_count_error_upper_bound": 0,
      "sum_other_doc_count": 0,
      "buckets": [
        {
          "key": "b",
          "doc_count": 3,
          "units_sold": {
            "value": 60
          }
        },
        {
          "key": "a",
          "doc_count": 2,
          "units_sold": {
            "value": 20
          }
        },
        {
          "key": "c",
          "doc_count": 1,
          "units_sold": {
            "value": 5
          }
        }
      ]
    }
  }
}

为简洁起见，我省略了响应对象的一些细节，并突出显示了aggregations对象中的重要部分。您可以看到聚合数据如何由不同的存储桶组成，每个存储桶代表在文档中找到的不同产品类型（由key键标识），doc_count包含每种产品类型的出现次数，和unit_sold对象，保留每种产品类型的总销售量总和。

要考虑的一件重要事情是，要在字符串或文字字段上执行聚合，您需要启用 fielddata 在字段映射中设置，因为默认情况下在所有基于文本的字段上禁用该设置。为了更新映射，例如。在产品代码字段中，您只需要对索引中相应的映射类型的PUT请求，例如

PUT http://localhost:9200/sales/sale/_mapping
{
  "properties": {
    "productCode": {
      "type": "string",
      "fielddata": true
    }
  }
}

（关于fielddata设置的更多info）

弹性查询组

1 个答案: