我们编写了一个Elasticsearch查询,用于从特定日期范围的索引中获取分组数据。但是,如果我们增加日期范围,则查询大小会随着动态添加的日期范围子句而增加。 动态增加查询样本
"query": {
"bool": {
"filter": [
{
"bool": {
"minimum_should_match": 1,
"must": [
{
"range": {
"startDate": {
"gte": "2018-05-28T21:00:00Z",
"lte": "2021-04-04T20:59:59Z"
}
}
}
],
"should": [
{
"bool": {
"must": [
{
"range": {
"startDate": {
"gte": "2019-12-24T04:30:00Z",
"lte": "2019-12-24T14:00:00Z"
}
}
}
]
}
},
{
"bool": {
"must": [
{
"range": {
"startDate": {
"gte": "2020-11-09T04:30:00Z",
"lte": "2020-11-09T14:00:00Z"
}
}
}
]
}
},
{
"bool": {
"must": [
{
"range": {
"startDate": {
"gte": "2020-07-28T14:00:00Z",
"lte": "2020-07-28T20:59:00Z"
}
}
}
]
}
}
]
}
},
{
"term": {
"tenantId": {
"value": "b29aadd8-b1bb-4754-ab26-b59eebe6d86a"
}
}
},
{
"term": {
"status.keyword": {
"value": "ProductionEnd"
}
}
},
{
"range": {
"startDate": {
"gte": "2018-05-28T21:00:00Z",
"lte": "2021-04-04T20:59:59Z"
}
}
}
]
}},
我们有基于时间的数据,我们想按上面的日期时间过滤它们,但我们想过滤3个月的数据范围,并且会有太多的范围过滤器,并且会出现错误( “ too_many_clauses “ ),因为查询量较大。因此,我们要减少查询子句。我们如何重写查询?
谢谢
答案 0 :(得分:1)
我认为,您的选择之一是将如此大的应当查询拆分为更小的应查询查询。这样,布尔查询不会扩展1024个子句的限制。
bool
|___should
| |___should query with 1024 range queries
| |___should query with 1024 range queries
| |___... range queries
这是我在说什么的简单示例
var ranges = Enumerable.Range(0, 3000).Select((x, i) =>
new QueryContainer(new DateRangeQuery {Name = $"query_{i}", Field = $"date", GreaterThan = "now"}));
var part1 = ranges.Take(1024)
.Aggregate((agg, q) => agg || q);
var part2 = ranges.Skip(1024).Take(1024)
.Aggregate((agg, q) => agg || q);
var searchResponse = await client.SearchAsync<object>(s => s
.Query(q => q.Bool(b => b.Should(part1, part2))));
希望有帮助。