订单条款按地理距离汇总

时间:2016-11-16 23:50:04

标签: elasticsearch chewy-gem

所以我在这里有一个问题......

我正在使用chewy ruby​​ gem与Elasticsearch进行通信

=> #<Chewy::SnippetPagesIndex::Query:0x007f911c6b1610
 @_collection=nil,
 @_fully_qualified_named_aggs={"chewy::snippetpagesindex"=>{"chewy::snippetpagesindex::snippetpage"=>{}}},
 @_indexes=[Chewy::SnippetPagesIndex],
 @_named_aggs={},
 @_request=nil,
 @_response=nil,
 @_results=nil,
 @_types=[],
 @criteria=
  #<Chewy::Query::Criteria:0x007f911c6b1458
   @aggregations=
    {:group_by=>{:terms=>{:field=>"seo_area.suburb.id", :order=>{:_count=>"asc"}}, :aggs=>{:by_top_hit=>{:top_hits=>{:size=>10}}}}},
   @facets={},
   @fields=[],
   @filters=
    [{:geo_distance=>{:distance=>"100km", "seo_area.suburb.coordinates"=>"-27.9836052, 153.3977354"}},
     {:bool=>
       {:must_not=>[{:terms=>{:id=>[1]}}, {:terms=>{"seo_area.suburb.id"=>[5559]}}],
        :must=>[{:term=>{:path_category=>"garden-services"}}, {:term=>{:status=>"active"}}, {:exists=>{:field=>"path_area"}}],
        :should=>[]}}],
   @options=
    {:query_mode=>:must,
     :filter_mode=>:and,
     :post_filter_mode=>:and,
     :preload=>
      {:scope=>
        #<Proc:0x007f911c6b1700@/Users/serviceseeking/Work/serviceseeking/engines/seo/app/concepts/seo/snippet_page/twins/search.rb:45 (lambda)>},
     :loaded_objects=>true},
   @post_filters=[],
   @queries=[],
   @request_options={},
   @scores=[],
   @script_fields={},
   @search_options={},
   @sort=[{:_geo_distance=>{"seo_area.suburb.coordinates"=>"-27.9836052, 153.3977354", :order=>"asc", :unit=>"km"}}],
   @suggest={},
   @types=[]>,
 @options={}>

我正在使用Elasticsearch聚合,因此在访问聚合时,查询/搜索阶段的任何排序都将消失。

我一直在传递的是......

     aggs: {
        by_seo_area_suburb_id: {
          terms: {
            field: "seo_area.suburb.id",
            size: 10,
            order: { by_distance: "desc" }
          },
          aggs: {
            by_top_hit: {
              top_hits: { size: 10 }
            },
            by_distance: {
              geo_distance: {
                field: "seo_area.suburb.coordinates",
                origin: "52.3760, 4.894",
                ranges: [
                  { from: 0, to: 1 },
                  { from: 1, to: 2 }
                ]
              }
            }
          }
        }
      }

虽然我收到了这个错误...

[500] {"error":{"root_cause":[{"type":"aggregation_execution_exception","reason":"Invalid terms aggregation order path [by_distance]. Terms buckets can only be sorted on a sub-aggregator path that is built out of zero or more single-bucket aggregations within the path and a final single-bucket or a metrics aggregation at the path end."}],"type":"search_phase_execution_exception","reason":"all shards failed","phase":"query","grouped":true,"failed_shards":[{"shard":0,"index":"snippet_pages","node":"srrlBssmSEGsqpZnPnOJmA","reason":{"type":"aggregation_execution_exception","reason":"Invalid terms aggregation order path [by_distance]. Terms buckets can only be sorted on a sub-aggregator path that is built out of zero or more single-bucket aggregations within the path and a final single-bucket or a metrics aggregation at the path end."}}]},"status":500}

简单地说......

术语存储区只能在子聚合器路径上排序,该路径由路径中的零个或多个单桶聚合构成,最后一个桶或路径末端的度量聚合。

有什么想法吗?

1 个答案:

答案 0 :(得分:0)

你有这样的Buckets:

1-2

2-3

4-5

等等。这些都不是具有自然顺序的单值桶。这是异常告诉你的。因此,您需要将某些内容融合为单个值。

即使您可以按此订购。你为什么这样?距离介于1和2之间的所有距离都具有相同的值,并且它们的排序将是未定义的。如果它足以让你知道哪些是0-1和1-2等等,只需转动聚合顺序。首先考虑距离并对术语进行分解。

总而言之,我认为您有一个用例,其中聚合不是您想要的,因为请考虑以下两个文档:

{ name: "peter", location: [0,0] }
{ name: "peter", location: [100,0] }
显然,两个彼得都会在术语聚合中融为一体。但它们有两个不同的位置,因此距离(几乎)总是不同的。那你怎么能按距离订购彼得?一旦你聚合一个字段,所有其他字段或多或少都会与它分离,你不能使用其他字段。

因此。如果您想要这样的东西,您很可能必须通过正常搜索。看一下如何按距离对搜索进行排序:

https://www.elastic.co/guide/en/elasticsearch/guide/current/sorting-by-distance.html