为什么我的同义词没有返回?

时间:2014-07-22 16:09:21

标签: search elasticsearch

我是Elasticsearch的新手,现在我正试图找出为什么我的同义词没有像我期望的那样返回任何结果。

我为我的同义词文件创建了一个自定义过滤器和分析器,并将分析器应用于_all字段并明确定义specialty字段以使用它。

当我在没有分析器/标记器的情况下搜索"specialty": "aids" 时,它会按预期给出零结果。

但是,当我使用分析器/标记器搜索"specialty": "aids" 时,我希望它能为我提供与搜索"speciality": "retrovirology"相同的结果,这会产生3个结果,但它什么都没有回来。

我接近这个有什么不对吗?


以下是我的设置和一些示例数据:

curl -XDELETE "http://localhost:9200/personsearch"

curl -XPUT "http://localhost:9200/personsearch" -d'
{
  "settings": {
    "index": {
      "analysis": {
        "analyzer": {
          "XYZSynAnalyzer": {
            "tokenizer": "standard",
            "filter": [
              "XYZSynFilter"
            ]
          }
        },
        "filter": {
          "XYZSynFilter": {
            "type": "synonym",
            "synonyms": [
              "aids, retrovirology"
            ]
          }
        }
      }
    }
  },
  "mappings": {
    "xyzemployee": {
      "_all": {
        "analyzer": "XYZSynAnalyzer"
      },
      "properties": {
        "firstName": {
          "type": "string"
        },
        "lastName": {
          "type": "string"
        },
        "middleName": {
          "type": "string",
          "include_in_all": false,
          "index": "not_analyzed"
        },
        "specialty": {
          "type": "string",
          "analyzer": "XYZSynAnalyzer"
        }
      }
    }
  }
}'

curl -XPUT "http://localhost:9200/personsearch/xyzemployee/1" -d'
{
  "firstName": "Don",
  "middleName": "W.",
  "lastName": "White",
  "specialty": "Adult Retrovirology"
}'

curl -XPUT "http://localhost:9200/personsearch/xyzemployee/2" -d'
{
  "firstName": "Terrance",
  "middleName": "G.",
  "lastName": "Gartner",
  "specialty": "Retrovirology"
}'

curl -XPUT "http://localhost:9200/personsearch/xyzemployee/3" -d'
{
  "firstName": "Carter",
  "middleName": "L.",
  "lastName": "Taylor",
  "specialty": "Pediatric Retrovirology"
}'

# Why is this returning nothing?
curl -XGET "http://localhost:9200/personsearch/xyzemployee/_search?pretty=true" -d'
{
  "query": {
    "match": {
      "specialty": "retrovirology"
    }
  }
}'

1 个答案:

答案 0 :(得分:1)

你不能在任何地方小写。 试试这个:

{
 "settings": {
   "index": {
     "analysis": {
       "analyzer": {
         "XYZSynAnalyzer": {
           "tokenizer": "standard",
           "filter": [
             "lowercase", "XYZSynFilter"
           ]
         }
       },
       "filter": {
         "XYZSynFilter": {
           "type": "synonym",
           "synonyms": [
             "aids, retrovirology"
           ]
         }
       }
     }
   }
 }

注意:您可能希望拆分索引分析器和搜索分析器,并只选择其中一个来执行同义词。仅在索引期间扩展它们将加快搜索结果。