Hue没有找到具体的单词

时间:2016-09-01 14:38:08

标签: solr lucene cloudera hue

我已经使用cloudera CDH 5.0和Solr 4.4版索引文件一段时间了,我得到了一些奇怪的选择结果。例如,对于特定单词"odi",它找不到任何匹配项,但它确实存在于我正在编制索引的文件中,但是当我使用*odi or odi*时它会找到它。我在那个词之前和之后检查了特殊字符,它的两边都有空格。

有没有人经历过类似的事情?如果是这样你做了什么来解决它?

以下是架构默认配置:

 <fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
   <tokenizer class="solr.StandardTokenizerFactory"/>
   <!-- in this example, we will only use synonyms at query time
   <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
   -->
   <!-- Case insensitive stop word removal. -->
   <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_en.txt" />
   <filter class="solr.LowerCaseFilterFactory"/>
   <filter class="solr.EnglishPossessiveFilterFactory"/>
   <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
   <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
   <filter class="solr.EnglishMinimalStemFilterFactory"/>
   -->
   <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
   <tokenizer class="solr.StandardTokenizerFactory"/>
   <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
   <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_en.txt" />
   <filter class="solr.LowerCaseFilterFactory"/>
   <filter class="solr.EnglishPossessiveFilterFactory"/>
   <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
   <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
   <filter class="solr.EnglishMinimalStemFilterFactory"/>
   -->
   <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
 </fieldType>

0 个答案:

没有答案
相关问题