2015-01-16 1 views

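Solr: including an em dash in a phrase search

I have indexed two documents with the following text, for example:

doc1 - Test—emdash

doc2 - Test without emdash

Querying for "—emdash" returns both documents. How should I configure the field so that only the document containing the exact phrase is returned?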

The field type used for indexing is textgen:

<fieldType name="textgen" class="solr.TextField" positionIncrementGap="100"> 
    <analyzer type="index"> 
    <tokenizer class="solr.WhitespaceTokenizerFactory"/> 
    <!-- <filter class="solr.ASCIIFoldingFilterFactory"/> --> 
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" /> 
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="0"/> 
    <filter class="solr.LowerCaseFilterFactory"/> 
    </analyzer> 
    <analyzer type="query"> 
    <tokenizer class="solr.WhitespaceTokenizerFactory"/> 
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> 
    <filter class="solr.StopFilterFactory" 
      ignoreCase="true" 
      words="stopwords.txt" 
      enablePositionIncrements="true" 
      /> 
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/> 
    <filter class="solr.LowerCaseFilterFactory"/> 
    </analyzer> 
</fieldType> 

Answer


1) Remove the WordDelimiter filter.
2) Add a mapping char filter that maps the em dash to a plain hyphen: "—" => "-"
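
Some background on why both documents come back: WordDelimiterFilterFactory splits and discards non-alphanumeric characters, so at query time "—emdash" is reduced to the token emdash, which also occurs in doc2. The two steps above could be sketched roughly as follows. This is only an illustration, not a tested configuration: the field type name textgen_emdash and the file name mapping-emdash.txt are made up for the example, and the separate index/query analyzers (including the synonym filter) from the question are collapsed into one analyzer for brevity.

```xml
<!-- Hypothetical field type: "textgen_emdash" is an illustrative name. -->
<fieldType name="textgen_emdash" class="solr.TextField" positionIncrementGap="100">
  <!-- A single analyzer is applied at both index and query time. -->
  <analyzer>
    <!-- Char filters run before the tokenizer. The mapping file
         (assumed name: mapping-emdash.txt, placed in the conf directory)
         would contain one line:
           "\u2014" => "-"
         which rewrites the em dash (U+2014) to a plain hyphen. -->
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-emdash.txt"/>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <!-- WordDelimiterFilterFactory is intentionally left out (step 1). -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

The documents need to be reindexed after a change like this. Under this sketch, Test—emdash is indexed as the single token test-emdash, so whether a query for "—emdash" on its own matches depends on how the field ends up being tokenized. The Analysis screen in the Solr admin UI is a convenient way to check the tokens produced at index and query time.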