Class AnalyzerNames
Field Details
DEFAULT
The default analyzer. This analyzer will generally be used for full-text fields that don't require specific analysis.
Unless overridden by explicit analysis configuration, the default analyzer will be the standard analyzer:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
STANDARD
The standard analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
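The two steps above can be approximated in plain Java. This is only a sketch: it assumes the JDK's word `BreakIterator` is a close enough stand-in for the standard tokenizer (its boundaries are similar to, but not guaranteed identical to, the UAX #29 Word Break rules), and the class name `StandardAnalyzerSketch` is hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; not part of the documented API.
final class StandardAnalyzerSketch {

    // Approximates "standard tokenizer, then lowercase" with the JDK word BreakIterator.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        BreakIterator words = BreakIterator.getWordInstance(Locale.ROOT);
        words.setText(text);
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            String segment = text.substring(start, end);
            // Keep only segments containing a letter or digit;
            // this drops whitespace-only and punctuation-only segments.
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                tokens.add(segment.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Quick, Brown fox!")); // prints [the, quick, brown, fox]
    }
}
```

The actual analyzer is provided by the backend and may differ on edge cases such as numbers, apostrophes, or scripts without spaces between words.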
SIMPLE
The simple analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
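As a rough illustration of this behavior, the following sketch splits on every non-letter code point and lowercases what remains. It assumes `Character.isLetter` is an acceptable definition of "letter"; the class name `SimpleAnalyzerSketch` is hypothetical and the real analyzer may classify some characters differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of the documented API.
final class SimpleAnalyzerSketch {

    // Treats every non-letter code point as a separator and lowercases letters.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        text.codePoints().forEach(cp -> {
            if (Character.isLetter(cp)) {
                current.appendCodePoint(Character.toLowerCase(cp));
            } else if (current.length() > 0) {
                // A separator ends the token being built.
                tokens.add(current.toString());
                current.setLength(0);
            }
        });
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("Wi-Fi 6E router")); // prints [wi, fi, e, router]
    }
}
```

Note that digits count as separators here, so `6E` yields only the token `e`.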
WHITESPACE
The whitespace analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming whitespace characters are separators.
- Then, leave the tokens unchanged.
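A minimal sketch of this behavior, assuming that the regular expression `\s+` (ASCII whitespace in Java by default) is a good enough definition of "whitespace"; the backend's own definition may be broader. The class name `WhitespaceAnalyzerSketch` is hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical illustration class; not part of the documented API.
final class WhitespaceAnalyzerSketch {

    // Splits on runs of whitespace and keeps each token exactly as written.
    static List<String> analyze(String text) {
        return Arrays.stream(text.split("\\s+"))
                // Leading whitespace produces one empty string; drop it.
                .filter(token -> !token.isEmpty())
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(analyze("  Wi-Fi 6E,\trouter ")); // prints [Wi-Fi, 6E,, router]
    }
}
```

Case and punctuation are preserved, so `6E,` keeps its trailing comma.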
STOP
The stop analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
- Finally, remove English stop words.
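The three steps above could look roughly like the following sketch. The stop word set is a small illustrative subset chosen for this example, not the list the backend actually uses, and the class name `StopAnalyzerSketch` is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration class; not part of the documented API.
final class StopAnalyzerSketch {

    // Assumption: illustrative subset only; the real English stop word list is defined by the backend.
    static final Set<String> STOP_WORDS = Set.of("a", "an", "and", "in", "is", "of", "the", "to");

    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        // Step 1: split on runs of non-letter characters.
        for (String raw : text.split("\\P{L}+")) {
            // Step 2: lowercase each token.
            String token = raw.toLowerCase(Locale.ROOT);
            // Step 3: drop empty strings and stop words.
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Art of War, in 13 chapters")); // prints [art, war, chapters]
    }
}
```

Lowercasing happens before the stop word check, so `The` is removed just like `the`.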
KEYWORD
The keyword analyzer. Unless overridden by explicit analysis configuration, this analyzer does not change the text in any way.
With this analyzer, a full-text field would behave similarly to a keyword field, but with fewer features: no terms aggregations, for example.
Consider using a keyword field instead.
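For comparison with the other analyzers, "does not change the text in any way" can be pictured as emitting the whole input as one token. This is a sketch with a hypothetical class name, `KeywordAnalyzerSketch`, not the backend's implementation.

```java
import java.util.List;

// Hypothetical illustration class; not part of the documented API.
final class KeywordAnalyzerSketch {

    // Emits the entire input as a single, unmodified token.
    static List<String> analyze(String text) {
        return List.of(text);
    }

    public static void main(String[] args) {
        System.out.println(analyze("Wi-Fi 6E router")); // prints [Wi-Fi 6E router]
    }
}
```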