Class AnalyzerNames
java.lang.Object
    org.hibernate.search.engine.backend.analysis.AnalyzerNames
public final class AnalyzerNames extends Object
Constants for the names of built-in analyzers.
-
-
Field Summary
Fields
Modifier and Type    Field         Description
static String        DEFAULT       The default analyzer.
static String        KEYWORD       The keyword analyzer.
static String        SIMPLE        The simple analyzer.
static String        STANDARD      The standard analyzer.
static String        STOP          The stop analyzer.
static String        WHITESPACE    The whitespace analyzer.
-
-
-
Field Detail
-
DEFAULT
public static final String DEFAULT
The default analyzer. This analyzer will generally be used for full-text fields that don't require specific analysis.
Unless overridden by explicit analysis configuration, the default analyzer will be the standard analyzer:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
- See Also:
- Constant Field Values
-
STANDARD
public static final String STANDARD
The standard analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
- See Also:
- Constant Field Values
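The two steps described for the standard analyzer can be approximated in plain Java. This is only a rough sketch: it uses the JDK's `java.text.BreakIterator` as a stand-in for the standard tokenizer, which is an assumption made for illustration and may split some inputs differently from the actual backend. The class name `StandardAnalyzerSketch` and the method `analyze` are hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class StandardAnalyzerSketch {

    // Approximates "tokenize on word boundaries, then lowercase".
    // BreakIterator is a JDK stand-in, not the tokenizer used by the backend.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        BreakIterator words = BreakIterator.getWordInstance(Locale.ROOT);
        words.setText(text);
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            String segment = text.substring(start, end);
            // Keep only segments containing a letter or digit; drop spaces and punctuation.
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                tokens.add(segment.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("Hello, World"));
    }
}
```

Since `DEFAULT` resolves to the standard analyzer unless overridden, the same sketch would apply to it as well.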
-
SIMPLE
public static final String SIMPLE
The simple analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
- See Also:
- Constant Field Values
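As an illustration of the behavior described for the simple analyzer, the following plain-Java sketch splits on runs of non-letter characters and lowercases each token. It assumes that "letter" means the Unicode letter category (`\p{L}`); the class name `SimpleAnalyzerSketch` and the method `analyze` are hypothetical and not part of Hibernate Search.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class SimpleAnalyzerSketch {

    // Splits on any run of non-letter characters, then lowercases each token.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.split("[^\\p{L}]+")) {
            // split() can yield an empty leading element when the text starts with a separator.
            if (!token.isEmpty()) {
                tokens.add(token.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Digits and hyphens count as separators here, so "6E" contributes only "e".
        System.out.println(analyze("Wi-Fi 6E router"));
    }
}
```

Note how digits are discarded entirely, which is one reason this analyzer may be a poor fit for text containing product codes or version numbers.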
-
WHITESPACE
public static final String WHITESPACE
The whitespace analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming whitespace characters are separators.
- Do not change the tokens.
- See Also:
- Constant Field Values
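The whitespace behavior can be sketched in a few lines of plain Java. This approximation uses the regex class `\s`, which may not cover exactly the same set of whitespace characters as the real tokenizer; the class name `WhitespaceAnalyzerSketch` and the method `analyze` are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class WhitespaceAnalyzerSketch {

    // Splits on runs of whitespace and leaves each token unchanged (case and punctuation kept).
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.split("\\s+")) {
            // Leading whitespace produces an empty first element, which is skipped.
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("  Wi-Fi 6E\tRouter "));
    }
}
```

Because tokens are not lowercased, a query for "router" would not match the token "Router" under this scheme.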
-
STOP
public static final String STOP
The stop analyzer. Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
- Finally, remove English stop words.
- See Also:
- Constant Field Values
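The three steps listed for the stop analyzer might look like the following in plain Java. The stop-word set below is a small illustrative subset chosen for this sketch, not the list used by the actual backend; the class name `StopAnalyzerSketch`, the constant `STOP_WORDS`, and the method `analyze` are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopAnalyzerSketch {

    // Illustrative subset only; the real English stop-word list is longer.
    static final Set<String> STOP_WORDS = Set.of("a", "an", "and", "of", "the", "to");

    // Splits on non-letters, lowercases, then drops tokens found in STOP_WORDS.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}]+")) {
            String token = raw.toLowerCase(Locale.ROOT);
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Quick and the Dead"));
    }
}
```

Lowercasing happens before the stop-word check, so "The" and "the" are both removed.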
-
KEYWORD
public static final String KEYWORD
The keyword analyzer. Unless overridden by explicit analysis configuration, this analyzer does not change the text in any way.
With this analyzer, a full-text field would behave similarly to a keyword field, but with fewer features: no terms aggregations, for example.
Consider using a keyword field instead.
- See Also:
- Constant Field Values
-
-