Class AnalyzerNames


  • public final class AnalyzerNames
    extends Object
    Constants for the names of built-in analyzers.
    • Field Detail

      • DEFAULT

        public static final String DEFAULT
        The default analyzer.

        This analyzer will generally be used for full-text fields that don't require specific analysis.

        Unless overridden by explicit analysis configuration, the default analyzer will be the standard analyzer:

        • First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
        • Then, lowercase each token.
        See Also:
        Constant Field Values
      • STANDARD

        public static final String STANDARD
        The standard analyzer.

        Unless overridden by explicit analysis configuration, this analyzer behaves as follows:

        • First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
        • Then, lowercase each token.
        See Also:
        Constant Field Values
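
        The two steps above can be approximated in plain Java. This is a rough sketch, not the analyzer's actual implementation: it assumes the JDK's java.text.BreakIterator is an acceptable stand-in for the standard tokenizer (its word-boundary rules are similar to, but not guaranteed identical to, Unicode Standard Annex #29), and the class name StandardAnalyzerSketch is hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; not the real standard analyzer.
final class StandardAnalyzerSketch {

    static List<String> analyze(String text) {
        // Assumption: BreakIterator approximates UAX #29 word breaks.
        BreakIterator words = BreakIterator.getWordInstance(Locale.ROOT);
        words.setText(text);

        List<String> tokens = new ArrayList<>();
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            String segment = text.substring(start, end);
            // Keep only segments containing a letter or digit,
            // which drops pure whitespace and punctuation segments.
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                // Second step: lowercase each token.
                tokens.add(segment.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("Hello, World!"));
    }
}
```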
      • SIMPLE

        public static final String SIMPLE
        The simple analyzer.

        Unless overridden by explicit analysis configuration, this analyzer behaves as follows:

        • First, tokenize by assuming non-letter characters are separators.
        • Then, lowercase each token.
        See Also:
        Constant Field Values
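
        As a rough illustration of the behavior described above, the following plain-Java sketch splits on every non-letter character and lowercases what remains. The class name SimpleAnalyzerSketch is hypothetical, and the code only mimics the described behavior; it is not the analyzer's implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the real simple analyzer.
final class SimpleAnalyzerSketch {

    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (Character.isLetter(cp)) {
                // Letters extend the current token, lowercased as they go.
                current.appendCodePoint(Character.toLowerCase(cp));
            } else if (current.length() > 0) {
                // Any non-letter (digit, punctuation, space) ends the token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [wi, fi, is, fast]
        System.out.println(analyze("Wi-Fi 6 is FAST"));
    }
}
```

        Note that the digit and the hyphen both act as separators, so "Wi-Fi 6" yields two tokens and the number is dropped.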
      • WHITESPACE

        public static final String WHITESPACE
        The whitespace analyzer.

        Unless overridden by explicit analysis configuration, this analyzer behaves as follows:

        • First, tokenize by assuming whitespace characters are separators.
        • Do not change the tokens.
        See Also:
        Constant Field Values
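
        For contrast with the other analyzers, here is a minimal plain-Java sketch of the behavior described above. The class name WhitespaceAnalyzerSketch is hypothetical, and the regular expression used here only matches ASCII whitespace, which may differ from what the real analyzer treats as whitespace.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the real whitespace analyzer.
final class WhitespaceAnalyzerSketch {

    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        // Assumption: "\\s+" (ASCII whitespace) is a good enough separator.
        for (String token : text.split("\\s+")) {
            // split() can yield an empty leading element; skip it.
            if (!token.isEmpty()) {
                // Tokens are kept as-is: no lowercasing, no punctuation removal.
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [Wi-Fi, 6, is, FAST]
        System.out.println(analyze("  Wi-Fi 6 is FAST "));
    }
}
```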
      • STOP

        public static final String STOP
        The stop analyzer.

        Unless overridden by explicit analysis configuration, this analyzer behaves as follows:

        • First, tokenize by assuming non-letter characters are separators.
        • Then, lowercase each token.
        • Finally, remove English stop words.
        See Also:
        Constant Field Values
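
        The three steps above can be sketched in plain Java as follows. The class name StopAnalyzerSketch is hypothetical, and the stop-word set is a small illustrative sample chosen for this example; the list actually used by the analyzer is not given here and is likely longer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not the real stop analyzer.
final class StopAnalyzerSketch {

    // Assumption: a tiny sample of English stop words, for illustration.
    private static final Set<String> SAMPLE_STOP_WORDS =
            Set.of("a", "an", "and", "is", "of", "the", "to");

    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        // Step 1: treat every run of non-letter characters as a separator.
        for (String raw : text.split("[^\\p{L}]+")) {
            if (raw.isEmpty()) {
                continue; // leading separator produces an empty element
            }
            // Step 2: lowercase each token.
            String token = raw.toLowerCase(Locale.ROOT);
            // Step 3: drop stop words.
            if (!SAMPLE_STOP_WORDS.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [quick, brown, fox, fast]
        System.out.println(analyze("The Quick-Brown fox is FAST"));
    }
}
```

        Lowercasing before the stop-word check matters: with the order reversed, "The" would not match the lowercase entry "the" and would survive.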
      • KEYWORD

        public static final String KEYWORD
        The keyword analyzer.

        Unless overridden by explicit analysis configuration, this analyzer does not change the text in any way.

        With this analyzer, a full-text field would behave similarly to a keyword field, but with fewer features: no terms aggregations, for example.

        Consider using a keyword field instead.

        See Also:
        Constant Field Values
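
        To make the contrast with the tokenizing analyzers concrete, this minimal plain-Java sketch returns the whole input as a single, unmodified token. The class name KeywordAnalyzerSketch is hypothetical, and the code only mimics the behavior described above.

```java
import java.util.List;

// Hypothetical illustration only; not the real keyword analyzer.
final class KeywordAnalyzerSketch {

    static List<String> analyze(String text) {
        // No tokenizing, no lowercasing: the input is the one and only token.
        return List.of(text);
    }

    public static void main(String[] args) {
        // prints [Wi-Fi 6 is FAST]
        System.out.println(analyze("Wi-Fi 6 is FAST"));
    }
}
```

        Because the text is kept whole, only a query for the exact original string would match such a token.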