ModeShape Distribution 3.0.0.Beta4
java.lang.Object
  org.modeshape.jcr.query.parse.FullTextSearchParser.TermTokenizer
public static class FullTextSearchParser.TermTokenizer
A basic TokenStream.Tokenizer
implementation that ignores whitespace but includes tokens for individual symbols, the period
('.'), single-quoted strings, double-quoted strings, whitespace-delimited words, and optionally comments.
Note that this Tokenizer is not appropriate for every situation; it is provided only as a convenience for the cases that can use it.
Field Summary | |
---|---|
static int | DOUBLE_QUOTED_STRING - The token type for tokens that consist of all the characters within double-quotes. |
static int | PLUS_MINUS - The token type for tokens that consist of an individual '+' or '-' character. |
static int | SINGLE_QUOTED_STRING - The token type for tokens that consist of all the characters within single-quotes. |
static int | WORD - The token type for tokens that represent an unquoted string containing a character sequence made up of non-whitespace and non-symbol characters. |
Constructor Summary | |
---|---|
protected | FullTextSearchParser.TermTokenizer() |
Method Summary | |
---|---|
void | tokenize(TokenStream.CharacterStream input, TokenStream.Tokens tokens) - Process the supplied characters and construct the appropriate TokenStream.Token objects. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final int WORD
The token type for tokens that represent an unquoted string containing a character sequence made
up of non-whitespace and non-symbol characters.
public static final int PLUS_MINUS
The token type for tokens that consist of an individual '+' or '-' character. The set of
characters includes: -+
public static final int SINGLE_QUOTED_STRING
The token type for tokens that consist of all the characters within single-quotes. Single quote
characters are included if they are preceded (escaped) by a '\' character.
public static final int DOUBLE_QUOTED_STRING
The token type for tokens that consist of all the characters within double-quotes. Double quote
characters are included if they are preceded (escaped) by a '\' character.
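The four token types above can be illustrated with a small, self-contained sketch. It is not the ModeShape implementation: the class `TermTokenSketch`, its `Token` type, the constant values, and the use of `IllegalArgumentException` are all hypothetical. The sketch also assumes that a word ends at whitespace, '+', '-', or a quote character, that the enclosing quotes are left out of the token text, and that the escaping backslash is kept; none of these details are confirmed by this page.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified, hypothetical stand-in for illustration; not the ModeShape implementation. */
final class TermTokenSketch {
    // Placeholder values: the real constants' values are not given on this page.
    static final int WORD = 1;
    static final int PLUS_MINUS = 2;
    static final int SINGLE_QUOTED_STRING = 3;
    static final int DOUBLE_QUOTED_STRING = 4;

    /** A token is just a type plus the text it covers. */
    static final class Token {
        final int type;
        final String text;
        Token(int type, String text) { this.type = type; this.text = text; }
    }

    static List<Token> tokenize(String input) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (Character.isWhitespace(c)) {
                i++; // whitespace only separates tokens
            } else if (c == '+' || c == '-') {
                tokens.add(new Token(PLUS_MINUS, String.valueOf(c)));
                i++;
            } else if (c == '\'' || c == '"') {
                int close = findClosingQuote(input, i);
                int type = (c == '\'') ? SINGLE_QUOTED_STRING : DOUBLE_QUOTED_STRING;
                // Assumption: the token text excludes the enclosing quotes.
                tokens.add(new Token(type, input.substring(i + 1, close)));
                i = close + 1;
            } else {
                int end = i;
                while (end < input.length() && !endsWord(input.charAt(end))) end++;
                tokens.add(new Token(WORD, input.substring(i, end)));
                i = end;
            }
        }
        return tokens;
    }

    // Assumption: whitespace, '+', '-', and quote characters end a word.
    private static boolean endsWord(char c) {
        return Character.isWhitespace(c) || c == '+' || c == '-' || c == '\'' || c == '"';
    }

    // Returns the index of the matching quote, treating a backslash as an escape.
    private static int findClosingQuote(String input, int open) {
        char quote = input.charAt(open);
        int i = open + 1;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (c == '\\') {
                i += 2; // skip the backslash and the character it escapes
            } else if (c == quote) {
                return i;
            } else {
                i++;
            }
        }
        throw new IllegalArgumentException("No closing quote for the quote at index " + open);
    }
}
```

Under these assumptions, the input `+hello -"big \"red\" dog"` produces a PLUS_MINUS token, a WORD token, a second PLUS_MINUS token, and one DOUBLE_QUOTED_STRING token whose text still contains the escaped quotes.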
Constructor Detail |
---|
protected FullTextSearchParser.TermTokenizer()
Method Detail |
---|
public void tokenize(TokenStream.CharacterStream input, TokenStream.Tokens tokens) throws ParsingException
Description copied from interface: TokenStream.Tokenizer
Process the supplied characters and construct the appropriate TokenStream.Token objects.
Specified by:
tokenize in interface TokenStream.Tokenizer
Parameters:
input - the character input stream; never null
tokens - the factory for TokenStream.Token objects, which records the order in which the tokens are created
Throws:
ParsingException - if there is an error while processing the character stream (e.g., a quote is not closed)
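The contract described here (characters in, tokens recorded in creation order through a factory, and an exception when a quote is never closed) can be sketched without any ModeShape types. Everything below is hypothetical: `TokenSink` stands in for the token factory, `UnclosedQuoteException` stands in for ParsingException, and the sketch handles only double-quoted runs and ignores backslash escapes to stay short.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration of the tokenize contract; not the ModeShape API. */
final class TokenizeContractSketch {
    /** Stand-in for the token factory: receives tokens in the order they are created. */
    interface TokenSink {
        void addToken(int startIndex, int endIndex);
    }

    /** Stand-in for the parsing error raised on malformed input. */
    static final class UnclosedQuoteException extends RuntimeException {
        final int index;
        UnclosedQuoteException(int index) {
            super("No closing quote for the quote at index " + index);
            this.index = index;
        }
    }

    /** Reports one token per double-quoted run; fails if a quote is never closed. */
    static void tokenizeQuoted(String input, TokenSink tokens) {
        int i = 0;
        while (i < input.length()) {
            if (input.charAt(i) != '"') {
                i++; // this sketch skips everything outside double quotes
                continue;
            }
            int close = input.indexOf('"', i + 1);
            if (close < 0) {
                throw new UnclosedQuoteException(i);
            }
            tokens.addToken(i + 1, close); // the characters between the quotes
            i = close + 1;
        }
    }

    /** Convenience caller: collects the token texts in the order the sink received them. */
    static List<String> collect(String input) {
        List<String> out = new ArrayList<>();
        tokenizeQuoted(input, (start, end) -> out.add(input.substring(start, end)));
        return out;
    }
}
```

Passing the sink as a parameter keeps the tokenizer free of any storage decisions, which mirrors the role the `tokens` parameter plays in the signature above.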