java.lang.Object
  org.modeshape.graph.query.parse.FullTextSearchParser.TermTokenizer
public static class FullTextSearchParser.TermTokenizer

A basic TokenStream.Tokenizer implementation that ignores whitespace but includes tokens for individual symbols, the period
('.'), single-quoted strings, double-quoted strings, whitespace-delimited words, and optionally comments.
Note that this Tokenizer may not be appropriate in many situations; it is provided merely as a convenience for those situations that can use it.
| Field Summary | |
|---|---|
| static int | DOUBLE_QUOTED_STRING: The token type for tokens that consist of all the characters within double-quotes. |
| static int | PLUS_MINUS: The token type for tokens that consist of an individual '+' or '-' character. |
| static int | SINGLE_QUOTED_STRING: The token type for tokens that consist of all the characters within single-quotes. |
| static int | WORD: The token type for tokens that represent an unquoted string containing a character sequence made up of non-whitespace and non-symbol characters. |
| Constructor Summary | |
|---|---|
| protected | FullTextSearchParser.TermTokenizer() |
| Method Summary | |
|---|---|
| void | tokenize(TokenStream.CharacterStream input, TokenStream.Tokens tokens): Process the supplied characters and construct the appropriate TokenStream.Token objects. |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final int WORD
The token type for tokens that represent an unquoted string containing a character sequence made
up of non-whitespace and non-symbol characters.

public static final int PLUS_MINUS
The token type for tokens that consist of an individual '+' or '-' character. The set of
characters includes: -+

public static final int SINGLE_QUOTED_STRING
The token type for tokens that consist of all the characters within single-quotes. Single quote
characters are included if they are preceded (escaped) by a '\' character.

public static final int DOUBLE_QUOTED_STRING
The token type for tokens that consist of all the characters within double-quotes. Double quote
characters are included if they are preceded (escaped) by a '\' character.
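The four token types can be illustrated with a small standalone sketch. It is not the ModeShape implementation and does not use the ModeShape API: the class `TermTokenSketch`, its nested `Token` class, and the numeric type codes are hypothetical, chosen only for illustration. The sketch also assumes that a word runs until the next whitespace character, that the surrounding quotes are dropped from a quoted token's text, and that an unclosed quote is reported with an `IllegalArgumentException`.

```java
import java.util.ArrayList;
import java.util.List;

// Standalone illustration only; none of these names come from ModeShape.
class TermTokenSketch {
    // Hypothetical type codes (the real constant values are not listed on this page).
    static final int WORD = 1;
    static final int PLUS_MINUS = 2;
    static final int SINGLE_QUOTED_STRING = 4;
    static final int DOUBLE_QUOTED_STRING = 8;

    /** A token: its type code and its text. */
    static final class Token {
        final int type;
        final String text;
        Token(int type, String text) { this.type = type; this.text = text; }
    }

    /** Splits the input into WORD, PLUS_MINUS and quoted-string tokens, skipping whitespace. */
    static List<Token> tokenize(String input) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;                                   // whitespace never produces a token
            } else if (c == '+' || c == '-') {
                tokens.add(new Token(PLUS_MINUS, String.valueOf(c)));
                i++;
            } else if (c == '\'' || c == '"') {
                StringBuilder text = new StringBuilder();
                int j = i + 1;
                boolean closed = false;
                while (j < input.length()) {
                    char d = input.charAt(j);
                    if (d == '\\' && j + 1 < input.length() && input.charAt(j + 1) == c) {
                        text.append(c);                // escaped quote stays in the token
                        j += 2;
                    } else if (d == c) {
                        closed = true;                 // unescaped quote ends the token
                        j++;
                        break;
                    } else {
                        text.append(d);
                        j++;
                    }
                }
                if (!closed) {
                    throw new IllegalArgumentException("No closing quote for quote at index " + i);
                }
                int type = (c == '"') ? DOUBLE_QUOTED_STRING : SINGLE_QUOTED_STRING;
                tokens.add(new Token(type, text.toString()));
                i = j;
            } else {
                int j = i;                             // assumption: a word ends at whitespace
                while (j < input.length() && !Character.isWhitespace(input.charAt(j))) {
                    j++;
                }
                tokens.add(new Token(WORD, input.substring(i, j)));
                i = j;
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        for (Token t : tokenize("+hello -'it\\'s' world")) {
            System.out.println(t.type + " " + t.text);
        }
    }
}
```

Under these assumptions, a search expression such as `+hello -'it\'s' world` yields a PLUS_MINUS token, a WORD, another PLUS_MINUS token, a SINGLE_QUOTED_STRING whose text keeps the escaped quote, and a final WORD.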
| Constructor Detail |
|---|
protected FullTextSearchParser.TermTokenizer()
| Method Detail |
|---|
public void tokenize(TokenStream.CharacterStream input,
                     TokenStream.Tokens tokens)
              throws ParsingException
Process the supplied characters and construct the appropriate TokenStream.Token objects.
Specified by: tokenize in interface TokenStream.Tokenizer
Parameters:
input - the character input stream; never null
tokens - the factory for TokenStream.Token objects, which records the order in which the tokens are created
Throws:
ParsingException - if there is an error while processing the character stream (e.g., a quote is not closed)
See Also: org.modeshape.common.text.TokenStream.Tokenizer#tokenize(CharacterStream, Tokens)
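The contract described above, in which a tokenizer reads characters and hands each token to a factory that records creation order, can be sketched without the ModeShape classes. Everything named here is a hypothetical stand-in: `TokenSink` plays the role of the token factory, `RecordingSink` is one possible implementation, and `SketchParsingException` stands in for a checked parsing exception. The sketch handles only words and double-quoted strings, ignores escapes, and assumes that a quoted token's index range includes both quote characters.

```java
import java.util.ArrayList;
import java.util.List;

// Standalone illustration only; none of these names come from ModeShape.
class TokenSinkSketch {
    static final int WORD = 1;                 // hypothetical type codes
    static final int DOUBLE_QUOTED_STRING = 8;

    /** Hypothetical stand-in for a checked parsing exception. */
    static class SketchParsingException extends Exception {
        SketchParsingException(String message) { super(message); }
    }

    /** Hypothetical token factory: receives tokens one at a time. */
    interface TokenSink {
        void addToken(int startIndex, int endIndex, int type);
    }

    /** A sink that remembers tokens in the order in which they were created. */
    static class RecordingSink implements TokenSink {
        final List<int[]> recorded = new ArrayList<>();

        @Override
        public void addToken(int startIndex, int endIndex, int type) {
            recorded.add(new int[] {startIndex, endIndex, type});
        }
    }

    /** Reads the input left to right and reports each token as [start, end) plus a type. */
    static void tokenize(String input, TokenSink sink) throws SketchParsingException {
        int i = 0;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;
                continue;
            }
            int start = i;
            if (c == '"') {
                int close = input.indexOf('"', i + 1);
                if (close < 0) {
                    // Mirrors the documented failure case: a quote that is never closed.
                    throw new SketchParsingException("No matching double quote for the quote at index " + start);
                }
                i = close + 1;
                sink.addToken(start, i, DOUBLE_QUOTED_STRING);
            } else {
                while (i < input.length() && !Character.isWhitespace(input.charAt(i))) {
                    i++;
                }
                sink.addToken(start, i, WORD);
            }
        }
    }

    public static void main(String[] args) throws SketchParsingException {
        RecordingSink sink = new RecordingSink();
        tokenize("cat \"big dog\"", sink);
        for (int[] token : sink.recorded) {
            System.out.println(token[0] + ".." + token[1] + " type " + token[2]);
        }
    }
}
```

Keeping the sink separate from the tokenizer means the caller decides how tokens are stored, while the tokenizer only reports positions and types in the order it encounters them.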