public class WhitespaceTokenizerFactory extends TokenizerFactory
Factory for WhitespaceTokenizer.
    <fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.WhitespaceTokenizerFactory" rule="unicode" maxTokenLen="256"/>
      </analyzer>
    </fieldType>

Options:
- rule: either "java" for WhitespaceTokenizer or "unicode" for UnicodeWhitespaceTokenizer
- maxTokenLen: maximum token length; must be greater than 0. It rarely needs changing; the default is CharTokenizer::DEFAULT_MAX_TOKEN_LEN

Modifier and Type | Field and Description |
---|---|
static String | NAME: SPI name |
static String | RULE_JAVA |
static String | RULE_UNICODE |
Fields inherited from class AbstractAnalysisFactory: LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
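
The difference between the two rule values can be illustrated without Lucene on the classpath. The sketch below is not the library's implementation: it assumes that the "java" rule corresponds to `Character.isWhitespace` and that the "unicode" rule corresponds to the Unicode White_Space property, approximated here with the JDK regex class `\p{IsWhite_Space}`. The class and method names (`WhitespaceRuleSketch`, `split`) are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;
import java.util.regex.Pattern;

/** Illustrative only: mimics the two whitespace rules with plain JDK calls, not Lucene classes. */
final class WhitespaceRuleSketch {

    // JDK regex support for the Unicode White_Space binary property.
    private static final Pattern WHITE_SPACE = Pattern.compile("\\p{IsWhite_Space}");

    /** Assumed behaviour of rule="java": separators are what Character.isWhitespace accepts. */
    static boolean isJavaWhitespace(int codePoint) {
        return Character.isWhitespace(codePoint);
    }

    /** Assumed behaviour of rule="unicode": separators have the Unicode White_Space property. */
    static boolean isUnicodeWhitespace(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return WHITE_SPACE.matcher(s).matches();
    }

    /** Splits text into maximal runs of non-separator code points. */
    static List<String> split(String text, IntPredicate isSeparator) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isSeparator.test(cp)) {
                // A separator ends the current token, if there is one.
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.appendCodePoint(cp);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }
}
```

Under these assumptions, a no-break space (U+00A0) separates tokens only with the Unicode predicate, because `Character.isWhitespace` explicitly excludes non-breaking spaces.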
Constructor and Description |
---|
WhitespaceTokenizerFactory(Map<String,String> args): Creates a new WhitespaceTokenizerFactory |
Modifier and Type | Method and Description |
---|---|
Tokenizer | create(AttributeFactory factory): Creates a TokenStream of the specified input using the given AttributeFactory |
Methods inherited from class TokenizerFactory: availableTokenizers, create, findSPIName, forName, lookupClass, reloadTokenizers

Methods inherited from class AbstractAnalysisFactory: get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
public static final String NAME
public static final String RULE_JAVA
public static final String RULE_UNICODE
public Tokenizer create(AttributeFactory factory)
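Description copied from class TokenizerFactory: Creates a TokenStream of the specified input using the given AttributeFactory.
Specified by: create in class TokenizerFactory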
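
The maxTokenLen attribute in the configuration example caps the length of each emitted token. As a rough illustration of what such a cap means, the sketch below assumes that a run of non-whitespace characters longer than the limit is cut into consecutive chunks rather than dropped. It is a simplified stand-in, not Lucene code: it counts `char` values, ignores surrogate pairs, and the names `MaxTokenLenSketch` and `tokenize` are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: whitespace splitting with a per-token length cap. */
final class MaxTokenLenSketch {

    static List<String> tokenize(String text, int maxTokenLen) {
        if (maxTokenLen <= 0) {
            throw new IllegalArgumentException("maxTokenLen must be greater than 0");
        }
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                // Whitespace ends the current token, if any.
                flush(current, tokens);
            } else {
                current.append(c);
                // Assumption: reaching the cap emits the token and starts a new one.
                if (current.length() >= maxTokenLen) {
                    flush(current, tokens);
                }
            }
        }
        flush(current, tokens);
        return tokens;
    }

    /** Emits the buffered token, if non-empty, and clears the buffer. */
    private static void flush(StringBuilder current, List<String> tokens) {
        if (current.length() > 0) {
            tokens.add(current.toString());
            current.setLength(0);
        }
    }
}
```

With a cap of 3, for example, the input `abcdefg hi` is split into `abc`, `def`, `g`, and `hi` by this sketch.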
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.