| Package | Description |
|---|---|
| org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
| org.apache.lucene.analysis.custom |
A general-purpose Analyzer that can be created with a builder-style API.
|
| org.apache.lucene.analysis.ngram |
Character n-gram tokenizers and filters.
|
| org.apache.lucene.analysis.path |
Analysis components for path-like strings such as filenames.
|
| org.apache.lucene.analysis.pattern |
Set of components for pattern-based (regex) analysis.
|
| org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizers.
|
| org.apache.lucene.analysis.th |
Analyzer for Thai.
|
| org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
| org.apache.lucene.analysis.wikipedia |
Tokenizer that is aware of Wikipedia syntax.
|
| Modifier and Type | Class and Description |
|---|---|
class |
KeywordTokenizerFactory
Factory for
KeywordTokenizer. |
class |
LetterTokenizerFactory
Factory for
LetterTokenizer. |
class |
WhitespaceTokenizerFactory
Factory for
WhitespaceTokenizer. |
| Modifier and Type | Method and Description |
|---|---|
TokenizerFactory |
CustomAnalyzer.getTokenizerFactory()
Returns the tokenizer that is used in this analyzer.
|
| Modifier and Type | Method and Description |
|---|---|
CustomAnalyzer.Builder |
CustomAnalyzer.Builder.withTokenizer(Class<? extends TokenizerFactory> factory,
Map<String,String> params)
Uses the given tokenizer.
|
CustomAnalyzer.Builder |
CustomAnalyzer.Builder.withTokenizer(Class<? extends TokenizerFactory> factory,
String... params)
Uses the given tokenizer.
|
| Modifier and Type | Class and Description |
|---|---|
class |
EdgeNGramTokenizerFactory
Creates new instances of
EdgeNGramTokenizer. |
class |
NGramTokenizerFactory
Factory for
NGramTokenizer. |
| Modifier and Type | Class and Description |
|---|---|
class |
PathHierarchyTokenizerFactory
Factory for
PathHierarchyTokenizer. |
| Modifier and Type | Class and Description |
|---|---|
class |
PatternTokenizerFactory
Factory for
PatternTokenizer. |
class |
SimplePatternSplitTokenizerFactory
Factory for
SimplePatternSplitTokenizer, for producing tokens by splitting according to the provided regexp. |
class |
SimplePatternTokenizerFactory
Factory for
SimplePatternTokenizer, for matching tokens based on the provided regexp. |
| Modifier and Type | Class and Description |
|---|---|
class |
ClassicTokenizerFactory
Factory for
ClassicTokenizer. |
class |
StandardTokenizerFactory
Factory for
StandardTokenizer. |
class |
UAX29URLEmailTokenizerFactory
Factory for
UAX29URLEmailTokenizer. |
| Modifier and Type | Class and Description |
|---|---|
class |
ThaiTokenizerFactory
Factory for
ThaiTokenizer. |
| Modifier and Type | Method and Description |
|---|---|
static TokenizerFactory |
TokenizerFactory.forName(String name,
Map<String,String> args)
looks up a tokenizer by name from context classpath
|
| Modifier and Type | Method and Description |
|---|---|
static Class<? extends TokenizerFactory> |
TokenizerFactory.lookupClass(String name)
looks up a tokenizer class by name from context classpath
|
| Modifier and Type | Method and Description |
|---|---|
static String |
TokenizerFactory.findSPIName(Class<? extends TokenizerFactory> serviceClass)
looks up a SPI name for the specified tokenizer factory
|
| Modifier and Type | Class and Description |
|---|---|
class |
WikipediaTokenizerFactory
Factory for
WikipediaTokenizer. |
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.