Class | Description |
---|---|
GraphvizFormatter | Outputs the dot (Graphviz) string for the Viterbi lattice. |
JapaneseAnalyzer | Analyzer for Japanese that uses morphological analysis. |
JapaneseBaseFormFilter | Replaces term text with the BaseFormAttribute. |
JapaneseBaseFormFilterFactory | Factory for JapaneseBaseFormFilter. |
JapaneseIterationMarkCharFilter | Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form. |
JapaneseIterationMarkCharFilterFactory | Factory for JapaneseIterationMarkCharFilter. |
JapaneseKatakanaStemFilter | A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). |
JapaneseKatakanaStemFilterFactory | Factory for JapaneseKatakanaStemFilter. |
JapaneseNumberFilter | A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters. |
JapaneseNumberFilter.NumberBuffer | Buffer that holds a Japanese number string and a position index used as a parsed-to marker. |
JapaneseNumberFilterFactory | Factory for JapaneseNumberFilter. |
JapanesePartOfSpeechStopFilter | Removes tokens that match a set of part-of-speech tags. |
JapanesePartOfSpeechStopFilterFactory | Factory for JapanesePartOfSpeechStopFilter. |
JapaneseReadingFormFilter | A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form. |
JapaneseReadingFormFilterFactory | Factory for JapaneseReadingFormFilter. |
JapaneseTokenizer | Tokenizer for Japanese that uses morphological analysis. |
JapaneseTokenizerFactory | Factory for JapaneseTokenizer. |
Token | Analyzed token with morphological data from its dictionary. |
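
Two of the normalizations described in the class table, katakana stemming and iteration mark expansion, can be illustrated in plain Java. The sketch below is not the Lucene implementation and does not use the Lucene API: the class name `KuromojiNormalizationSketch`, its method names, and the minimum length of 4 are assumptions made for illustration, and only the kanji iteration mark (U+3005) is handled.

```java
// Illustrative sketch only: plain Java with no Lucene dependency.
// All names and thresholds here are hypothetical, not Lucene's.
final class KuromojiNormalizationSketch {

    private static final char LONG_SOUND_MARK = '\u30FC';      // prolonged sound mark
    private static final char KANJI_ITERATION_MARK = '\u3005'; // kanji iteration mark

    // Assumed threshold: shorter katakana words are left unchanged.
    private static final int MIN_LENGTH = 4;

    /** Returns true if every character is in the Unicode Katakana block. */
    static boolean isKatakana(String s) {
        if (s.isEmpty()) {
            return false;
        }
        for (int i = 0; i < s.length(); i++) {
            if (Character.UnicodeBlock.of(s.charAt(i)) != Character.UnicodeBlock.KATAKANA) {
                return false;
            }
        }
        return true;
    }

    /** Drops one trailing long sound mark from sufficiently long katakana terms. */
    static String stemKatakana(String term) {
        boolean endsWithMark = !term.isEmpty()
                && term.charAt(term.length() - 1) == LONG_SOUND_MARK;
        if (term.length() >= MIN_LENGTH && endsWithMark && isKatakana(term)) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    /**
     * Expands runs of the kanji iteration mark. A run of n marks repeats
     * the n characters that precede it; marks with no source character
     * are kept as they are.
     */
    static String expandIterationMarks(String text) {
        StringBuilder out = new StringBuilder(text.length());
        int i = 0;
        while (i < text.length()) {
            if (text.charAt(i) != KANJI_ITERATION_MARK) {
                out.append(text.charAt(i));
                i++;
                continue;
            }
            int runStart = i;
            while (i < text.length() && text.charAt(i) == KANJI_ITERATION_MARK) {
                i++;
            }
            int runLength = i - runStart;
            for (int k = 0; k < runLength; k++) {
                int source = runStart - runLength + k;
                out.append(source >= 0 ? text.charAt(source) : KANJI_ITERATION_MARK);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(stemKatakana("コンピューター"));
        System.out.println(expandIterationMarks("時々"));
    }
}
```

In an actual analysis chain these steps are performed by JapaneseKatakanaStemFilter and JapaneseIterationMarkCharFilter; the sketch only shows the string-level effect on a single term or text.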

Enum | Description |
---|---|
JapaneseTokenizer.Mode | Tokenization mode: this determines how the tokenizer handles compound and unknown words. |
JapaneseTokenizer.Type | Token type reflecting the original source of this token. |
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.