LuceneStandardTokenizer Class
- java.lang.Object
  - com.azure.search.documents.indexes.models.LexicalTokenizer
    - com.azure.search.documents.indexes.models.LuceneStandardTokenizer
public final class LuceneStandardTokenizer
extends LexicalTokenizer
Breaks text following the Unicode Text Segmentation rules. This tokenizer is implemented using Apache Lucene.
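To give a feel for what word-boundary segmentation does, the following is a minimal sketch that uses the JDK's `java.text.BreakIterator`. It is not the Lucene implementation behind this tokenizer, and the JDK's boundary rules are not guaranteed to match Lucene's on every input. The class name `WordSegmentationSketch` and the helper `words` are hypothetical names used only for illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class WordSegmentationSketch {

    // Returns the word-like segments of the text, skipping segments that
    // consist only of punctuation or whitespace.
    // Assumption: this approximates, but does not reproduce, Lucene's tokenizer.
    static List<String> words(String text) {
        BreakIterator boundaries = BreakIterator.getWordInstance(Locale.ROOT);
        boundaries.setText(text);

        List<String> result = new ArrayList<>();
        int start = boundaries.first();
        for (int end = boundaries.next();
             end != BreakIterator.DONE;
             start = end, end = boundaries.next()) {
            String segment = text.substring(start, end);
            // Keep only segments containing at least one letter or digit.
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                result.add(segment);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(words("Hello, world"));
    }
}
```

The filter on letters and digits is a simplification chosen for this sketch; a real analyzer decides which segments become tokens by segment type rather than by inspecting characters.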
Constructor Summary
| Constructor | Description |
|---|---|
| LuceneStandardTokenizer(String name) | Constructor of LuceneStandardTokenizer. |
Method Summary
| Modifier and Type | Method and Description |
|---|---|
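| Integer | getMaxTokenLength() Get the maxTokenLength property: The maximum token length. |
| LuceneStandardTokenizer | setMaxTokenLength(Integer maxTokenLength) Set the maxTokenLength property: The maximum token length. |
| JsonWriter | toJson(JsonWriter jsonWriter) |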
Methods inherited from LexicalTokenizer
Methods inherited from java.lang.Object
Constructor Details
LuceneStandardTokenizer
public LuceneStandardTokenizer(String name)
Constructor of LuceneStandardTokenizer.
Parameters:
name - the name of the tokenizer.
Method Details
getMaxTokenLength
public Integer getMaxTokenLength()
Get the maxTokenLength property: The maximum token length. Default is 255. Tokens longer than the maximum length are split.
Returns:
the maxTokenLength value.
setMaxTokenLength
public LuceneStandardTokenizer setMaxTokenLength(Integer maxTokenLength)
Set the maxTokenLength property: The maximum token length. Default is 255. Tokens longer than the maximum length are split.
Parameters:
maxTokenLength - the maxTokenLength value to set.
Returns:
the LuceneStandardTokenizer object itself.
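The effect of the maximum token length can be illustrated with a small standalone sketch. It assumes that an over-long token is cut into consecutive chunks of at most `maxTokenLength` characters; the exact split points used by the service are not stated here, so treat that as an assumption. The class `MaxTokenLengthSketch` and the method `splitLongTokens` are hypothetical and are not part of the Azure SDK.

```java
import java.util.ArrayList;
import java.util.List;

public class MaxTokenLengthSketch {

    // Splits every token longer than maxTokenLength into consecutive chunks
    // of at most maxTokenLength chars. Shorter tokens pass through unchanged.
    // Assumption: chunking at fixed intervals; counts Java chars, so a
    // surrogate pair could be split in this simplified version.
    static List<String> splitLongTokens(List<String> tokens, int maxTokenLength) {
        if (maxTokenLength <= 0) {
            throw new IllegalArgumentException("maxTokenLength must be positive");
        }
        List<String> result = new ArrayList<>();
        for (String token : tokens) {
            for (int i = 0; i < token.length(); i += maxTokenLength) {
                int end = Math.min(token.length(), i + maxTokenLength);
                result.add(token.substring(i, end));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // A small limit is used so the splitting is visible; the documented default is 255.
        System.out.println(splitLongTokens(List.of("search", "tokenization"), 5));
    }
}
```

With the default of 255, ordinary words are never split; the limit mainly matters for long unbroken strings such as identifiers, hashes, or URLs.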
toJson
public JsonWriter toJson(JsonWriter jsonWriter)
Overrides:
LexicalTokenizer.toJson(JsonWriter jsonWriter)
Parameters:
jsonWriter - the JsonWriter to write the JSON to.
Throws:
IOException - if writing to the JsonWriter fails.