Set NGramOrientalOnly
to True
to apply N-Gram tokenization to Chinese, Japanese, and Korean characters but to ignore multi-byte characters in other languages.
Type: | Boolean |
Default: | False |
Required: | No |
Configuration Section: | LanguageTypes or MyLanguage |
Example: | NGram=2 NGramOrientalOnly=True |
See Also: |
|