public class MultiWordChunker extends AbstractDisambiguator
Constructor and Description |
---|
MultiWordChunker(String filename) |
MultiWordChunker(String filename, boolean allowFirstCapitalized, boolean allowAllUppercase) |
Modifier and Type | Method and Description |
---|---|
AnalyzedSentence | disambiguate(AnalyzedSentence input) If possible, filters out the wrong POS tags. |
AnalyzedSentence | disambiguate(AnalyzedSentence input, JLanguageTool.CheckCancelledCallback checkCanceled) Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...) |
Methods inherited from class AbstractDisambiguator: preDisambiguate
public MultiWordChunker(String filename)
filename - text file with multiwords and tags
public MultiWordChunker(String filename, boolean allowFirstCapitalized, boolean allowAllUppercase)
filename - text file with multiwords and tags
allowFirstCapitalized - if set to true, the first word of the multiword can be capitalized
allowAllUppercase - if set to true, the all-uppercase version of the multiword is allowed
public AnalyzedSentence disambiguate(AnalyzedSentence input) throws IOException
Disambiguator
input - The sentence with already tagged words. The words are expected to have multiple tags.
IOException
public final AnalyzedSentence disambiguate(AnalyzedSentence input, @Nullable JLanguageTool.CheckCancelledCallback checkCanceled) throws IOException
input - The tokens to be chunked.
IOException
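
The effect of the allowFirstCapitalized and allowAllUppercase constructor flags can be pictured with a small standalone sketch. It does not use or reproduce the LanguageTool implementation: the class `MultiWordSketch`, the method `buildLookup`, the tab-separated `multiword<TAB>tag` entry format, and the sample entry are all assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not the LanguageTool implementation.
class MultiWordSketch {

    /** Builds a multiword-to-tag lookup, adding case variants according to the two flags. */
    static Map<String, String> buildLookup(List<String> lines,
                                           boolean allowFirstCapitalized,
                                           boolean allowAllUppercase) {
        Map<String, String> lookup = new LinkedHashMap<>();
        for (String line : lines) {
            // Assumed entry format: "<multiword><TAB><tag>"; blank lines and # comments are skipped.
            if (line.isBlank() || line.startsWith("#")) {
                continue;
            }
            String[] parts = line.split("\t");
            if (parts.length != 2) {
                continue; // malformed entries are ignored in this sketch
            }
            String phrase = parts[0].trim();
            String tag = parts[1].trim();
            if (phrase.isEmpty()) {
                continue;
            }
            lookup.put(phrase, tag);
            if (allowFirstCapitalized) {
                // Variant with only the first character of the first word upper-cased.
                String capitalized = phrase.substring(0, 1).toUpperCase(Locale.ROOT)
                        + phrase.substring(1);
                lookup.putIfAbsent(capitalized, tag);
            }
            if (allowAllUppercase) {
                // Variant with the whole multiword upper-cased.
                lookup.putIfAbsent(phrase.toUpperCase(Locale.ROOT), tag);
            }
        }
        return lookup;
    }

    public static void main(String[] args) {
        // Sample entry invented for this example.
        List<String> entries = List.of("# multiword<TAB>tag", "a priori\tRB");
        Map<String, String> strict = buildLookup(entries, false, false);
        Map<String, String> relaxed = buildLookup(entries, true, true);
        System.out.println(strict.containsKey("A priori")); // false
        System.out.println(relaxed.get("A priori"));        // RB
        System.out.println(relaxed.get("A PRIORI"));        // RB
    }
}
```

With both flags set to false, only the spelling given in the file matches; enabling a flag adds the corresponding case variant under the same tag.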