public class CatalanTagger extends BaseTagger
| Modifier and Type | Field and Description |
|---|---|
static CatalanTagger |
INSTANCE_CAT |
static CatalanTagger |
INSTANCE_VAL |
locale, wordTagger| Constructor and Description |
|---|
CatalanTagger(Language language) |
| Modifier and Type | Method and Description |
|---|---|
protected List<AnalyzedToken> |
additionalTags(String word,
morfologik.stemming.IStemmer stemmer) |
boolean |
overwriteWithManualTagger()
If true, tags from the binary dictionary (*.dict) will be overwritten by manual tags
from the plain text dictionary.
|
List<AnalyzedTokenReadings> |
tag(List<String> sentenceTokens)
Returns a list of
AnalyzedTokens that assigns each term in the
sentence some kind of part-of-speech information (not necessarily just one tag). |
additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, getWordTaggerpublic static final CatalanTagger INSTANCE_VAL
public static final CatalanTagger INSTANCE_CAT
public CatalanTagger(Language language)
public boolean overwriteWithManualTagger()
BaseTaggeroverwriteWithManualTagger in class BaseTaggerpublic List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
TaggerAnalyzedTokens that assigns each term in the
sentence some kind of part-of-speech information (not necessarily just one tag).
Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.
tag in interface Taggertag in class BaseTaggersentenceTokens - the text as returned by a WordTokenizer@Nullable protected List<AnalyzedToken> additionalTags(String word, morfologik.stemming.IStemmer stemmer)