public class CatalanTagger extends BaseTagger
Modifier and Type | Field and Description |
---|---|
static CatalanTagger | INSTANCE_CAT |
static CatalanTagger | INSTANCE_VAL |
Fields inherited from class BaseTagger: locale, wordTagger
Constructor and Description |
---|
CatalanTagger(Language language) |
Modifier and Type | Method and Description |
---|---|
protected List<AnalyzedToken> | additionalTags(String word, morfologik.stemming.IStemmer stemmer) |
boolean | overwriteWithManualTagger() If true, tags from the binary dictionary (*.dict) will be overwritten by manual tags from the plain text dictionary. |
List<AnalyzedTokenReadings> | tag(List<String> sentenceTokens) Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag). |
Methods inherited from class BaseTagger: additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, getWordTagger
public static final CatalanTagger INSTANCE_VAL
public static final CatalanTagger INSTANCE_CAT
public CatalanTagger(Language language)
public boolean overwriteWithManualTagger()
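Description copied from class: BaseTagger
If true, tags from the binary dictionary (*.dict) will be overwritten by manual tags from the plain text dictionary.
Overrides: overwriteWithManualTagger in class BaseTagger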
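The effect of this flag can be illustrated with a minimal, self-contained sketch. It does not use the LanguageTool API: the class `ManualOverrideSketch`, the two maps, the `lookup` method, and the tag names are hypothetical stand-ins for the binary and plain text dictionaries. The merging behaviour shown for `false` is an assumption, since the description above only covers the `true` case.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class ManualOverrideSketch {

    // Hypothetical stand-in for the binary (*.dict) dictionary.
    static final Map<String, List<String>> BINARY =
            Map.of("casa", List.of("NOUN", "VERB"));

    // Hypothetical stand-in for the manual plain text dictionary.
    static final Map<String, List<String>> MANUAL =
            Map.of("casa", List.of("NOUN"));

    static List<String> lookup(String word, boolean overwriteWithManualTagger) {
        List<String> binaryTags = BINARY.getOrDefault(word, List.of());
        List<String> manualTags = MANUAL.getOrDefault(word, List.of());

        // Flag set: manual tags, when present, replace the binary tags.
        if (overwriteWithManualTagger && !manualTags.isEmpty()) {
            return manualTags;
        }

        // Flag unset (assumed behaviour): both sources are combined.
        List<String> merged = new ArrayList<>(binaryTags);
        for (String tag : manualTags) {
            if (!merged.contains(tag)) {
                merged.add(tag);
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        System.out.println(lookup("casa", true));   // prints [NOUN]
        System.out.println(lookup("casa", false));  // prints [NOUN, VERB]
    }
}
```

With the flag set, the manual entry acts as a correction of the binary dictionary rather than an addition to it.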
public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
Description copied from interface: Tagger
Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).
Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.
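One way such a first-word special case might look is sketched below. This is not the CatalanTagger implementation: `FirstWordSketch`, `LEXICON`, `tagSentence`, and the tag names are hypothetical, and the lowercase fallback is only one plausible strategy. The sketch retries a lookup in lowercase when the first non-whitespace token of the sentence is not found as written.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class FirstWordSketch {

    // Hypothetical lexicon mapping lowercase word forms to tags.
    static final Map<String, List<String>> LEXICON = Map.of(
            "el", List.of("DET"),
            "gat", List.of("NOUN"),
            "dorm", List.of("VERB"));

    // Takes the tokens of exactly one sentence and returns one tag list per token.
    static List<List<String>> tagSentence(List<String> sentenceTokens) {
        List<List<String>> result = new ArrayList<>();
        boolean firstWord = true;
        for (String token : sentenceTokens) {
            List<String> tags = LEXICON.getOrDefault(token, List.of());
            // Only the first word of the sentence gets the lowercase retry.
            if (tags.isEmpty() && firstWord) {
                tags = LEXICON.getOrDefault(token.toLowerCase(Locale.ROOT), List.of());
            }
            result.add(tags);
            // Whitespace tokens do not count as the first word.
            if (!token.trim().isEmpty()) {
                firstWord = false;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // prints [[DET], [], [NOUN], [], [VERB]]
        System.out.println(tagSentence(List.of("El", " ", "gat", " ", "dorm")));
        // "El" is not sentence-initial here, so it stays untagged:
        // prints [[NOUN], [], []]
        System.out.println(tagSentence(List.of("gat", " ", "El")));
    }
}
```

This is also why the method expects a single sentence: passing several sentences at once would apply the first-word handling only to the very first token.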
Specified by: tag in interface Tagger
Overrides: tag in class BaseTagger
Parameters: sentenceTokens - the text as returned by a WordTokenizer
@Nullable protected List<AnalyzedToken> additionalTags(String word, morfologik.stemming.IStemmer stemmer)