public class GermanTagger extends BaseTagger
German part-of-speech tagger. Requires the data file de/german.dict in the classpath.
The POS tagset is described in tagset.txt.
Modifier and Type | Field and Description |
---|---|
static GermanTagger | INSTANCE |
Fields inherited from class BaseTagger: locale, wordTagger
Constructor and Description |
---|
GermanTagger() |
Modifier and Type | Method and Description |
---|---|
AnalyzedTokenReadings | lookup(String word) Return only the first reading of the given word or null. |
List<AnalyzedTokenReadings> | tag(List<String> sentenceTokens) Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag). |
List<AnalyzedTokenReadings> | tag(List<String> sentenceTokens, boolean ignoreCase) |
List<TaggedWord> | tag(String word) |
Methods inherited from class BaseTagger: additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, getWordTagger, overwriteWithManualTagger
public static final GermanTagger INSTANCE
@Nullable public AnalyzedTokenReadings lookup(String word) throws IOException
Returns only the first reading of the given word, or null.
Throws: IOException
public List<TaggedWord> tag(String word)
public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens) throws IOException
Description copied from interface: Tagger
Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).
Note that this method takes exactly one sentence. Its implementation may handle the first word of a sentence as a special case, because that word is usually written with an uppercase letter.
Specified by: tag in interface Tagger
Overrides: tag in class BaseTagger
Parameters: sentenceTokens - the text as returned by a WordTokenizer
Throws: IOException
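The note about the first word of a sentence can be illustrated with a small, self-contained sketch. This is not GermanTagger's implementation and does not use the LanguageTool API: the class ToyTagger, its in-memory dictionary, and the tag labels SUB, VER, PRO and ADV are all hypothetical stand-ins chosen for the example, not taken from tagset.txt. It only shows why a tagger might return more than one reading for a token and why it needs exactly one sentence at a time: a capitalized first token may also be the lowercase word, so the lowercase form is looked up as well.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Toy illustration only: a hypothetical dictionary-backed tagger,
// not the LanguageTool GermanTagger.
final class ToyTagger {

    // Hypothetical mini-dictionary: surface form -> possible tags.
    private static final Map<String, List<String>> DICT = Map.of(
        "Essen", List.of("SUB"),  // noun reading
        "essen", List.of("VER"),  // verb reading
        "wir", List.of("PRO"),
        "gern", List.of("ADV"));

    // Returns one list of tags per token; a list may hold several tags or none.
    static List<List<String>> tag(List<String> sentenceTokens) {
        List<List<String>> result = new ArrayList<>();
        for (int i = 0; i < sentenceTokens.size(); i++) {
            String token = sentenceTokens.get(i);
            List<String> readings = new ArrayList<>(DICT.getOrDefault(token, List.of()));
            // Special case: the sentence-initial word is capitalized by convention,
            // so also try its lowercase form and merge any extra readings.
            if (i == 0 && !token.isEmpty() && Character.isUpperCase(token.charAt(0))) {
                String lower = token.substring(0, 1).toLowerCase(Locale.GERMAN) + token.substring(1);
                for (String tagName : DICT.getOrDefault(lower, List.of())) {
                    if (!readings.contains(tagName)) {
                        readings.add(tagName);
                    }
                }
            }
            result.add(readings);
        }
        return result;
    }

    public static void main(String[] args) {
        // "Essen" starts the sentence, so it gets both the noun and the verb reading.
        System.out.println(tag(List.of("Essen", "wir", "gern"))); // prints [[SUB, VER], [PRO], [ADV]]
        // Mid-sentence, the capitalized form keeps only its own reading.
        System.out.println(tag(List.of("wir", "Essen")));         // prints [[PRO], [SUB]]
    }
}
```

Passing several sentences as one token list would apply the lowercase fallback only to the very first token, which is why a caller of a tagger like this would split text into sentences first.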
public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens, boolean ignoreCase) throws IOException
Throws: IOException