public class GermanTagger extends BaseTagger
de/german.dict in the classpath.
The POS tagset is described in
tagset.txt| Modifier and Type | Field and Description |
|---|---|
static GermanTagger |
INSTANCE |
locale, wordTagger| Constructor and Description |
|---|
GermanTagger() |
| Modifier and Type | Method and Description |
|---|---|
AnalyzedTokenReadings |
lookup(String word)
Return only the first reading of the given word or
null. |
List<AnalyzedTokenReadings> |
tag(List<String> sentenceTokens)
Returns a list of
AnalyzedTokens that assigns each term in the
sentence some kind of part-of-speech information (not necessarily just one tag). |
List<AnalyzedTokenReadings> |
tag(List<String> sentenceTokens,
boolean ignoreCase) |
List<TaggedWord> |
tag(String word) |
additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, getWordTagger, overwriteWithManualTaggerpublic static final GermanTagger INSTANCE
@Nullable public AnalyzedTokenReadings lookup(String word) throws IOException
null.IOExceptionpublic List<TaggedWord> tag(String word)
public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens) throws IOException
TaggerAnalyzedTokens that assigns each term in the
sentence some kind of part-of-speech information (not necessarily just one tag).
Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.
tag in interface Taggertag in class BaseTaggersentenceTokens - the text as returned by a WordTokenizerIOExceptionpublic List<AnalyzedTokenReadings> tag(List<String> sentenceTokens, boolean ignoreCase) throws IOException
IOException