Class | Description |
---|---|
ConfusionFileIndenter |
Re-Indent confusion_set.txt files.
|
ConfusionSetFileFormatter | |
ContextBuilder | |
FrequencyIndexCreator |
Index *.gz files from Google's ngram corpus into a Lucene index ('text' mode)
or aggregate them to plain text files ('lucene' mode).
|
LuceneIndexExporter |
Export the sentences of a Lucene index.
|
OccurrenceAdder |
Get occurrence counts for words by iterating compressed Google ngram files.
|