public class FrequencyIndexCreator extends Object
The reason this isn't faster is not Lucene but the aggregation work we do or simply the large amount of data. Indexing every line takes 3µs/doc, i.e. Lucene can index about 333,000 docs/s.
Also see https://dev.languagetool.org/finding-errors-using-n-gram-data.
Constructor and Description |
---|
FrequencyIndexCreator(org.languagetool.dev.bigdata.FrequencyIndexCreator.Mode mode) |