Class | Description |
---|---|
CorrectLinePrinter | Simple script to print only the lines that do not have a match. |
DuplicateRemover | Print a file without duplicate lines (except comments). |
DutchWordSplitter | |
ExportGermanNouns | Export German nouns, to be used by jWordSplitter. |
FalseFriendExporter | |
FilterFileByLanguage | |
Fuzzer | A very simple fuzzer to see if certain random input causes long processing times. |
FuzzerForAnalysis | A very simple fuzzer to see if certain random input causes long processing times when analyzing text. |
GenderWordFinder | Simple script to find words like "Schülerlots*innen" that might not be accepted by the speller. |
GenerateIrishWordforms | |
GermanCaseAmbiguityFinder | |
GermanElLeAdjectives | Finds word pairs like "flexible" / "flexibel". |
GermanOldSpellingFinder | Helps find old German spellings that lack their new version, like "abschloß" without "abschloss". |
IncorrectLinePrinter | Simple script to print only the lines that do have a match. |
InvalidGermanVerbFinder | Finds potentially incorrect word forms with "zu" such as "verzukokeln", which arise because the "_" syntax was used without considering that this form with "zu" is not valid. |
MissingEnglishPosFinder | Fast hack to find words which have no POS tagging. |
MissingGermanCompoundAdjectiveFinder | Find German compounds like "verwaltungstechnisch" that are not tagged yet. |
MissingGermanCompoundsFinder | Reads a list of words (one per line) and prints the first part of those that are not accepted by the speller and that are probably compounds. |
MissingGermanPosFinder | Fast hack to find words which have no POS tagging. |
MissingGermanPosForms | Fast hack to find words which have an uppercase reading in the speller dict but only a lowercase reading in the POS dict, so a reading might be missing in the POS dict. |
MissingGermanWords | List words from a given wordlist that are unknown to the German speller or German tagger. |
MissingPortuguesePosFinder | Portuguese: Fast hack to find words which have no POS tagging (note: does not consider the disambiguator). |
MissingRussianPosFinder | Russian: Fast hack to find words which have no POS tagging (note: does not consider the disambiguator). |
NGramLangIdentifierPerformanceTest | |
NoSuggestionRuleList | List rules that offer no suggestion. |
ProhibitComparator | Compare two (mostly) expanded prohibit.txt files. |
RegexExpander | "Expand" regular expressions like {@code [Ss](? |
RuleCreator | Takes the output of HomophoneOccurrenceDumper and automatically writes the error probabilities of variants (e.g. "0.977 public hair") when XML_MODE = false, or writes XML rules when XML_MODE = true. |
RuleDetails | Extends a CSV that has a column of rule ids with rule descriptions and categories. |
RuleOverview | Command line tool to list supported languages and their number of rules. |
UnknownWordFinder | Print words unknown to the spell checker, sorted by number of occurrences. |
UnknownWordFinderForCsv | Print words from CSV that are unknown to the spell checker, sorted by number of occurrences. |
XmlIndenter | A lightweight XML indentation helper for grammar.xml files. |
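
The DuplicateRemover entry describes printing a file without duplicate lines while leaving comments alone. The following is a minimal sketch of that idea, not the actual implementation of that class: the class name `DuplicateLineFilterSketch`, its method names, and the convention that comment lines start with `#` are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch; not the DuplicateRemover class itself.
public class DuplicateLineFilterSketch {

  // Assumption: a comment is any line whose first non-blank character is '#'.
  public static boolean isComment(String line) {
    return line.trim().startsWith("#");
  }

  // Keeps the first occurrence of each non-comment line, in original order.
  // Comment lines are always kept, even when they repeat.
  public static List<String> removeDuplicates(List<String> lines) {
    Set<String> seen = new HashSet<>();
    List<String> result = new ArrayList<>();
    for (String line : lines) {
      // Set.add returns false if the line was already seen.
      if (isComment(line) || seen.add(line)) {
        result.add(line);
      }
    }
    return result;
  }

  public static void main(String[] args) {
    List<String> input = List.of("# section A", "Haus", "Baum", "# section A", "Haus");
    removeDuplicates(input).forEach(System.out::println);
  }
}
```

With the sample input in `main`, the second "Haus" is dropped while both "# section A" comment lines survive. A `HashSet` keeps the membership check constant-time per line, which matters for large word lists.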