MultiWordChunker (LanguageTool 6.4-SNAPSHOT API)

java.lang.Object
- org.languagetool.tagging.disambiguation.AbstractDisambiguator
- - org.languagetool.tagging.disambiguation.MultiWordChunker

All Implemented Interfaces:

Disambiguator
```
public class MultiWordChunker
extends AbstractDisambiguator
```
Multiword tagger-chunker.

Author:

Marcin Miłkowski

Constructor Summary

Constructors
Constructor and Description
`MultiWordChunker(String filename)`
`MultiWordChunker(String filename, boolean allowFirstCapitalized, boolean allowAllUppercase)`
`MultiWordChunker(String filename, boolean allowFirstCapitalized, boolean allowAllUppercase, String defaultTag)`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`AnalyzedSentence`	`disambiguate(AnalyzedSentence input)` If possible, filters out the wrong POS tags.
`AnalyzedSentence`	`disambiguate(AnalyzedSentence input, JLanguageTool.CheckCancelledCallback checkCanceled)` Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...)

Methods inherited from class org.languagetool.tagging.disambiguation.AbstractDisambiguator
preDisambiguate

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - MultiWordChunker
```
public MultiWordChunker(String filename)
```
    Parameters:
    
    filename - file text with multiwords and tags
  - MultiWordChunker
```
public MultiWordChunker(String filename,
                        boolean allowFirstCapitalized,
                        boolean allowAllUppercase)
```
    Parameters:
    
    filename - file text with multiwords and tags
    
    allowFirstCapitalized - if set to true, first word of the multiword can be capitalized
    
    allowAllUppercase - if set to true, the all uppercase version of the multiword is allowed
  - MultiWordChunker
```
public MultiWordChunker(String filename,
                        boolean allowFirstCapitalized,
                        boolean allowAllUppercase,
                        String defaultTag)
```
- Method Detail
  - disambiguate
```
public AnalyzedSentence disambiguate(AnalyzedSentence input)
                              throws IOException
```
    Description copied from interface: Disambiguator
    
    If possible, filters out the wrong POS tags.
    
    Parameters:
    
    input - The sentence with already tagged words. The words are expected to have multiple tags.
    
    Returns:
    
    Analyzed sentence, where each word has only one (possibly the most correct) tag.
    
    Throws:
    
    IOException
  - disambiguate
```
public final AnalyzedSentence disambiguate(AnalyzedSentence input,
                                           @Nullable
                                           JLanguageTool.CheckCancelledCallback checkCanceled)
                                    throws IOException
```
    Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...) start, and </ELLIPSIS> for ellipsis end.
    
    Parameters:
    
    input - The tokens to be chunked.
    
    Returns:
    
    AnalyzedSentence with additional markers.
    
    Throws:
    
    IOException

Class MultiWordChunker

Constructor Summary

Method Summary

Methods inherited from class org.languagetool.tagging.disambiguation.AbstractDisambiguator

Methods inherited from class java.lang.Object

Constructor Detail

MultiWordChunker

MultiWordChunker

MultiWordChunker

Method Detail

disambiguate

disambiguate