public class WordTokenizer extends Object implements Tokenizer
http://foobar.org).| Constructor and Description |
|---|
WordTokenizer() |
| Modifier and Type | Method and Description |
|---|---|
static List<String> |
getProtocols()
Get the protocols that the tokenizer knows about.
|
String |
getTokenizingCharacters() |
static boolean |
isEMail(String token) |
static boolean |
isUrl(String token) |
protected List<String> |
joinEMails(List<String> list) |
protected List<String> |
joinEMailsAndUrls(List<String> list) |
protected List<String> |
joinUrls(List<String> l) |
List<String> |
tokenize(String text) |
public static List<String> getProtocols()
http, https, and ftppublic static boolean isUrl(String token)
public static boolean isEMail(String token)
public String getTokenizingCharacters()