public class WordTokenizer extends Object implements Tokenizer
http://foobar.org
).Constructor and Description |
---|
WordTokenizer() |
Modifier and Type | Method and Description |
---|---|
static List<String> |
getProtocols()
Get the protocols that the tokenizer knows about.
|
String |
getTokenizingCharacters() |
static boolean |
isEMail(String token) |
static boolean |
isUrl(String token) |
protected List<String> |
joinEMails(List<String> list) |
protected List<String> |
joinEMailsAndUrls(List<String> list) |
protected List<String> |
joinUrls(List<String> l) |
List<String> |
tokenize(String text) |
public static List<String> getProtocols()
http
, https
, and ftp
public static boolean isUrl(String token)
public static boolean isEMail(String token)
public String getTokenizingCharacters()