public class WikipediaSentenceSource extends SentenceSource
pages-articles.xml.bz2 from
http://download.wikimedia.org/backup-index.html, e.g.
http://download.wikimedia.org/dewiki/latest/dewiki-latest-pages-articles.xml.bz2.| Modifier and Type | Method and Description |
|---|---|
String |
getSource() |
boolean |
hasNext() |
Sentence |
next()
Return the next sentence.
|
acceptSentence, remove, toStringclone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitforEachRemainingpublic boolean hasNext()
hasNext in interface Iterator<Sentence>hasNext in class SentenceSourcepublic Sentence next()
SentenceSourcenext in interface Iterator<Sentence>next in class SentenceSourcepublic String getSource()
getSource in class SentenceSource