public class WikipediaSentenceSource extends SentenceSource
Provides access to the sentences of a Wikipedia XML dump. The dump is the
pages-articles.xml.bz2 file available from
http://download.wikimedia.org/backup-index.html, e.g.
http://download.wikimedia.org/dewiki/latest/dewiki-latest-pages-articles.xml.bz2
Modifier and Type | Method and Description |
---|---|
String | getSource() |
boolean | hasNext() |
Sentence | next() Return the next sentence. |
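Methods inherited from class SentenceSource: acceptSentence, remove, toString
Methods inherited from class Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface Iterator: forEachRemaining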
public boolean hasNext()
Specified by: hasNext in interface Iterator<Sentence>
Specified by: hasNext in class SentenceSource
public Sentence next()
Description copied from class: SentenceSource
Return the next sentence.
Specified by: next in interface Iterator<Sentence>
Specified by: next in class SentenceSource
public String getSource()
Specified by: getSource in class SentenceSource
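
The three methods documented here follow the usual Iterator contract, so a caller would typically loop with hasNext() and next() and tag each result with getSource(). The sketch below illustrates that pattern only. Because this page does not show the constructors of WikipediaSentenceSource, the example does not use the real class: DemoSentence, DemoSentenceSource, InMemorySentenceSource, and the getText() accessor are hypothetical stand-ins that mirror the method signatures listed above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Demo entry point: drains a sentence source using only
// hasNext(), next() and getSource(), as documented above.
class SentenceSourceDemo {

  // Collects one "source: text" line per sentence until the source is exhausted.
  static List<String> drain(DemoSentenceSource source) {
    List<String> lines = new ArrayList<>();
    while (source.hasNext()) {
      DemoSentence sentence = source.next();
      lines.add(source.getSource() + ": " + sentence.getText());
    }
    return lines;
  }

  public static void main(String[] args) {
    DemoSentenceSource source = new InMemorySentenceSource(
        "wikipedia", Arrays.asList("Erster Satz.", "Zweiter Satz."));
    for (String line : drain(source)) {
      System.out.println(line);
    }
  }
}

// Hypothetical stand-in for Sentence; getText() is an assumed accessor.
final class DemoSentence {
  private final String text;

  DemoSentence(String text) {
    this.text = text;
  }

  String getText() {
    return text;
  }
}

// Hypothetical stand-in for SentenceSource with the same three method signatures.
abstract class DemoSentenceSource implements Iterator<DemoSentence> {
  @Override
  public abstract boolean hasNext();

  @Override
  public abstract DemoSentence next();

  abstract String getSource();
}

// Hypothetical in-memory source used in place of a real Wikipedia dump reader.
final class InMemorySentenceSource extends DemoSentenceSource {
  private final String sourceName;
  private final Iterator<String> texts;

  InMemorySentenceSource(String sourceName, List<String> texts) {
    this.sourceName = sourceName;
    this.texts = texts.iterator();
  }

  @Override
  public boolean hasNext() {
    return texts.hasNext();
  }

  @Override
  public DemoSentence next() {
    if (!texts.hasNext()) {
      throw new NoSuchElementException("no more sentences");
    }
    return new DemoSentence(texts.next());
  }

  @Override
  String getSource() {
    return sourceName;
  }
}
```

With the real class, the same loop shape should apply once an instance has been obtained; how to construct one is not covered on this page.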