Entries for 2013-05-29 (1 day)
18.
>>> import re
>>> import nltk
>>> text = nltk.corpus.gutenberg.raw('melville-moby_dick.txt')
>>> words = nltk.word_tokenize(text)
>>> wh_words = sorted(set(w for w in words if re.search(r'^wh', w.lower())))
>>> for word in wh_words:
...     print(word)
...
WHALE
WHALE-FISHERY…
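The filtering step in the session above can be tried without downloading the Gutenberg corpus. The following is only a minimal sketch under a few assumptions: the helper name `words_starting_with` and the sample sentence are made up for illustration, and plain `str.split()` stands in for `nltk.word_tokenize`, so punctuation handling will differ from the real tokenizer.

```python
import re


def words_starting_with(tokens, prefix="wh"):
    """Return the sorted, de-duplicated tokens that start with `prefix`, ignoring case."""
    # re.escape keeps the prefix literal; IGNORECASE replaces the w.lower() call.
    pattern = re.compile("^" + re.escape(prefix), re.IGNORECASE)
    return sorted({t for t in tokens if pattern.search(t)})


# Hypothetical sample text, split on whitespace instead of nltk.word_tokenize.
sample = "WHALE whale Which ship saw the White Whale when whaling".split()
result = words_starting_with(sample)
print(result)
```

Because `sorted` compares strings by code point, every capitalised form is listed before any lowercase one, which is why the original output begins with `WHALE` and `WHALE-FISHERY`.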