O'Reilly's textbook chapter 2.1.4-2.1.5

The Reuters corpus should have plenty of news documents.

>>> import nltk
>>> import sys
>>> from nltk.corpus import reuters
>>> reuters.fileids()
['test/14826', 'test/14828', 'test/14829', 'test/14832', '…
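The file ids above look like a split name and a document number joined by a slash. As a minimal sketch of how such ids could be grouped, the snippet below uses only the four ids visible in the truncated output; `sample_ids` and `group_by_split` are hypothetical names introduced here for illustration and are not part of NLTK.

```python
from collections import defaultdict

# Only the four ids visible in the truncated output; the real list is longer.
sample_ids = ['test/14826', 'test/14828', 'test/14829', 'test/14832']


def group_by_split(fileids):
    """Group ids of the form 'split/number' by the part before the slash."""
    groups = defaultdict(list)
    for fid in fileids:
        # partition() splits on the first '/' only.
        split, _, doc_number = fid.partition('/')
        groups[split].append(doc_number)
    return dict(groups)


groups = group_by_split(sample_ids)
print(groups)
```

With NLTK and the Reuters corpus data installed, passing `reuters.fileids()` instead of `sample_ids` should group the whole corpus the same way.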