Entries from 2013-04-19 to 1 day

What is Tokenize? Part 2

Still continuing tokenize.word_tokenize does not handle some cases as I expected. For example. >>> word_tokenize("can't") ['ca', "n't"] In my textbook, other tools were introduced. For example, PunktWordTokenizer >>> from nltk.to…

What is Tokenize?

As my learning NLTK environment seems ready, let's moving forward.I set a one varialvle (para) to put 3 sentences. >>> import nltk >>> para = "Hello World. It's good to see you. Thansk for buying this book." >>> para "Hello World. It's goo…

My NLTK textbook

In addition to O'Reilly's "Natural Language Processing with Python", now I have downloaded this book into my Kindle. I will use this book mainly in my Window's environment.Python Text Processing With NLTK 2.0 Cookbook: Over 80 Practical Re…