NLTK.md

from nltk.tokenize import sent_tokenize, word_tokenize  # requires NLTK's Punkt tokenizer data to be downloaded
s = '''Good muffins cost $3.88\nin New York.  Please buy me\ntwo of them.\n\nThanks.'''
[word_tokenize(t) for t in sent_tokenize(s)]  #=> [['Good', 'muffins', 'cost', '$', '3.88', 'in', 'New', 'York', '.'], ['Please', 'buy', 'me', 'two', 'of', 'them', '.'], ['Thanks', '.']]
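
The comprehension above works in two stages: split the text into sentences, then split each sentence into word and punctuation tokens. As a rough, dependency-free sketch of that shape, the regex version below reproduces the same result for this one sample. It is not how NLTK works internally (NLTK relies on a trained Punkt model and Treebank-style rules), and the names `split_sentences`, `split_words`, and `tokenize` are hypothetical helpers invented for this illustration.

```python
import re

def split_sentences(text):
    # Naive assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    parts = re.split(r'(?<=[.!?])\s+', text.strip())
    return [p for p in parts if p]

def split_words(sentence):
    # Match decimal numbers first, then runs of word characters,
    # then any single non-space punctuation character.
    return re.findall(r'\d+(?:\.\d+)?|\w+|[^\w\s]', sentence)

def tokenize(text):
    # Same nesting as the NLTK one-liner: one list of tokens per sentence.
    return [split_words(sent) for sent in split_sentences(text)]

s = 'Good muffins cost $3.88\nin New York.  Please buy me\ntwo of them.\n\nThanks.'
print(tokenize(s))
```

A splitter this simple breaks on abbreviations such as "Dr." or "e.g.", which is one reason to prefer `sent_tokenize` and `word_tokenize` for real text.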