Cal Poly Pomona NLP Lib
Tools
Crawler
Amazon - using Amazon RESTful API (moved to https://github.com/AnakinFoxe/AmazonCrawler)
Facebook - using Facebook4j
Twitter - using Twitter4j
Translator
Google - using Google traslate RESTful API
Utils
ChineseSeg - Chinese words segmentation using mmseg4j
FileProcessor - Process batch of files
MapUtil - Include sorting, updating, summation etc. for Map
NGram - N-gram manipulation
Preprocessor - Text preprocessing
SemSimilarity - Word semantic similarity using ws4j
SentenceDetector - Detect sentence boundary
Stemmer - using snowball stemmer
Stopword - Stopword removal for different languages