Projects in 'Big Data Programming' class
1. Naver Blog: https://section.blog.naver.com/BlogHome.nhn?directoryNo=0¤tPage=1&groupId=0
2. Twitter Search: https://twitter.com/search-home
ANN: Necessary concept for data training for bigdata analysis
TFIDF( term frequency–inverse document frequency) : numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus. It is often used as a weighting factor in searches of information retrieval, text mining, and user modeling.
source (wikipedia https://en.wikipedia.org/wiki/Tf%E2%80%93idf)