Notes on NLP tools:
- OpenNLP : a Java toolkit for tokenization, part-of-speech tagging, chunking, etc.
- ScalaNLP : Scala libraries for NLP and machine learning
- Snowball : a stemmer; supports C and Java.
- MALLET : a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
- JGibbLDA : a Java implementation of LDA (latent Dirichlet allocation) using Gibbs sampling
- Stanford Topic Modeling Toolbox : implements the CVB0 (collapsed variational Bayes) algorithm, etc.
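
Several of the tools above fit LDA topic models, and JGibbLDA does so with Gibbs sampling. As a rough illustration of that technique, here is a minimal collapsed Gibbs sampler in plain Python. It is a sketch only, not the code of any package listed here: the function name `gibbs_lda`, the hyperparameter defaults (`alpha`, `beta`), the iteration count, and the toy documents are all assumptions made for the example.

```python
import random


def gibbs_lda(docs, num_topics, alpha=0.1, beta=0.01, iterations=200, seed=0):
    """Collapsed Gibbs sampling for LDA on tokenized documents (lists of words).

    Hypothetical helper for illustration; the defaults are arbitrary choices.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    # Count tables maintained by the sampler.
    doc_topic = [[0] * num_topics for _ in docs]                # n(d, k)
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # n(k, w)
    topic_total = [0] * num_topics                              # n(k)

    # Give every token a random initial topic and record it in the counts.
    assignments = []
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Unnormalized full conditional p(z = t | all other assignments).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                # Resample the topic and add the token back into the counts.
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1

    return vocab, doc_topic, topic_word


# Toy corpus (made up for this example).
docs = [
    "apple banana fruit apple banana".split(),
    "fruit banana apple fruit".split(),
    "engine wheel car engine wheel".split(),
    "car wheel engine car".split(),
]
vocab, doc_topic, topic_word = gibbs_lda(docs, num_topics=2)

# Print the three highest-count words for each topic.
for k, counts in enumerate(topic_word):
    top = sorted(zip(counts, vocab), reverse=True)[:3]
    print(k, [w for _, w in top])
```

The sampler keeps only count tables and resamples one token at a time, which is why it is easy to write but slow on large corpora. For real data, one of the packages above is the better choice.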