Week 9
- This week we increased the size of the data set and tested the domain-based methodology on 92349 documents. We applied the ClassificationForest to these documents and measured the accuracy of the classifier before and after eliminating 1000 words from the domain-based common words.
- Built the ground truth for the classification method and computed
precision, recall, and F1 measure against it.
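
The measures named above can be sketched in a few lines of plain Python. This is only a minimal illustration, not the code used in the experiments: the function names `precision_recall_f1` and `accuracy`, the class names, and the toy label lists are all hypothetical, and the ClassificationForest itself is not reproduced here. It assumes the ground truth and the classifier output are available as two equal-length lists of class labels.

```python
from collections import Counter


def precision_recall_f1(y_true, y_pred, positive):
    """Return (precision, recall, F1) with one class treated as positive."""
    pairs = Counter(zip(y_true, y_pred))
    # Count true positives, false positives, and false negatives.
    tp = sum(n for (t, p), n in pairs.items() if t == positive and p == positive)
    fp = sum(n for (t, p), n in pairs.items() if t != positive and p == positive)
    fn = sum(n for (t, p), n in pairs.items() if t == positive and p != positive)
    # Guard against empty denominators by falling back to 0.0.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


def accuracy(y_true, y_pred):
    """Fraction of documents whose predicted label matches the ground truth."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy labels, NOT taken from the 92349-document data set.
truth = ["sports", "sports", "politics", "politics", "sports"]
predicted = ["sports", "politics", "politics", "politics", "sports"]

p, r, f1 = precision_recall_f1(truth, predicted, positive="sports")
print("precision:", p, "recall:", r, "F1:", f1)
print("accuracy:", accuracy(truth, predicted))
```

Running the same two functions on the classifier output before and after the common words are removed would give the before/after comparison described in the first item.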