Cluster Wikipedia documents using k-means ============================================ Use file people_wiki.cvs.zip for this project. To work with text data, we must first convert the documents into numerical features. As in the first assignment, let's extract TF-IDF features for each article. Find a library function which will do it or write your own code. Cluster documents using k-means algorithm Visualize some of the clustering results to see if we think the clustering makes sense.