k-means in Apache Pig - Carnegie Mellon School of Computer Science

k-means in Apache Pig: input data. Assume we need to cluster documents, which are stored in a 3-column table D. The initial centroids are k randomly chosen docs, stored in ...
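As a rough illustration of the initialization step, the sketch below picks k random documents as starting centroids. It is plain Python rather than Pig Latin, and it is not taken from the slides: the column layout `(doc_id, term, weight)` is an assumption (the source only says D has three columns), and the names `initial_centroids`, `rows`, and `seed` are hypothetical.

```python
import random
from collections import defaultdict


def initial_centroids(rows, k, seed=0):
    """Pick k random documents from table D to serve as initial centroids.

    ASSUMPTION: each row of D is a (doc_id, term, weight) tuple. The source
    only states that D has three columns; this layout is hypothetical.
    """
    # Group the rows of D by document to get one sparse vector per doc.
    docs = defaultdict(dict)
    for doc_id, term, weight in rows:
        docs[doc_id][term] = weight

    # Sample k distinct document ids; sorting makes a fixed seed reproducible.
    rng = random.Random(seed)
    chosen = rng.sample(sorted(docs), k)

    # Map centroid id (0..k-1) to a copy of the chosen document's vector.
    return {cid: dict(docs[doc_id]) for cid, doc_id in enumerate(chosen)}


if __name__ == "__main__":
    D = [
        ("d1", "pig", 1.0), ("d1", "hadoop", 2.0),
        ("d2", "pig", 0.5),
        ("d3", "cluster", 3.0),
    ]
    print(initial_centroids(D, k=2))
```

In Pig itself the same idea would be expressed with relational operators over D rather than an in-memory dictionary; the sketch only shows the logic of the step.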