Cluster analysis of a large sparse matrix in R
Question:
I have a transaction dataset with 250,000 transactions (rows) and 2,183 items (columns). I want to convert it to a sparse matrix and then run hierarchical clustering on it. I tried the 'sparcl' package, but it does not seem to work on sparse matrices. Any suggestions on how to solve this? Or is there another package I can use to do cluster analysis on a sparse matrix? Thanks!
Answer:
Affinity propagation, as implemented in the apcluster package, has supported sparse matrices since version 1.4.0, so give it a try.
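Here is a minimal sketch of what that could look like. It uses made-up toy data (200 transactions, 50 items) rather than the real dataset, and the similarity measure (negative squared Euclidean distance) and the cutoff of -8.5 are assumptions chosen only for illustration. Note that the sparse input apcluster accepts is a sparse similarity matrix, and that affinity propagation is a different method from hierarchical clustering. The toy sample is converted to a dense matrix to compute similarities, which would not be feasible for all 250,000 rows at once.

```r
library(Matrix)     # sparse matrix classes
library(apcluster)  # affinity propagation (sparse input needs >= 1.4.0)

set.seed(1)

# Hypothetical toy data: 200 transactions, 50 items, up to 5 items each,
# given as (transaction, item) index pairs.
n_trans <- 200
n_items <- 50
pairs <- unique(cbind(rep(seq_len(n_trans), each = 5),
                      sample(n_items, n_trans * 5, replace = TRUE)))

# Binary transaction-by-item matrix stored in sparse form.
x <- sparseMatrix(i = pairs[, 1], j = pairs[, 2], x = 1,
                  dims = c(n_trans, n_items))

# Pairwise similarities on the small sample: negative squared
# Euclidean distance (an assumption, other measures may fit better).
dsim <- negDistMat(as.matrix(x), r = 2)

# Drop weak similarities to obtain a sparse similarity matrix.
# The cutoff -8.5 is arbitrary and should be tuned on real data.
ssim <- as.SparseSimilarityMatrix(dsim, lower = -8.5)

# Run affinity propagation on the sparse similarities;
# q = 0 sets the input preference to the minimum similarity.
res <- apcluster(ssim, q = 0)

length(res@exemplars)  # number of clusters found
head(res@idx)          # exemplar assigned to the first few transactions
```

For the full dataset, the similarity computation itself would need to avoid the dense step, so treat this as a starting point for experimenting on a subsample.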