如何从熊猫对称数据框中提取元组

问题描述：

我有一个表示对称矩阵的数据框:

I have a dataframe representing a symmetric matrix:

我要去的地方:

[(a,b,2),(a,c,3),(a,d,4),(b,c,6),...]

是否有使用pythonic/pandatic/代数的方式进行操作，或者我应该进行for循环?

Is there a pythonic/pandatic/ algebraic way of doing it or I should go of for loops?

谢谢.

答

df.unstack().reset_index().values

从某种意义上说，这将保留重复项，因为(a, b, 2)和(b, a, 2)都将在列表中.然后，您可以在lambda t: t[0] < t[1]上进行过滤.

This will keep duplicates in a sense that both (a, b, 2) and (b, a, 2) will be in the list. You can then filter on lambda t: t[0] < t[1].