元组到 pandas 数据框列表的列表

问题描述：

我有这个数组(这是相似度计算的结果)，它是一个像这样的元组列表:

I have this array (it's a result from similarity calcul) it's a list of tuples like this:

example = [[(a,b), (c,d)], [(a1,b1), (c1,d2)] …]

例如，有121044个列表，每个列表包含30个元组.

In example there is 121044 list of 30 tuples each.

我想要一个像元组的第二个值(即b，d，b1，d2)那样的熊猫数据框，而又不花很多时间计算它

I want to have a pandas Dataframe like of just the second value of the tuples (i.e : b, d, b1, d2) without spending to much time compute it

您有什么想法吗?

答

使用嵌套列表理解:

df = pd.DataFrame([[y[1] for y in  x] for x in example])
print (df)
    0   1
0   b   d
1  b1  d2

df = pd.DataFrame([[y[1] for y in  x] for x in example], columns=['col1','col2'])
print (df)
  col1 col2
0    b    d
1   b1   d2