Question

提供示例后，问题变得更加清晰。我有一个包含两列的数据框，一列为字符串型，一列为整数：

Col1   Col2
-------------
str1    2
str2    4
str3    1

现在我需要一个列表，其中包含Col1中的字符串乘以Col2中的数字，即['str1', 'str1', 'str2', 'str2', 'str2', 'str2', 'str3']。

最有效的方法是什么？

Answer 1

print (df)
                    creationDate  density
188080 2019-08-01 21:28:39+03:00      5.0
188081 2019-08-01 21:33:13+03:00      5.0
188082 2019-08-01 21:39:53+03:00      5.0
188083 2019-08-01 21:43:24+03:00      4.0
188084 2019-08-01 21:48:17+03:00      3.0
188085 2019-08-01 21:52:56+03:00      3.0 #here you have 3 rows within 20 minutes
188086 2019-08-01 21:58:27+03:00      4.0
188087 2019-08-01 22:10:50+03:00      3.0
188088 2019-08-01 22:14:58+03:00      2.0
188089 2019-08-01 22:17:43+03:00      1.0

输出：

import numpy as np
np.repeat(df.Col1.values, df.Col2.values)

如何从数据框创建一个包含字符串列的每个值乘以第二列的列表？

1 个答案: