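I have 2 million VINs. I need to split these 2 million VINs into 1,500 different lists in the following format: "VIN";"VIN";"VIN";"VIN"
I'm not sure how to iterate over the VINs, put them into the lists of 1,500, and then collect those lists in one outer list. In the end I need to loop over the outer list and send each list of 1,500 to an API call in the format above.
I tried this code sample, which is meant to build lists of 3: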
import pandas as pd

# build the sample data
df = pd.DataFrame(columns=["vin"], data=['LHJLC79U58B001633', 'SZC84294845693987', 'LFGTCKPA665700387', 'L8YTCKPV49Y010001',
                                         'LJ4TCBPV27Y010217', 'LFGTCKPM481006270', 'LFGTCKPM581004253', 'LTBPN8J00DC003107',
                                         '1A9LPEER3FC596536', '1A9LREAR5FC596814', '1A9LKEER2GC596611', '1A9L0EAH9C596099',
                                         '22A000018'])
df['manufacturer'] = ['A', 'A', 'A', 'A', 'B', 'B', 'B', 'B', 'B', 'C', 'C', 'D', 'D']
for i in df:
    list_of_lists = []
    list = []
    df['vin'] = list.append(3)
    list = list_of_lists.append()
    return list_of_list
Answer 0 (score: 1)
You can use numpy.array_split to split the DataFrame. For instance, to split the example into 4 chunks of roughly equal size:
import numpy as np

aux = np.array_split(df, 4)
Output:
aux[0]
                 vin manufacturer
0  LHJLC79U58B001633            A
1  SZC84294845693987            A
2  LFGTCKPA665700387            A
3  L8YTCKPV49Y010001            A

aux[1]
                 vin manufacturer
4  LJ4TCBPV27Y010217            B
5  LFGTCKPM481006270            B
6  LFGTCKPM581004253            B

aux[2]
                 vin manufacturer
7  LTBPN8J00DC003107            B
8  1A9LPEER3FC596536            B
9  1A9LREAR5FC596814            C

aux[3]
                  vin manufacturer
10  1A9LKEER2GC596611            C
11   1A9L0EAH9C596099            D
12          22A000018            D
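
Note that numpy.array_split takes the number of chunks, not the chunk size. If what you actually need is batches of at most 1,500 VINs, each joined as "VIN";"VIN";..., plain slicing may be simpler. The following is only a sketch under assumptions: `batches_of` and `format_batch` are hypothetical helper names, the batch size is set to 2 so the small sample is readable, and the `print` stands in for whatever your real API call is. With the DataFrame from the question, the list would come from `df["vin"].tolist()`.

```python
# A few VINs from the question's sample; the real list would hold about 2 million.
vins = [
    "LHJLC79U58B001633",
    "SZC84294845693987",
    "LFGTCKPA665700387",
    "L8YTCKPV49Y010001",
    "LJ4TCBPV27Y010217",
]


def batches_of(items, size):
    """Yield consecutive slices of `items` holding at most `size` elements each."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def format_batch(batch):
    """Quote each VIN and join with semicolons: "VIN";"VIN";"VIN"."""
    return ";".join(f'"{vin}"' for vin in batch)


batch_size = 2  # assumption for the demo; use 1500 for the real data
payloads = [format_batch(batch) for batch in batches_of(vins, batch_size)]

for payload in payloads:
    # Placeholder: replace print() with the actual API call.
    print(payload)
```

Because `batches_of` is a generator, you can also loop over it directly and send each batch as it is produced, rather than building the whole list of payloads first.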