Question

我有一个数据集，其中包含起点站IDS，终点站IDS的数量以及城市中自行车旅行的持续时间。数据可追溯到2017年，因此某些站点现在不存在。我有这些电台ID的列表。如何从数据框中删除在这些站点开始或结束的行？

例如，如果我要删除索引4和5中的StartStation ID = 135，该怎么办？这意味着一百万行，其中135个可以出现在任何地方。

Bike   Id      StartStation Id  EndStation Id   Duration    
0      395     573              137.0           660.0   
1      12931   399              507.0           420.0   
2      7120    399              507.0           420.0
3      1198    599              616.0           300.0   
4      10739   135              486.0           1260.0  
5      10949   135              486.0           1260.0  
6      8831    193              411.0           540.0   
7      8778    266              770.0           600.0   
8      700     137              294.0           540.0   
9      5017    456              39.0            3000.0  
10     4359    444              445.0           240.0   
11     2801    288              288.0           5340.0  
12     9525    265              592.0           300.0

Answer 1

我正在呼叫您的ID列表以删除removed_ids。

df=df.loc[
    (~df['StartStation ID'].isin(removed_ids)) &\
    (~df['EndStation ID'].isin(removed_ids))
]

在python数据框中删除具有相同索引值的多行

1 个答案: