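How do I remove duplicates from a DataFrame? I have already used drop_duplicates(),
but it still keeps one copy of the duplicated row. I want to remove every trace of the duplicates.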
df:
    Name  Age     Sex
0  James   24    Male
1  Alice   28  Female
2   Phil   40    Male
3  James   24    Male
Code snippet:
import pandas as pd

# Rows 0 and 3 are identical in every column
data = {"Name": ["James", "Alice", "Phil", "James"],
        "Age": [24, 28, 40, 24],
        "Sex": ["Male", "Female", "Male", "Male"]}
df = pd.DataFrame(data)
Desired df output:
    Name  Age     Sex
1  Alice   28  Female
2   Phil   40    Male
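
One possible approach, offered as a minimal sketch rather than the only way: drop_duplicates() accepts a keep parameter, and keep=False drops every row that has a duplicate instead of keeping the first or last copy. The sketch assumes rows count as duplicates only when all columns match, which is the default. The names result and alt are illustrative and not part of the original snippet.

```python
import pandas as pd

data = {"Name": ["James", "Alice", "Phil", "James"],
        "Age": [24, 28, 40, 24],
        "Sex": ["Male", "Female", "Male", "Male"]}
df = pd.DataFrame(data)

# keep=False removes all rows that appear more than once,
# so neither copy of the James row survives
result = df.drop_duplicates(keep=False)

# Equivalent boolean-mask form: duplicated(keep=False) marks every
# member of a duplicate group as True, and ~ inverts the mask
alt = df[~df.duplicated(keep=False)]

print(result)
```

The original index labels (1 and 2) are preserved, which matches the desired output. If only some columns should decide what counts as a duplicate, both methods also take a subset argument, for example subset=["Name"].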