Question

我正在尝试比较2个数据框的值。如果任何值不同，我将返回该行。

df1
     Value1 Value2
Name     
a      1      1
b      1      2
c      0      1

df2
     Value Value2
Name     
a      1      1
b      1      1
c      1      1

我做了df1 == df2

df3
     Value Value2
Name
a    True  True 
b    True  False
c    False True

我想只返回b和c，我该怎么做？我不想做

df3[(df3['Value']==False)|(df3['Value2'==False)]

因为我可能有两个以上的列，列名可能不同

Answer 1

假设您有一个数据文件（d3.txt）或数据列表，如（line），

line = [i.strip().split() for i in open("d3.txt").readlines()]

print line 
[['#df3'], ['#', 'Value', 'Value2'], ['#Name'], ['#a', 'True', 'True'], ['#b', 'True', 'False'], ['#c', 'False', 'True']]

 for i in line[:][:]:
    mydict[i[0]] = ",".join(li[li.index(i)][1:])

我刚刚创建了一本字典。所以你可以打电话给

print mydict
print mydict['#a'] #Depend of which name you want to look.

输出

{'#': 'Value,Value2', '#c': 'False,True', '#b': 'True,False', '#a': 'True,True', '#Name': '', '#df3': ''}
True,True

或者你可以不用创建字典就这样做，

for n in range(len(line)):
    if (line[n][0] == '#c' or line[n][0]== '#b'):
        print line[n][:]

输出是（也许这就是你想要的）：

['#b', 'True', 'False']  
['#c', 'False', 'True']

Answer 2

我认为应该这样做：

df3[~df3.all(axis=1)]

Python比较2个数据帧中的变量

2 个答案: