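The input is a list of runs scored by batsmen. The method should return the country whose batsmen have the highest average runs.
I'm trying to find the highest average. For example, when the list below is passed to my method, it should return "Pakistan".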
[
["Pakistan", 23],
["Pakistan", 127],
["India", 3],
["India", 71],
["Australia", 31],
["India", 22],
["Pakistan", 81]
]
What I tried:
Creating two dictionaries:
total={'Australia': 31, 'India': 96, 'Pakistan': 231}
division={'Australia': 1, 'India': 3, 'Pakistan': 3}
My idea was to divide the values of the two dicts and pick the highest result.
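For reference, the two-dictionary idea can be sketched roughly as below. This is only a minimal sketch: the function name `highest_average_country` and the variable `scores` are made up for illustration, and it assumes every row is a `[country, runs]` pair and that the list is not empty (`max` raises `ValueError` on an empty dict).

```python
def highest_average_country(scores):  # hypothetical helper name
    total = {}     # country -> sum of runs
    division = {}  # country -> number of entries
    for country, runs in scores:
        total[country] = total.get(country, 0) + runs
        division[country] = division.get(country, 0) + 1
    # Divide matching entries to get each country's average.
    averages = {country: total[country] / division[country] for country in total}
    # Return the key whose average is largest.
    return max(averages, key=averages.get)


scores = [
    ["Pakistan", 23],
    ["Pakistan", 127],
    ["India", 3],
    ["India", 71],
    ["Australia", 31],
    ["India", 22],
    ["Pakistan", 81],
]
print(highest_average_country(scores))
```

Passing `key=averages.get` makes `max` compare the averages rather than the country names.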
Is there a more efficient way to do this?
Thanks for your help.
Answer 0 (score: 2)
You can use pandas for this. Your code would look like:
import pandas as pd
data = [
["Pakistan", 23],
["Pakistan", 127],
["India", 3],
["India", 71],
["Australia", 31],
["India", 22],
["Pakistan", 81]
]
df = pd.DataFrame(data, columns=['country', 'count'])
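# Average per country, as a Series indexed by country
means = df.groupby('country')['count'].mean()
# idxmax() returns the country with the largest average;
# calling max() on the whole frame would take each column's maximum independently
highest = [means.idxmax(), float(means.max())]
print(highest)
Prints:
['Pakistan', 77.0]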
Answer 1 (score: 1)
This can probably be done in fewer lines of code, but it works!
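def average(data):
    # Group the scores by country.
    highest = {}
    for country, runs in data:
        if country in highest:
            highest[country].append(runs)
        else:
            highest[country] = [runs]
    # Replace each list of scores with its average.
    for country in highest:
        highest[country] = sum(highest[country]) / len(highest[country])
    # Track the best country and its average separately.
    answer = None
    best = float('-inf')
    for country in highest:
        if highest[country] > best:
            best = highest[country]
            answer = country
    return answer

print(average(data))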
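Answer 2 (score: 1)
You can build a dictionary keyed by country name, with a list holding that country's entry count and total score as the value. You can then update the same dictionary to hold the averages, and use max to get the country with the highest average.
Here is the code: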
>>> a = [
["Pakistan", 23],
["Pakistan", 127],
["India", 3],
["India", 71],
["Australia", 31],
["India", 22],
["Pakistan", 81]
]
>>>
>>>
>>> a
[['Pakistan', 23], ['Pakistan', 127], ['India', 3], ['India', 71], ['Australia', 31], ['India', 22], ['Pakistan', 81]]
>>> d = {}
>>> for l in a:
...     if l[0] not in d:
...         d[l[0]] = [1, l[1]]
...     else:
...         d[l[0]] = [d[l[0]][0] + 1, d[l[0]][1] + l[1]]
...
>>> # updated dict: country -> [count, total]
>>> d
{'Pakistan': [3, 231], 'Australia': [1, 31], 'India': [3, 96]}
>>> for key, val in d.items():
...     d[key] = val[1] / val[0]
...
>>> # updated dict with the average per country
>>> d
{'Pakistan': 77.0, 'Australia': 31.0, 'India': 32.0}
>>> max(d.items(), key=lambda item: item[1])
('Pakistan', 77.0)
>>>
This can be done in a simpler, more Pythonic way, but that is the logic.
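One possible shape for a more compact version, offered only as a sketch: it groups the scores with `collections.defaultdict` and compares countries by `statistics.mean`. The function name `best_country` and the variable `rows` are made up for illustration, and the input is assumed to be a non-empty list of `[country, runs]` pairs.

```python
from collections import defaultdict
from statistics import mean


def best_country(rows):  # hypothetical helper name
    # Group every score under its country.
    runs_by_country = defaultdict(list)
    for country, runs in rows:
        runs_by_country[country].append(runs)
    # Compare countries by the mean of their scores, not by name.
    return max(runs_by_country, key=lambda c: mean(runs_by_country[c]))


rows = [
    ["Pakistan", 23],
    ["Pakistan", 127],
    ["India", 3],
    ["India", 71],
    ["Australia", 31],
    ["India", 22],
    ["Pakistan", 81],
]
print(best_country(rows))
```

If two countries tie on the average, `max` returns whichever one it encounters first.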
Answer 3 (score: 1)
This is another approach:
lst = [
["Pakistan", 23],
["Pakistan", 127],
["India", 3],
["India", 71],
["Australia", 31],
["India", 22],
["Pakistan", 81]
]
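tuples = [tuple(i) for i in lst]
# Collect the scores for each country.
newdata = {}
for k, v in tuples:
    newdata.setdefault(k, []).append(v)
# Average per country.
result = {k: sum(v) / len(v) for k, v in newdata.items()}
# Pick the country by its average; a bare max(result) would compare the names.
a = max(result, key=result.get)
b = result[a]
print("The highest average is %s: %s" % (a, b))
Output:
The highest average is Pakistan: 77.0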