我正在研究Minkowski距离,其定义如下:
我使用for循环来进行如下计算,
import numpy as np
import random
A = np.random.randint(5, size=(10, 5))
B = [1, 3, 5, 2, 4]
for i in range(10):
dist = (sum((abs(A[i]-B))**5))**(1/5) # I set p=5 in this case
print("Distances: ", dist)
有什么方法可以避免使用numpy技术造成此循环吗?
答案 0 :(得分:4)
您可以使用broadcasting:
import numpy as np
np.random.seed(42)
A = np.random.randint(5, size=(10, 5))
B = [1, 3, 5, 2, 4]
result = (np.abs(A - B)**5).sum(axis=1)**(1/5)
print(result)
for i in range(10):
dist = (sum((abs(A[i]-B))**5))**(1/5) # I set p=5 in this case
print("Distances: ", dist)
输出
[3.14564815 3.00246508 2.04767251 2.02439746 4.04953891 4.00312013
2.49663093 3.49301675 3.53370523 2.04767251]
Distances: 3.1456481457393184
Distances: 3.0024650813881837
Distances: 2.0476725110792193
Distances: 2.024397458499885
Distances: 4.049538907295691
Distances: 4.003120128600393
Distances: 2.496630931732087
Distances: 3.4930167541811468
Distances: 3.5337052340491883
Distances: 2.0476725110792193