问题：

Question

我刚写了这个非常简单的手写数字识别。 Here is 8kb archive使用以下代码+十个.PNG图像文件。它有效：被公认为。

简而言之，数据库的每个数字（50x50像素= 250个系数）汇总为10系数向量（通过保留10个最大奇异值，请参阅Low-rank approximation with SVD）

然后，对于要识别的数字，我们最小化与数据库中数字的距离。

from scipy import misc
import numpy as np
import matplotlib.pyplot as plt

digits = []
for i in range(11):
    M = misc.imread(str(i) + '.png', flatten=True)
    U, s, V = np.linalg.svd(M, full_matrices=False)
    s[10:] = 0        # keep the 10 biggest singular values only, discard others
    S = np.diag(s)
    M_reduced = np.dot(U, np.dot(S, V))      # reconstitution of image with 10 biggest singular values
    digits.append({'original': M, 'singular': s[:10], 'reduced': M_reduced})

# each 50x50 pixels digit is summarized into a vector of 10 coefficients : the 10 biggest singular values s[:10]    

# 0.png to 9.png = all the digits (for machine training)
# 10.png = the digit to be recognized
toberecognizeddigit = digits[10]    
digits = digits[:10]

# we find the nearest-neighbour by minimizing the distance between singular values of toberecoginzeddigit and all the digits in database
recognizeddigit = min(digits[:10], key=lambda d: sum((d['singular']-toberecognizeddigit['singular'])**2))    

plt.imshow(toberecognizeddigit['reduced'], interpolation='nearest', cmap=plt.cm.Greys_r)
plt.show()
plt.imshow(recognizeddigit['reduced'], interpolation='nearest', cmap=plt.cm.Greys_r)
plt.show()

问题：

代码有效（您可以在ZIP存档中运行代码），但我们如何才能改进它以获得更好的结果？（主要是我想象的数学技术）。

例如，在我的测试中，9和3有时会相互混淆。

Answer 1

数字识别可能是一个非常困难的领域。特别是当数字以非常不同或不清楚的方式书写时。为了解决这个问题，已经采取了许多方法，并且整个竞标都致力于这一主题。有关示例，请参阅Kaggle's digit recognizer competition。本次比赛基于众所周知的MNIST data set。在那里的论坛中，你会发现很多关于这个问题的想法和方法，但我会给出一些快速的建议。

很多人将此问题作为分类问题来解决。解决此类问题的可能算法包括例如kNN，神经网络或梯度增强。

然而，通常只是算法不足以获得最佳分类率。提高分数的另一个重要方面是特征提取。我们的想法是计算能够区分不同数字的特征。此数据集的某些示例功能可能包括彩色像素的数量，或者可能包括数字的宽度和高度。

虽然其他算法可能不是您想要的，但添加更多功能可能会提高您当前使用的算法的性能。

简单（工作）手写数字识别：如何改进？

问题：

1 个答案: