我正在尝试使用AlexNethttps实现图像搜索引擎://github.com/akrizhevsky/cuda-convnet2
这个想法是通过训练神经网络来对图像进行分类,然后使用网络最后隐藏层中的代码作为相似性度量来实现图像搜索引擎。
我正在试图弄清楚如何在一组新的图像上训练CNN来对它们进行分类。有谁知道如何开始这个?
由于
答案 0 :(得分:0)
You basically have two approaches to your problem:
-Either you have plenty of good training data (>1M) and dozens of GPUs and you retrain the network from scratch using SGD with the classes you have for your queries.
-Either you don't and then you simply truncate a pretrained AlexNet (where exactly you truncate it is for you to choose) and plug it to your images (possibly resized to fit the network (227x227x3 if I am not mistaken)). Then from your image you get a feature vector (sometimes called a descriptor) and you use those feature vectors to train a linear SVM on your images and your specific task.