
Time: 2013-11-06 11:35:43

Tags: python rotation translation correlation phase


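Now I want to determine how many pixels (or any other unit of distance) the modified image has been translated from the original, and by how many degrees it has been rotated relative to the original. Phase correlation is supposed to solve this by first converting the coordinates to log-polar coordinates and then, after a number of further steps, producing a correlation matrix. I should then find the peak in that matrix, and its (x, y) position will somehow reveal the translation and rotation. This link explains it better: Phase correlation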
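For the translation part alone, the peak search described above can be sketched as follows. This is a minimal illustration on synthetic data, not the code from the question: the function name `phase_correlation_shift` and the test arrays are assumptions made for the example, and it only recovers integer circular shifts between two same-sized 2-D arrays.

```python
import numpy as np

def phase_correlation_shift(reference, moved):
    """Estimate the integer (row, col) shift that maps `reference` onto `moved`.

    Hypothetical helper for illustration; both inputs must be 2-D arrays
    of the same shape.
    """
    F_ref = np.fft.fft2(reference)
    F_mov = np.fft.fft2(moved)
    # Normalized cross-power spectrum: discard magnitude, keep the phase difference.
    cross_power = F_mov * np.conj(F_ref)
    cross_power /= np.abs(cross_power) + 1e-12  # small constant avoids division by zero
    correlation = np.fft.ifft2(cross_power).real
    # The position of the peak encodes the circular shift.
    peak = np.unravel_index(np.argmax(correlation), correlation.shape)
    # Peaks past the midpoint correspond to negative shifts (FFT wrap-around).
    shift = [p if p <= s // 2 else p - s for p, s in zip(peak, correlation.shape)]
    return tuple(int(v) for v in shift)

# Synthetic test data (assumed): a random image and a circularly shifted copy.
rng = np.random.default_rng(0)
reference = rng.random((64, 64))
moved = np.roll(reference, shift=(5, -3), axis=(0, 1))
shift = phase_correlation_shift(reference, moved)
print(shift)
```

Here `shift` comes out as `(5, -3)`, the roll used to build `moved`. Recovering the rotation would additionally need the log-polar resampling step, after which a rotation should show up as a shift along the angle axis.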


import scipy as sp
from scipy import ndimage
from PIL import Image
from math import *
import numpy as np

def logpolar(input,silent=False):
    # This takes a numpy array and returns it in Log-Polar coordinates.

    if not silent: print("Creating log-polar coordinates...")
    # Create a cartesian array which will be used to compute log-polar coordinates.
    coordinates = sp.mgrid[0:max(input.shape)*2,0:360]
    # Compute a normalized logarithmic gradient
    log_r = 10**(coordinates[0,:]/(input.shape[0]*2.)*log10(input.shape[1]))
    # Create a linear gradient going from 0 to 2*Pi
    angle = 2.*pi*(coordinates[1,:]/360.)

    # Using scipy's map_coordinates(), we map the input array on the log-polar 
    # coordinate. Do not forget to center the coordinates!
    if not silent: print("Interpolation...")
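    # NOTE: this call is cut off in the post; the arguments after `input`
    # are an assumed completion based on the comments above.
    lpinput = ndimage.interpolation.map_coordinates(input,
                                            (log_r*np.cos(angle)+input.shape[0]/2.,
                                             log_r*np.sin(angle)+input.shape[1]/2.),
                                            order=3,mode='constant')

    # Returning log-polar...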
    return lpinput

def load_image( infilename ) :
    img = Image.open( infilename )
    data = np.asarray( img, dtype="int32" )
    return data

def save_image( npdata, outfilename ) :
    img = Image.fromarray( np.asarray( np.clip(npdata,0,255), dtype="uint8"), "L" )
    img.save( outfilename )

image = load_image("C:/images/testing_image1.jpg")
target = load_image("C:/images/testing_otherimage.jpg")

# Conversion to log-polar coordinates
lpimage = logpolar(image)
lptarget = logpolar(target)

# Correlation through FFTs
Fcorr = np.fft.fft(lpimage)*np.fft.fft(lptarget)
correlation = np.fft.ifft(Fcorr)


Traceback (most recent call last):
  File "./phase.py", line 44, in <module>
    lpimage = logpolar(image)
  File "./phase.py", line 24, in logpolar
  File "C:\Python27\lib\site-packages\scipy\ndimage\interpolation.py", line 295, in map_coordinates
    raise RuntimeError('invalid shape for coordinate array')
RuntimeError: invalid shape for coordinate array


Traceback (most recent call last):
  File "./phase.py", line 41, in <module>
  File "./phase.py", line 36, in save_image
    img = Image.fromarray( np.asarray( np.clip(npdata,0,255), dtype="uint8"), "L" )
  File "C:\Python27\lib\site-packages\PIL\Image.py", line 1917, in fromarray
    raise ValueError("Too many dimensions.")
ValueError: Too many dimensions.

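Looking at the original documentation has not given me much insight into the problem. I don't think the code that converts the image to a numpy array is wrong, because I checked the type with print type(image) and the result looks legitimate. But I cannot convert the array back to an image. Any help would be greatly appreciated.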
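A type check alone cannot distinguish a color image from a grayscale one, since both come back as `numpy.ndarray`. One possible check, sketched here with zero-filled stand-in arrays rather than the files from the question (the sizes are assumptions), is to look at `ndim` and `shape` as well:

```python
import numpy as np

# Stand-ins for decoded images (assumed sizes): a color JPEG typically
# decodes to (height, width, 3), a grayscale one to (height, width).
rgb = np.zeros((48, 64, 3), dtype="int32")
gray = np.zeros((48, 64), dtype="int32")

for name, arr in (("rgb", rgb), ("gray", gray)):
    # The type is identical for both; only ndim and shape differ.
    print(name, type(arr).__name__, arr.ndim, arr.shape)
```

Both tracebacks are consistent with a three-dimensional array being passed where a two-dimensional one is expected.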

1 answer:

Answer 0 (score: 2)


image = load_image("/path/to/image")[:,:,0]
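The slice keeps only the first channel, so the array that is passed on has two dimensions, which is what both `map_coordinates` with two coordinate arrays and `Image.fromarray(..., "L")` expect. As a hedged sketch of the idea, assuming the JPEG decodes to an array of shape (height, width, 3): the helper `to_grayscale` below is hypothetical, and the luma weights are the ITU-R 601 ones that Pillow documents for `convert("L")`, offered as an alternative to picking a single channel.

```python
import numpy as np

def to_grayscale(data):
    """Return a 2-D array from a 2-D or (H, W, 3+) image array.

    Hypothetical helper for illustration only.
    """
    if data.ndim == 2:
        return data
    # ITU-R 601 luma weights applied to the R, G, B channels.
    weights = np.array([0.299, 0.587, 0.114])
    return data[:, :, :3] @ weights

# Synthetic stand-in for a loaded color image (assumed shape).
rgb = np.zeros((4, 5, 3), dtype="int32")
rgb[:, :, 0] = 200  # only the red channel carries signal

first_channel = rgb[:, :, 0]  # what the slice in the answer does
luma = to_grayscale(rgb)      # weighted mix of all three channels

print(first_channel.shape, luma.shape)
```

Either result is two-dimensional. Taking one channel is the simplest option, while the weighted mix uses information from all three channels.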