我有一些用文件夹分开的图像。因此,我导入了它们并将其转换为像素数组。当我输入:
In [9]: X_train.shape
out [9]: (7467,60,80,3)
我想在此后面加上编号。类,创建数据集并另存为.json
文件,然后导入新的笔记本中,并为我自己的项目目的进行图像处理。
所以我输入了这段代码:
In [10]: dataset = pd.DataFrame({'label': y_train, 'images': list(X_train)},
columns=['label', 'images'])
但是当我输入时:
In [11]: X_train
Out [11]: array([[[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.],
...,
[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.]],
[[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.],
...,
[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.]],
[[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.],
...,
[0., 0., 0.],
[0., 0., 0.],
[0., 0., 0.]],
...,
但是当我导入json文件并显示:
In [2]: train=pd.read_json('train_file.json')
train.head()
Out [2]:
image_no images
0 7468 [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0039215...
1 7469 [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0,...
10 7478 [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0,...
100 7568 [[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0,...
1000 8468 [[[0.27058823530000004, 0.1843137255, 0.247058.
..
当我输入时:
In [3]: train['images].values
Out [3]: array([list([[[0.7411764706, 0.7607843137, 0.8274509804], `[0.7215686275000001, 0.7058823529, 0.7882352941], [0.7019607843, 0.6823529412, 0.7843137255], [0.7176470588, 0.7215686275000001, 0.8196078431], [0.8, 0.8352941176, 0.8549019608], [0.8352941176, 0.8666666667, 0.8666666667], [0.8509803922, 0.8745098039, 0.8666666667], [0.8549019608, 0.8745098039, 0.8666666667], [0.8431372549, 0.8666666667, 0.8666666667], [0.8235294118, 0.8705882353000001, 0.8588235294000001], [0.831372549, 0.8705882353000001, 0.8627450980000001], [0.8352941176, 0.831372549, 0.8549019608], [0.7686274510000001, 0.7686274510000001, 0.8117647059], [0.7098039216, 0.7254901961, 0.7803921569000001], [0.7019607843, 0.7333333333000001, 0.8], [0.7254901961, 0.7686274510000001, 0.8392156863], [0.7647058824, 0.7803921569000001, 0.8509803922], [0.7372549020000001, 0.7411764706, 0.8117647059], [0.7098039216, 0.7019607843, 0.7960784314], [0.6980392157, 0.6705882353, 0.8039215686000001], [0.6901960784000001, 0.6823529412, 0.8117647059], [0.6901960784000001, 0.6901960784000001, 0.8196078431], [0.6941176471, 0.6980392157, 0.831372549], [0.6980392157, 0.7058823529, 0.8352941176], [0.7254901961, 0.7490196078, 0.8352941176], [0.8, 0.831372549, 0.8745098039], [0.8431372549, 0.8784313725, 0.8862745098], [0.8509803922, 0.8823529412000001, 0.8862745098], [0.831372549, 0.8352941176, 0.8745098039], [0.7725490196, 0.7411764706, 0.8392156863], [0.7529411765, 0.7294117647, 0.8392156863], [0.7607843137, 0.7764705882, 0.8352941176], [0.8078431373, 0.8392156863, 0.8705882353000001], [0.8274509804, 0.8549019608, 0.8862745098], [0.8117647059, 0.8431372549, 0.8705882353000001], [0.7725490196, 0.8, 0.8352941176], [0.7529411765, 0.7764705882, 0.8431372549], [0.8117647059, 0.8352941176, 0.8862745098], [0.8745098039, 0.8980392157, 0.9176470588000001], [0.8862745098, 0.9098039216, 0.9058823529000001], [0.8823529412000001, 0.9058823529000001, 0.9019607843], [0.8784313725, 0.9098039216, 0.9058823529000001], [0.8666666667, 0.9137254902, 0.9058823529000001], [0.8627450980000001, 0.9176470588000001, 0.9098039216], [0.86274509....`
当我输入时:
In [4]: train['images'].shape
Out [4]: (7467,)
但是我可以使用plt.imshow()
绘制这些图像。但是当我尝试直接做model.fit(train['images],y_train)
时
我收到此错误:
ValueError:设置具有序列的数组元素
那么我要去哪里错了?将其转储到.json
文件中时,或者导入json
文件并修复错误后如何将其转换为数组。
答案 0 :(得分:0)
将数据帧存储为np arrays
时,您的.json
将转换为列表。要将它们输入到Keras模型中,您需要将它们设置为array
形状的(images, height, width, channels)
之一:
X_train = np.array(train['images'].tolist())