How to convert a folder of images into an Excel file using Python

Asked: 2017-10-04 14:42:55

Tags: python excel openpyxl xlsxwriter

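I have a folder full of pictures, all named in the same way.

File name: ..\name_ID

I want to create a spreadsheet and put each picture's name, ID, and a link to the picture in separate columns.

Should I use openpyxl, xlsxwriter, or something else?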

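3 answers:

Answer 0: (score: 1)

I have no experience with openpyxl or xlsxwriter, but judging by the openpyxl documentation, I imagine the program would look something like this: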

from openpyxl import Workbook
from openpyxl.styles import PatternFill
from PIL import Image  # scipy.misc.imread was removed in SciPy 1.2, so Pillow is used here

wb = Workbook()
ws = wb.active

img = Image.open('image.jpg').convert('RGB')
width, height = img.size
for i in range(height):
    for j in range(width):
        # turn the (R, G, B) pixel into a hex string such as 'FF8800'
        r, g, b = img.getpixel((j, i))
        hexval = '%02X%02X%02X' % (r, g, b)
        # openpyxl rows and columns are 1-based
        cel = ws.cell(row=i + 1, column=j + 1)
        cel.fill = PatternFill('solid', fgColor=hexval)

wb.save('image.xlsx')
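
Two small conversions come up when painting pixels into cells: turning a zero-based (row, column) pair into an A1-style address such as 'D2', and turning an RGB triple into a hex colour string. A minimal sketch in plain Python follows. The helper names `column_letters`, `excel_coordinate` and `rgb_to_hex` are illustrative and not part of openpyxl, and the sketch assumes zero-based indices and 8-bit colour channels.

```python
def column_letters(col):
    """Convert a zero-based column index to Excel letters (0 -> 'A', 26 -> 'AA')."""
    letters = ''
    n = col + 1  # switch to 1-based so the base-26 arithmetic works
    while n > 0:
        n, rem = divmod(n - 1, 26)
        letters = chr(ord('A') + rem) + letters
    return letters


def excel_coordinate(row, col):
    """Convert zero-based (row, col) to an A1-style address, e.g. (1, 3) -> 'D2'."""
    return '%s%d' % (column_letters(col), row + 1)


def rgb_to_hex(pixel):
    """Convert an (R, G, B) triple of 0-255 values to a hex string like 'FF8800'."""
    r, g, b = (int(v) for v in pixel[:3])
    return '%02X%02X%02X' % (r, g, b)


print(excel_coordinate(1, 3))
print(rgb_to_hex((255, 136, 0)))
```

With these, `ws[excel_coordinate(i, j)]` addresses the same cell as `ws.cell(row=i + 1, column=j + 1)`.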

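Answer 1: (score: 1)

Here is an answer showing how to do this with xlsxwriter. It creates a spreadsheet with the name, the ID, and a link to the corresponding picture in three separate columns.

The answer uses urllib.request so that it is reproducible (the module is not required; I only use it to download three test images). I also set the directory to the current working directory, which you can change as needed. The code only looks for .png files, but you can adjust it to match other file formats.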

import urllib.request
import xlsxwriter
import os


#comment out the next 4 lines if you don't want to download 3 pictures
url = 'https://upload.wikimedia.org/wikipedia/en/thumb/4/43/Ipswich_Town.svg/255px-Ipswich_Town.svg.png'
urllib.request.urlretrieve(url, "pica_1.png")
urllib.request.urlretrieve(url, "picb_2.png")
urllib.request.urlretrieve(url, "picc_3.png")


dir_wanted = os.getcwd()
#uncomment the following line if you don't want the current directory
#dir_wanted = "C:\\users\\doe_j"


file_list = [file for file in os.listdir(dir_wanted) if file.endswith('.png')]
full_path_list = [os.path.join(dir_wanted, file) for file in file_list]

name_list = []
num_list = []

for item in file_list:
    temp_list = item.rpartition('_')
    name = str(temp_list[0])
    num = str(temp_list[2].rpartition('.')[0])
    name_list.append(name)
    num_list.append(num)


workbook = xlsxwriter.Workbook('pics_and_links.xlsx')
ws = workbook.add_worksheet('Links')

#adding column titles and making them bold
bold = workbook.add_format({'bold': True})
ws.write('A1', "Name", bold)
ws.write('B1', "Number", bold)
ws.write('C1', "Link", bold)

#putting the three lists we made into the workbook
for i in range(len(full_path_list)):
    row_num = i + 2
    ws.write('A%d' % row_num, name_list[i])
    ws.write('B%d' % row_num, int(num_list[i]))
    ws.write_url('C%d' % row_num, full_path_list[i])

#Set the width of the column with the links in it
ws.set_column(2, 2, 40)

workbook.close()
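
The name/ID split is the step most likely to break on unexpected file names, so it can help to isolate it. The sketch below assumes every file follows the `name_ID.ext` pattern from the question and that the ID is whatever follows the last underscore. The function name `split_name_id` is hypothetical and not part of any library.

```python
import os


def split_name_id(filename):
    """Split 'name_ID.ext' into (name, ID) using the last underscore."""
    stem = os.path.splitext(filename)[0]   # drop the file extension
    name, sep, id_ = stem.rpartition('_')  # split on the LAST underscore
    if not sep or not name or not id_:
        raise ValueError('expected name_ID pattern, got %r' % filename)
    return name, id_


print(split_name_id('pica_1.png'))
print(split_name_id('my_pic_12.png'))
```

Raising `ValueError` for files that do not match makes stray files in the folder visible instead of silently writing empty cells; whether to skip or abort in that case is a design choice left to the caller.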

Answer 2: (score: 1)

You can use pandas to do this:

import glob
import os
import pandas as pd

files_dir = '/home/username/files_dir' # here should be path to your directory with images
files = glob.glob(os.path.join(files_dir, '*'))
df = pd.DataFrame(columns=['name', 'id', 'hyperlink'])

for i, full_filename in enumerate(files):
    filename = os.path.basename(full_filename)
    name, id_ = filename.rsplit('_', 1)  # split on the last underscore so names may contain '_'
    id_ = os.path.splitext(id_)[0] # remove file extension from id_
    hyperlink = '=HYPERLINK("file:///{}")'.format(full_filename)
    df.loc[i] = [name, id_, hyperlink]

df.to_excel('output_file.xlsx', index=False)
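
Formatting an absolute POSIX path straight into `file:///{}` produces four slashes (`file:////home/...`), and a Windows path keeps its backslashes. One possible alternative, sketched below, is to let `pathlib` build the URI; `as_uri()` also percent-encodes characters such as spaces. The function name `hyperlink_formula` is hypothetical, and whether Excel on a given platform opens the resulting link is an assumption this sketch does not verify.

```python
import os
from pathlib import Path


def hyperlink_formula(path):
    """Return an Excel HYPERLINK formula pointing at a local file."""
    # as_uri() requires an absolute path, so make the path absolute first
    uri = Path(os.path.abspath(path)).as_uri()
    return '=HYPERLINK("{}")'.format(uri)


print(hyperlink_formula('pica_1.png'))
```

In the loop above, `hyperlink = hyperlink_formula(full_filename)` would take the place of the manual string formatting.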