将所有具有相同名称的文件放在文件夹中

时间:2012-07-09 07:05:50

标签: python

我是Python新手,我需要一个程序,可以将同一天的文件复制到新文件夹中。

示例文件:

20120807_first_day_pic.jpg
20120807_first_day_sheet.jpg
20120807_first_day_sheet2.jpg
20120907_second_day_pic.jpg
20120907_second_day_sheet.jpg
20120907_second_day_sheet2.jpg

这是我到目前为止所做的,但每个文件都有一个文件夹而不是一整天。

import os, re, shutil

tfolder = 'D:/Testing/src/'

os.chdir(tfolder)

re_year19xxxxxx = re.compile('(19[0-9][0-9][0-9][0-9])')
re_year20xxxxxx = re.compile('(20[0-9][0-9][0-9][0-9])')

re_ed = re.compile('(ED[0-9])')
destPath = 'D:/Testing/Dest/'

def analyse_file_name(fname):
    filePath, coords = os.path.split(fname) #the new folders will be named according to the first 4 characters of the original file name
    coordsFolder = coords[:53]
    coordsFname = coords[:53]
    coordsExt = os.path.splitext(fname)
    year = 'year' #create variable year
    ed = 'ed' #create variable ed to store the edition number if necessary
    bname = fname #the original file name
    for re_year in (re_year19xxxxxx, re_year20xxxxxx):
        rx = re_year.search(fname) #search for regex in the file name and store it in rx
        if rx:
            year = rx.group(1) #if the regex is found, store the year
            bname.replace(year, ' ')
            res = re_ed.search(fname)
            if res:
                ed = res.group(1)
                bname.replace(ed, ' ')
        os.chdir(destPath)  
        if year is 'year':
            fname2 = os.path.join(destPath, coordsFolder) + '\\' + coordsFname + coordsExt[1]
        else:
            fname2 = os.path.join(destPath, coordsFolder,year,ed) + '\\' + coordsFname + coordsExt[1]
        print('%s -> %s' % (fname, fname2)) #debug print
        dirn, _ = os.path.split(fname2)
        if not os.path.exists(dirn):
            os.makedirs(dirn)
        shutil.copy(fname, fname2)

for root, dirs, files in os.walk(tfolder):
    for name in files:
        fn = os.path.join(root, name)
        analyse_file_name(fn)

4 个答案:

答案 0 :(得分:2)

如果您只想复制以已知日期字符串格式开头的文件,那么这样的事情怎么样?

def copyfile(filepath, target_dir):
    p, filename = os.path.split(filepath)

    # get date component of name
    date_component = filename.split("_", 1)[0]

    # try to parse out the date
    try:
        d = datetime.datetime.strptime(date_component, "%Y%m%d")
    except ValueError:  
        print "Could not place: ", filename
        return
    target_date_dir = os.path.join(target_dir, str(d.year), str(d.month), str(d.day))
    os.makedirs(target_date_dir)
    shutil.copy(filepath, target_date_dir)

答案 1 :(得分:0)

首先,创建一个dict(这里的defaultdict更方便),它会收集日期的文件(最好使用re,但是使用{{split给出文件的名称1}}更容易):

>>> import os
>>> import re
>>> pat = r'(\d+)(?:_\d+)?_(\w+?)[\._].*'
>>> from collections import defaultdict
>>> dict_date = defaultdict(lambda : defaultdict(list))
>>> for fil in os.listdir(path):
    if os.path.isfile(os.path.join(path, fil)):
        date, animal = re.match(pat, fil).groups()
        dict_date[date][animal].append(fil)


>>> dict_date['20120807']
defaultdict(<type 'list'>, {'first': ['20120807_first_day_pic.jpg', '20120807_first_day_sheet.jpg', '20120807_first_day_sheet2.jpg']})

然后,对于每个日期,创建一个子文件夹并在那里复制相应的文件:

>>> from shutil import copyfile
>>> for date in dict_date:
        for animal in dict_date[date]:
        try:
            os.makedirs(os.path.join(path, date, animal))
        except os.error:
            pass
        for fil in dict_date[date][animal]:
            copyfile(os.path.join(path, fil), os.path.join(path, date, animal, fil))

编辑:考虑了OP的新要求,以及Khalid的评论。

答案 2 :(得分:0)

import os, shutil

src_path = "D:\\Testing\\Src\\"
dest_path = "D:\\Testing\\Dest\\"

for file in os.listdir(src_path):
    if not os.path.isdir(dest_path + file.split("-")[0]):
       os.mkdir(dest_path + file.split("-")[0])
shutil.copy(src_path + file, dest_path + file.split("-")[0])

答案 3 :(得分:0)

正则表达式:) 如何尝试将文件名与

匹配
pattern=r'(?P<filedate>(?P<year>\d{4})(?P<month>\d{2})(?P<day>\d{2}))\_(?P<bloodyrestofname>.*)'

可以从比赛中的相应命名组中检索完整的日期,年份等。