如何将非阻塞stderr捕获添加到线程popen的

时间:2015-05-11 08:22:13

标签: python multithreading subprocess blocking stderr

我有一个用于备份和加密mysqldump文件的python 3脚本,并且我对一个加密后的67gb数据库存在特殊问题。压缩。 mysqldump正在输出错误代码3,因此我想捕获实际的错误消息,因为这可能意味着一些事情。 随机的是备份文件的大小合适,所以不确定错误的含义。它在这个数据库上工作过一次......

代码如下所示,我非常感谢有关如何在返回代码为p1和p2时只返回0的情况下添加stderr的非阻塞捕获的一些帮助。

另外,如果我做任何明显错误的事情,请告诉我,因为我想确保这是一个可靠的过程。它在15gb压缩下的数据库上工作正常。

def dbbackup():
    while True:
        item = q.get()
        #build up folder structure, daily, weekly, monthy & project
        genfile = config[item]['DBName'] + '-' + dateyymmdd + '-'
        genfile += config[item]['PubKey'] + '.sql.gpg'
        if os.path.isfile(genfile):
            syslog.syslog(item + ' ' + genfile + ' exists, removing')
            os.remove(genfile)
        syslog.syslog(item + ' will be backed up as ' + genfile)
        args = ['mysqldump', '-u', config[item]['UserNm'],
                '-p' + config[item]['Passwd'], '-P', config[item]['Portnu'],
                '-h', config[item]['Server']]
        args.extend(config[item]['MyParm'].split())
        args.append(config[item]['DBName'])
        p1 = subprocess.Popen(args, stdout=subprocess.PIPE)
        p2 = subprocess.Popen(['gpg', '-o', genfile, '-r',
                               config[item]['PubKey'], '-z', '9', '--encrypt'], stdin=p1.stdout)
        p2.wait()
        if p2.returncode == 0:
            syslog.syslog(item + ' encryption successful')
        else:
            syslog.syslog(syslog.LOG_CRIT, item + ' encryption failed '+str(p2.returncode))
            p1.terminate()
        p1.wait()
        if p1.returncode == 0:
        #does some uploads of the file etc..
        else:
            syslog.syslog(syslog.LOG_CRIT, item + ' extract failed '+str(p1.returncode))
        q.task_done()


def main():
    db2backup = []
    for settingtest in config:
            db2backup.append(settingtest)
    if len(db2backup) >= 1:
        syslog.syslog('Backups started')
        for database in db2backup:
            q.put(database)
            syslog.syslog(database + ' added to backup queue')
        q.join()
        syslog.syslog('Backups finished')


q = queue.Queue()
config = configparser.ConfigParser()
config.read('backup.cfg')
backuptype = 'daily'
dateyymmdd = datetime.datetime.now().strftime('%Y%m%d')


for i in range(2):
    t = threading.Thread(target=dbbackup)
    t.daemon = True
    t.start()

if __name__ == '__main__':
    main()

1 个答案:

答案 0 :(得分:0)

简化您的代码:

  • 避免不必要的全局变量,将参数传递给相应的函数
  • 避免重新实现线程池(这会损害可读性,并且会错过多年积累的便利功能)。

捕获stderr的最简单方法是使用stderr=PIPE.communicate()(阻止调用):

#!/usr/bin/env python3
from configparser import ConfigParser
from datetime import datetime
from multiprocessing.dummy import Pool
from subprocess import Popen, PIPE

def backup_db(item, conf): # config[item] == conf
    """Run `mysqldump ... | gpg ...` command."""
    genfile = '{conf[DBName]}-{now:%Y%m%d}-{conf[PubKey]}.sql.gpg'.format(
                conf=conf, now=datetime.now())
    # ...
    args = ['mysqldump', '-u', conf['UserNm'], ...]
    with Popen(['gpg', ...], stdin=PIPE) as gpg, \
         Popen(args, stdout=gpg.stdin, stderr=PIPE) as db_dump:
        gpg.communicate() 
        error = db_dump.communicate()[1]
    if gpg.returncode or db_dump.returncode:
        error

def main():
    config = ConfigParser()
    with open('backup.cfg') as file: # raise exception if config is unavailable
        config.read_file(file)
    with Pool(2) as pool:
        pool.starmap(backup_db, config.items())

if __name__ == "__main__":
    main()

注意:如果db_dump.terminate()过早死亡,则无需致电gpgmysqldump在尝试向关闭的gpg.stdin写入内容时死亡。

如果配置中有大量商品,那么您可以使用pool.imap()代替pool.starmap()the call should be modified slightly)。

对于健壮性,请包装backup_db()函数以捕获并记录所有异常。