Python Celery group() - TypeError:[...]后面的参数必须是映射,不长

时间:2015-06-23 07:41:05

标签: django celery

我试图运行芹菜(3.1.17)任务,在一个组中执行更多任务,但我总是遇到错误。这就是我设置代码的方式:

from celery import task, group

@task
def daily_emails():

    [...]

    all_tasks = []

    for chunk in range(0, users.count(), 1000):
        some_users = users[chunk:chunk+1000]
        all_tasks.append(write_email_bunch.subtask(some_users, execnum))

    job = group(all_tasks)
    # result = job.apply_async()
    # job.get()
    result = job.delay()
    print result
    results = result.join()
    print results

    print "done writing email tasks"
    count = sum(results)
    print count


@task
def write_email_bunch(some_users, execnum):

    [...]

    return len(some_users) - skipped_email_count

这是输出:

<GroupResult: 3d766c85-21af-4ed0-90cb-a1ca2d281db1 [69527252-8468-4358-9328-144f727f372b, 6d03d86e-1b69-4f43-832e-bd27c4dfc092, 1d868d1b-b502-4672-9895-430089e9532e]>
Traceback (most recent call last):
  File "send_daily_emails.py", line 8, in <module>
    daily_emails()
  File "/var/www/virtualenvs/nt_dev/local/lib/python2.7/site-packages/celery/app/task.py", line 420, in __call__
    return self.run(*args, **kwargs)
  File "/var/www/nt_dev/nt/apps/emails/tasks.py", line 124, in daily_emails
    results = result.join()
  File "/var/www/virtualenvs/nt_dev/local/lib/python2.7/site-packages/celery/result.py", line 642, in join
    interval=interval, no_ack=no_ack,
  File "/var/www/virtualenvs/nt_dev/local/lib/python2.7/site-packages/celery/result.py", line 870, in get
    raise self.result
TypeError: write_email_bunch() argument after ** must be a mapping, not long

所以我得到了一个GroupResult但不知何故我无法加入它或进一步处理它。 当我使用 write_email_bunch.s(some_users,execnum)时,我得到了这个例外:

  File "/var/www/virtualenvs/nt_dev/local/lib/python2.7/site-packages/celery/result.py", line 870, in get
    raise self.result
TypeError: 'tuple' object is not callable

我如何等待所有组任务完成后继续? job.get()给了我这个例外:

TypeError: get expected at least 1 arguments, got 0

2 个答案:

答案 0 :(得分:3)

subtask需要一个args元组,一个kwargs和任务选项的字典,所以它应该像这样调用:

    all_tasks.append(write_email_bunch.subtask((some_users, execnum)))

请注意,我们正在传递一个包含args

的元组

此外,您不应该等待任务内的任务 - 这可能会导致死锁。在这种情况下,我认为daily_emails不需要是芹菜任务 - 它可以是一个常规函数,它创建一个canvas对象并运行apply async。

def daily_emails():

    all_tasks = []

    for chunk in range(0, users.count(), 1000):
        some_users = users[chunk:chunk+1000]
        all_tasks.append(write_email_bunch.subtask(some_users, execnum))

    job = group(all_tasks)
    result = job.apply_async()
    return result.id

答案 1 :(得分:2)

除了其他答案,你可以在这里使用chunkshttp://docs.celeryproject.org/en/latest/userguide/canvas.html#chunks

@app.task
def daily_emails():
    return write_email.chunks(users, 1000).delay()

@task
def write_email(user):
    [...]

如果一次获得多个对象,手动执行此操作可能会有所帮助 从数据库很重要。您还应该考虑模型对象将在此处序列化,以避免您只能发送pk并在任务中重新获取模型,或者发送您关心的字段(如电子邮件地址或发送该电子邮件所需的任何内容)给用户)。