Limiting a Counter collection in Python

Date: 2017-06-07 18:32:14

Tags: python django python-3.x counter python-collections

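I am currently pulling data from a web API and trying to filter certain values based on the following condition: count how many times a user appears between two given dates, but only for purchases of the items raspberry pi, banana pi and raspberry pi2, and not for the item raspberry pi3.

The JSON object I receive has the following structure: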

[{
"user_id": "0001",
"CreatedOn": "2017-02-16 15:54:48",
"item": "raspberry pi",
"VIP": "YES",
"Vendor_CODE": "XYZ12345"
},
{
"user_id": "0001",
"CreatedOn": "2017-02-15 13:49:16",
"item": "raspberry pi2",
"VIP": "YES",
"Vendor_CODE": "XYZ67890"
},
{
"user_id": "0001",
"CreatedOn": "2017-02-10 15:54:48",
"item": "raspberry pi",
"VIP": "YES",
"Vendor_CODE": "RST171820"
},
{
"user_id": "0001",
"CreatedOn": "2017-01-01 21:51:13",
"item": "raspberry pi3",
"VIP": "YES",
"Vendor_CODE": "XOL002321"
},
{
"user_id": "0005",
"CreatedOn": "2017-01-30 17:34:18",
"item": "raspberry pi",
"VIP": "YES",
"Vendor_CODE": "RST171820"
},
{
"user_id": "0005",
"CreatedOn": "2017-05-30 09:04:08",
"item": "banana pi",
"VIP": "YES",
"Vendor_CODE": "ITI342027"
}]

At the moment I have the following code, which counts how many times a user appears between two given dates.

from django.shortcuts import render
from django.http import JsonResponse
from rest_framework.views import APIView
from rest_framework.response import Response
from collections import Counter
from datetime import datetime, timedelta
import json, urllib.request, dateutil.parser, urllib.parse

# Request a response from the Web API
def get_data(request, *args, **kwargs):
    # Date window: from 7 days ago to 1 day ago
    start_date_week = datetime.now() - timedelta(days=7)
    end_date_week = datetime.now() - timedelta(days=1)

    with urllib.request.urlopen("http://10.61.202.98:8081/T/ansdb/api/rows/dev/ect", timeout=15) as url:
        response_data = json.loads(url.read().decode())

    # Count the number of times each user appears between the two given dates
    count_user_01 = Counter([k['user_id'] for k in response_data if
         start_date_week < dateutil.parser.parse(k.get('CreatedOn')) < end_date_week])

My approach was to add an extra "condition" so that every item except raspberry pi3 is counted, for example:

count_user_01 = Counter([k['user_id'] for k in response_data if
     start_date_week < dateutil.parser.parse(k.get('CreatedOn')) < end_date_week] and k['item']!='raspberry pi3')

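But if I do this I get the error "'bool' object is not iterable". I think I get this error because collections.Counter does not allow me to do this.

My questions are:

  1. How can I put this extra condition into the list comprehension so that I count every item a user bought except raspberry pi3?

  2. At the moment I am counting the items bought by one particular user, but how can I count all the items bought by all users?

  3. All comments, answers and suggestions are welcome.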

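Update

For question 1, the solution was to fix the placement of the closing bracket, so that the extra condition sits inside the list comprehension: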

    count_user_01 = Counter([k['user_id'] for k in response_data if
     start_date_week < dateutil.parser.parse(k.get('CreatedOn')) < end_date_week and k['item']!='raspberry pi3'])
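To see why the bracket position matters, here is a minimal sketch. The rows are made-up sample data, and a literal True stands in for the item comparison that ended up outside the brackets. When the condition is outside, Python evaluates "list and bool", which yields the bool for a non-empty list, and Counter then tries to iterate over it.

```python
from collections import Counter

# Hypothetical sample rows, reduced to the two keys that matter here.
rows = [
    {"user_id": "0001", "item": "raspberry pi"},
    {"user_id": "0001", "item": "raspberry pi3"},
    {"user_id": "0005", "item": "banana pi"},
]

# Condition inside the brackets: it filters each row of the comprehension.
inside = Counter([r["user_id"] for r in rows if r["item"] != "raspberry pi3"])

# Condition outside the brackets: `non_empty_list and True` evaluates to True,
# so Counter receives a bool instead of a list and cannot iterate over it.
try:
    Counter([r["user_id"] for r in rows] and True)
    outside_error = None
except TypeError as exc:
    outside_error = str(exc)

print(inside)
print(outside_error)
```

So the error does not come from a restriction in Counter itself; Counter simply never receives the list.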
    

1 Answer:

Answer 0: (score: 1)

I think your brackets are misplaced. Try this:

count_user_01 = Counter([k['user_id'] for k in response_data if
     start_date_week < dateutil.parser.parse(k.get('CreatedOn')) < end_date_week and k['item']!='raspberry pi3'])
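
As a rough end-to-end sketch of the same idea, the snippet below applies the corrected filter to a few sample rows and also touches on question 2 by counting per item and per (user, item) pair. Several things here are assumptions made for illustration: the rows are hand-written in the shape of the question's JSON, the date window is fixed instead of using datetime.now(), datetime.strptime replaces dateutil.parser.parse so that only the standard library is needed, and the helper in_window and the names per_user, per_item and per_user_item are hypothetical.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample rows shaped like the API response in the question.
response_data = [
    {"user_id": "0001", "CreatedOn": "2017-02-16 15:54:48", "item": "raspberry pi"},
    {"user_id": "0001", "CreatedOn": "2017-02-15 13:49:16", "item": "raspberry pi2"},
    {"user_id": "0001", "CreatedOn": "2017-01-01 21:51:13", "item": "raspberry pi3"},
    {"user_id": "0005", "CreatedOn": "2017-01-30 17:34:18", "item": "raspberry pi"},
    {"user_id": "0005", "CreatedOn": "2017-05-30 09:04:08", "item": "banana pi"},
]

# Fixed window so the result is reproducible (the question uses datetime.now()).
start_date_week = datetime(2017, 1, 1)
end_date_week = datetime(2017, 3, 1)


def in_window(row):
    """Return True if the row's CreatedOn lies strictly inside the window."""
    created = datetime.strptime(row["CreatedOn"], "%Y-%m-%d %H:%M:%S")
    return start_date_week < created < end_date_week


# Both conditions live inside the comprehension, as in the corrected line above.
wanted = [r for r in response_data
          if in_window(r) and r["item"] != "raspberry pi3"]

# Purchases per user (question 1), per item, and per (user, item) pair (question 2).
per_user = Counter(r["user_id"] for r in wanted)
per_item = Counter(r["item"] for r in wanted)
per_user_item = Counter((r["user_id"], r["item"]) for r in wanted)

print(per_user)
print(per_item)
print(per_user_item)
```

Filtering once into a list and building several Counters from it keeps the date parsing in one place; whether counting by item or by (user, item) pair is what question 2 is after depends on the report you need.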