Python:在JSON数据结构中提取属性

时间:2012-08-20 23:23:47

标签: python parsing ebay simplejson

我目前正在使用ebay public api解析ebay数据,我已经想出要解析json结构,除了一些json元素。

以下是我正在查看的JSON结构:

  {u'itemId': [u'370640300983'], u'isMultiVariationListing': [u'false'], u'globalId': [u'EBAY-US'], u'title': [u'DELL Latitude D630 Core 2 Duo 2GHz 1GB 80GB CD-RW/DVD WiFi Notebook 14" Laptop'], u'country': [u'US'], u'shippingInfo': [{u'expeditedShipping': [u'true'], u'shippingType': [u'Calculated'], u'handlingTime': [u'1'], u'shipToLocations': [u'US'], u'oneDayShippingAvailable': [u'false']}], u'galleryURL': [u'http://thumbs4.ebaystatic.com/pict/3706403009834040_1.jpg'], u'autoPay': [u'false'], u'location': [u'Saint Paul,MN,USA'], u'postalCode': [u'55114'], u'returnsAccepted': [u'true'], u'viewItemURL': [u'http://www.ebay.com/itm/DELL-Latitude-D630-Core-2-Duo-2GHz-1GB-80GB-CD-RW-DVD-WiFi-Notebook-14-Laptop-/370640300983?pt=Laptops_Nov05'], u'sellingStatus': [{u'currentPrice': [{u'@currencyId': u'USD', u'__value__': u'99.99'}], u'timeLeft': [u'P0DT0H13M10S'], u'convertedCurrentPrice': [{u'@currencyId': u'USD', u'__value__': u'99.99'}], u'bidCount': [u'4'], u'sellingState': [u'Active']}], u'paymentMethod': [u'PayPal', u'VisaMC', u'Discover'], u'primaryCategory': [{u'categoryId': [u'177'], u'categoryName': [u'PC Laptops & Netbooks']}], u'condition': [{u'conditionId': [u'3000'], u'conditionDisplayName': [u'Used']}], u'listingInfo': [{u'listingType': [u'Auction'], u'gift': [u'false'], u'bestOfferEnabled': [u'false'], u'startTime': [u'2012-08-15T23:28:05.000Z'], u'buyItNowAvailable': [u'false'], u'endTime': [u'2012-08-20T23:28:05.000Z']}]}

我正在解析的数据

370640300983
DELL Latitude D630 Core 2 Duo 2GHz 1GB 80GB CD-RW/DVD WiFi Notebook 14" Laptop
{u'@currencyId': u'USD', u'__value__': u'99.99'}

第二元素:

{u'itemId': [u'170892723100'], u'isMultiVariationListing': [u'false'], u'globalId': [u'EBAY-US'], u'title': [u'Dell Latitude D620 Laptop Core 2 Duo 2GHz  1GB Ram No HDD INCOMPLETE'], u'country': [u'US'], u'shippingInfo': [{u'expeditedShipping': [u'false'], u'handlingTime': [u'1'], u'shippingServiceCost': [{u'@currencyId': u'USD', u'__value__': u'24.0'}], u'oneDayShippingAvailable': [u'false'], u'shipToLocations': [u'US'], u'shippingType': [u'Flat']}], u'galleryURL': [u'http://thumbs1.ebaystatic.com/pict/1708927231004040_1.jpg'], u'autoPay': [u'false'], u'location': [u'Hughesville,PA,USA'], u'postalCode': [u'17737'], u'returnsAccepted': [u'true'], u'viewItemURL': [u'http://www.ebay.com/itm/Dell-Latitude-D620-Laptop-Core-2-Duo-2GHz-1GB-Ram-No-HDD-INCOMPLETE-/170892723100?pt=Laptops_Nov05'], u'sellingStatus': [{u'currentPrice': [{u'@currencyId': u'USD', u'__value__': u'20.01'}], u'timeLeft': [u'P0DT1H10M35S'], u'convertedCurrentPrice': [{u'@currencyId': u'USD', u'__value__': u'20.01'}], u'bidCount': [u'2'], u'sellingState': [u'Active']}], u'paymentMethod': [u'PayPal'], u'primaryCategory': [{u'categoryId': [u'177'], u'categoryName': [u'PC Laptops & Netbooks']}], u'condition': [{u'conditionId': [u'3000'], u'conditionDisplayName': [u'Used']}], u'listingInfo': [{u'listingType': [u'Auction'], u'gift': [u'false'], u'bestOfferEnabled': [u'false'], u'startTime': [u'2012-08-18T00:25:30.000Z'], u'buyItNowAvailable': [u'false'], u'endTime': [u'2012-08-21T00:25:30.000Z']}]}

第二个元素的解析元素:

170892723100
Dell Latitude D620 Laptop Core 2 Duo 2GHz  1GB Ram No HDD INCOMPLETE
{u'@currencyId': u'USD', u'__value__': u'20.01'}

如果您在我的代码的两次迭代中都看到我无法获得u':元素解析并从数据结构中获取实际价格提取:

基本上代替{u'@currencyId': u'USD', u'__value__': u'20.01'}我希望20.01作为解析后的值。我应该使用正则表达式来解析它还是有更好的方法来做它?

Here is my code:
  data = json.load(urllib2.urlopen(url))
  #print data
  for item in data['findItemsByKeywordsResponse'][0]['searchResult'][0]['item']:
    print item
    for itemId in item['itemId']:
      print itemId
    for title in item['title']:
      print title
    for price in item['sellingStatus'][0]['currentPrice']:
      print price
    print '\n'

2 个答案:

答案 0 :(得分:2)

这样做:

for price in item['sellingStatus'][0]['currentPrice']:
      print float(price["__value__"])

当然,使用花车赚钱是一个可怕的想法,所以你应该使用decimal模块:

from decimal import Decimal

for price in item['sellingStatus'][0]['currentPrice']:
      print Decimal(price["__value__"])

或者将其解析为以美分为单位的整数价格:

for price in item['sellingStatus'][0]['currentPrice']:
      dollars, cents = price["__value__"].split(".")
      print int(dollars) * 100 + int(cents)

答案 1 :(得分:1)

(以上评论):

尝试更改:

for price in item['sellingStatus'][0]['currentPrice']:
    print price

for price in item['sellingStatus'][0]['currentPrice']:
    print price['__value__']