Python,尝试通过URL获取文件扩展名

时间:2013-07-22 06:45:42

标签: python mime-types

我正在为蓬松的服务做一个图像抓取工作;但是,每当我生成随机URL并尝试将其验证为.PNG图像时,都会生成错误。我今天早些时候刚刚开始学习Python语言,所以你可以说我对此很新!

生成的错误:

Traceback (most recent call last):
  File "run.py", line 19, in <module>
    extension = guess_extension(guess_type(url))
  File "C:\Python33\lib\mimetypes.py", line 320, in guess_extension
    return _db.guess_extension(type, strict)
  File "C:\Python33\lib\mimetypes.py", line 189, in guess_extension
    extensions = self.guess_all_extensions(type, strict)
  File "C:\Python33\lib\mimetypes.py", line 168, in guess_all_extensions
    type = type.lower()
AttributeError: 'tuple' object has no attribute 'lower'

运行的代码:

#!/usr/bin/env python
import sys
import urllib
from mimetypes import guess_type, guess_extension
from random import choice

randoms = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z',
           'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I',' J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z',
           '0', '1', '2', '3', '4', '5', '6', '7', '8', '9'];

downloads = 1;

#the number of files we want to download
target = int(sys.argv[1]);

while downloads <= target:
    string = choice(randoms)+choice(randoms)+choice(randoms)+choice(randoms)+choice(randoms)
    url = 'http://puu.sh/'+string

    print(str(downloads)+': '+string)

    #download
    extension = guess_extension(guess_type(url))
    print(extension)

    #urllib.request('http://puu.sh/'+string, string+'.png')
    downloads += 1

有关此错误试图告诉我的任何想法?感谢。

2 个答案:

答案 0 :(得分:6)

guess_type()会返回元组(type,encoding),而guess_extension()会接受单个参数type

该行

extension = guess_extension(guess_type(url))

调用guess_type,然后将其返回值(元组)传递给guess_extension。您应该只传递元组的第一个元素(type

extension = guess_extension(guess_type(url)[0])

答案 1 :(得分:1)

我可以建议将来遇到这样的问题你print任何给你悲伤的变数。

所以在你的情况下,如果你做了print (type) python会给你一个元组,你就可以看到你在哪里找到的URL然后完成{{1} }