Question

我正在尝试使用urllib2编写一个python脚本，以便从here

下载带有标题的所有图像

from urllib import urlopen
from re import findall
from Tkinter import *
import urllib, cStringIO
import random 

photo_window=Tk()
photo_window.title('Photo Browser')
import base64
url = "https://www.flickr.com/photos/flickr/galleries/72157652879582045/"
regex = '<img src="([^"]+)".*>'
photofile=urllib.urlopen(url)
raw_data=photofile.read()
pattern = re.compile(regex)
download=re.findall(pattern,raw_data)
t=0
for x in download:
    urllib.urlretrieve(x, "local-filename-%d.gif" %t )
    t+=1

这允许我下载所有图像，但我仍在努力只获取完整图像（不是缩略图）并获取字幕

Answer 1

对不起，我无法发表评论，但我会... ...

对于＆＃34;大尺寸＆＃34;图片，当你有这样一个集团时：

<img class="pc_img" width="640" border="0" height="426" data-thumbdata="" alt="Falls in the bush - Zambia" src="https://farm3.staticflickr.com/2035/2399025352_d96e82fbe1_z.jpg">

图片src（在Flickr上）作为结尾(_z)的一个小属性，如果你改成它，让我们说_b你有一个更大的图片。这一切都解释为here

所以你可以做的就是解析图片来源的地址，并用_o替换_z作为原始尺寸！

Python脚本从flickr URL下载所有图像和标题

1 个答案: