Question

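I'm new to Python and I'm stuck on something. I wrote code that works well: it logs in to a web page, scrapes the information I want from it, and prints it.

The thing is, I want to loop it, because I have more web pages with the same login and the same information to scrape. I have the web pages in a list (it's in Excel, but I can turn it into a plain text file if that's easier). So what I want to do is take the first URL from the list, run it through the code, then loop back to the top to take the second URL, and so on until it has gone through the whole list of addresses.

Can anyone help me with this, or point me in the right direction?

Many thanks, Thomas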

Answer 1

with open('urls.txt', 'r') as file:
    for link in file:
        link = link.strip()  # each line ends with a newline; remove it
        print(link)  # placeholder: do what you want with the link

read more here
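
As a small sketch of the same idea, the file reading can be wrapped in a helper that returns a clean list of URLs. The name load_urls and the demo file name are assumptions made for illustration, not part of the original answer, and the helper assumes the file holds one URL per line.

```python
from pathlib import Path


def load_urls(path):
    """Return the non-empty lines of a text file, stripped of whitespace.

    Hypothetical helper: assumes the file holds one URL per line.
    """
    urls = []
    with open(path, 'r', encoding='utf-8') as file:
        for line in file:
            link = line.strip()  # drop the trailing newline and stray spaces
            if link:             # ignore blank lines
                urls.append(link)
    return urls


if __name__ == '__main__':
    # Demo with a throwaway file so the sketch runs on its own.
    demo = Path('urls_demo.txt')
    demo.write_text('https://example.com/a\n\nhttps://example.com/b\n',
                    encoding='utf-8')
    print(load_urls(demo))
    demo.unlink()
```

Returning a list rather than looping inside the with block keeps the file open only as long as needed, which may matter if each page takes a while to scrape.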

Answer 2

You can turn your logic into a function, then iterate over the list of sites and call that function on each one.

# if you want to use a spreadsheet
import xlrd  # third-party package; note that xlrd 2.0+ reads only legacy .xls files
path = 'pathtoexcel'
wb = xlrd.open_workbook(path)
sheet = wb.sheet_by_index(0)
sheet.cell_value(0, 0)  # The first value in the first column; change this to the location of the URLs in your spreadsheet

for i in range(sheet.nrows):
  your_logic(sheet.cell_value(i, 0))

# ==================================

# if you decide to use a text file
with open('path_to_file.txt', 'r') as f:
  urls = f.readlines()

for url in urls:
  your_logic(url.strip())  # Where 'your_logic' has your logic for scraping; strip() removes the trailing newline
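
To make the "function plus loop" pattern concrete, here is a minimal sketch. The names scrape_all and fake_scrape are hypothetical; fake_scrape only stands in for the existing login-and-scrape code, which the question does not show. The try/except is an added assumption, included so that one failing page does not stop the rest of the list.

```python
def scrape_all(urls, scrape):
    """Call scrape(url) for every URL and collect the results.

    Both names are hypothetical: `scrape` stands in for your own
    login-and-scrape function, which takes a URL and returns its data.
    """
    results = {}
    failures = {}
    for raw in urls:
        url = raw.strip()  # readlines() keeps the trailing newline
        if not url:
            continue       # skip blank lines
        try:
            results[url] = scrape(url)
        except Exception as error:  # keep going if a single page fails
            failures[url] = error
    return results, failures


def fake_scrape(url):
    """Placeholder for the real scraping logic (no network access)."""
    if 'bad' in url:
        raise ValueError('could not scrape ' + url)
    return len(url)


results, failures = scrape_all(
    ['https://example.com/a\n', '\n', 'https://example.com/bad\n'],
    fake_scrape,
)
print(results)
print(failures)
```

In real use, the list would come from f.readlines() or the spreadsheet column, and fake_scrape would be replaced by the function that wraps the existing scraping code.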

Python - Import URLs from a text file and loop while the code runs

2 answers: