Question

我正在尝试在pytorch中加载多模式数据（例如文本和图像）以进行图像分类。我不知道如何像下面的代码那样同时加载它们。

 def __init__(self, img_path, txt_path, transform=None, loader=default_loader):

def __len__(self):
    return len(self.img_name)

def __getitem__(self, item):

有人可以帮助我吗？

Answer 1

在__getitem__中，您可以使用字典或元组来表示数据样本。在训练的后期，当您使用数据集创建数据加载器时，pytorch将自动创建一批字典或元组。

如果您想以更多不同的方式创建样本，请在pytorch中签出collate_fn。

Answer 2

getitem（self，item）方法将帮助您完成此任务。

例如：

def __getitem__(self, item):  # item can be thought as an index

    text = textList[item]  # textList would be a list containing the text you want to input into the model for element 'item'
    img = imgList[image]  # imgList would be a list containing the images you want to input into the model for element 'item'

    input = [text, img]  
    y = labels[item]  # labels would be a list containing the label for the input of the text and img. This is your target.

    return input, y

用pytorch加载多模式数据

2 个答案: