在PyTorch中使用Dataloader迭代数据集时出现IndexError

时间:2018-12-17 07:17:22

标签: machine-learning deep-learning computer-vision pytorch

我在PyTorch 0.2中使用Dataloader迭代了数据集,如下所示:

dataloader = torch.utils.data.DataLoader(...)
data_iter = iter(dataloader)
data = data_iter.next()

但是出现IndexError。

Traceback (most recent call last):
  File "main.py", line 193, in <module>
    data_target = data_target_iter.next()
  File "/usr/local/lib/python3.5/dist-packages/torch/utils/data/dataloader.py", line 201, in __next__
    return self._process_next_batch(batch)
  File "/usr/local/lib/python3.5/dist-packages/torch/utils/data/dataloader.py", line 221, in _process_next_batch
    raise batch.exc_type(batch.exc_msg)
IndexError: Traceback (most recent call last):
  File "/usr/local/lib/python3.5/dist-packages/torch/utils/data/dataloader.py", line 40, in _worker_loop
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "/usr/local/lib/python3.5/dist-packages/torch/utils/data/dataloader.py", line 40, in <listcomp>
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "/home/asr4/zhuminxian/adversarial/code/dataset/data_loader.py", line 33, in __getitem__
    return self.X_train[idx], self.y_train[idx]
IndexError: index 4196 is out of bounds for axis 0 with size 4135

我想知道为什么索引超出范围。是Pytorch的错误吗?

我试图再次运行我的代码,引发了同样的错误,但是迭代次数不同,索引越界不同。

1 个答案:

答案 0 :(得分:0)

我的猜测是您的data.Dataset.__len__没有正确地过载,并且实际的len(dataloader.dataset)返回的数字大于len(self.X_train)
'/home/asr4/zhuminxian/adversarial/code/dataset/data_loader.py'中检查基础数据集的实现。