Question

我正在学习机器学习并尝试使用Iris数据集编写自己的代码。

我用pandas打开数据集，然后我试图在我的数据集中传递一个字典，将最后一列从字符串转换为Int但是在尝试这个时：

dataset.columns = ['sepal length', 'sepal width', 'petal length', 'petal width', 'class']

class_mapping = {'Iris-setosa': 1, 'Iris-versicolor': 2, 'Iris-virginica': 3}
for classe in dataset :
    classe['class'] = classe['class'].map(class_mapping)

PyCharm将此返回给我：TypeError：字符串索引必须是整数

Answer 1

最后我设法解决了这个问题。我没有使用for循环，而是使用了这个：

dataset ['class'] = dataset ['class']. map (class_mapping)

我不需要for循环因为.map为我迭代。

Answer 2

我遇到了一些与“ .map”的用法有关的代码，如下所示：

  def get_one_shot_iterator(self):
    """Gets an iterator that iterates across the dataset once.

    Returns:
      An iterator of type tf.data.Iterator.
    """

    files = self._get_all_files()

    dataset = (
        tf.data.TFRecordDataset(files, num_parallel_reads=self.num_readers)
        .map(self._parse_function, num_parallel_calls=self.num_readers)
        .map(self._preprocess_image, num_parallel_calls=self.num_readers))

似乎在这里两次使用了map函数，希望这会有所帮助。

Python尝试使用.map（）

2 个答案: