Question

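I'm very new to data science and programming. I'm working with the movies metadata dataset from Kaggle and trying to inspect the column names and set an index. When I open the CSV in Jupyter, every column name is displayed with a 'u' prefix; for example, the title column shows up as u'title'. In the video I'm following, the column names are displayed without that prefix. When I try to set that same column as the index, it raises a KeyError.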

I tried this using Python 2:

df.columns
Index([u'adult', u'belongs_to_collection', u'budget', u'genres', u'homepage',
       u'id', u'imdb_id', u'original_language', u'original_title', u'overview',
       u'popularity', u'poster_path', u'production_companies',
       u'production_countries', u'release_date', u'revenue', u'runtime',
       u'spoken_languages', u'status', u'tagline', u'title', u'video',
       u'vote_average', u'vote_count'],
      dtype='object')

#Change the index to the title
df = df.set_index('title')

Error:

pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas\_libs\index.pyx in pandas._libs.index.IndexEngine.get_loc()

pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

pandas\_libs\hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item()

KeyError: 'title'
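
For reference, here is a minimal sketch of one possible cause. It is an assumption, not something confirmed by the post. In Python 2 the u'' prefix only marks a unicode string, and u'title' == 'title', so the prefix by itself should not break the lookup. A common way to get this KeyError is running the set_index cell a second time: after the first run, 'title' has already moved from the columns into the index. The small DataFrame below is a made-up stand-in for the Kaggle file.

```python
import pandas as pd

# Hypothetical stand-in for the Kaggle movies metadata CSV;
# only the column names matter for this illustration.
df = pd.DataFrame({
    "title": ["Toy Story", "Jumanji"],
    "budget": [30000000, 65000000],
})

# The column is present before the first call.
title_present_before = "title" in df.columns

# set_index moves the column into the index (drop=True is the default),
# so afterwards 'title' is no longer listed in the columns.
indexed = df.set_index("title")
title_present_after = "title" in indexed.columns

# Running the same call again on the already-indexed frame raises KeyError.
try:
    indexed.set_index("title")
    second_call_failed = False
except KeyError:
    second_call_failed = True

# A guard that makes the notebook cell safe to re-run.
if "title" in indexed.columns:
    indexed = indexed.set_index("title")

print(title_present_before, title_present_after, second_call_failed)
```

If that is what happened, re-reading the CSV and running the set_index cell once should work. If 'title' is still listed in df.columns when the error occurs, the cause is something else, and it may be worth printing repr() of each column name to look for stray whitespace or hidden characters.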

Pandas: column names displayed incorrectly in the DataFrame, and set_index raises a KeyError

0 answers: