Question

我有以下代码：

sentiments = ['He is good', 'He is bad', 'She love her', 'She is fine with it', 'I like going outside', 'Its okay']
positive = [1,0,1,0,1,0]
negative = [0,1,0,0,0,0]
neutral = [0,0,0,1,0,1]
neutral
df = pd.DataFrame({'Sentiments':sentiments, 'Positives':positive, 'Negatives': negative, 'Neutrals':neutral})
df.head()

它创建了这个：

我只想拥有 2 列，其中 1 列带有情绪，其他带有类别，这应该是特定的情绪，即结果应该是：

<头>

情绪	类别
ABC	正面
xmy	否定
点	中立

Answer 1

试试.melt()：

x = df.melt("Sentiments", var_name="Category")
x = x[x.value != 0].drop(columns="value")
x["Category"] = x["Category"].str.replace(r"s$", "", regex=True)
print(x)

打印：

              Sentiments  Category
0             He is good  Positive
2           She love her  Positive
4   I like going outside  Positive
7              He is bad  Negative
15   She is fine with it   Neutral
17              Its okay   Neutral

Answer 2

假设只有一列值为 1（即哑元），请尝试：

>>> df.set_index("Sentiments").idxmax(axis=1).rename("Category").reset_index()
             Sentiments   Category
0            He is good  Positives
1             He is bad  Negatives
2          She love her  Positives
3   She is fine with it   Neutrals
4  I like going outside  Positives
5              Its okay   Neutrals

Answer 3

另一种方式：

df = df.set_index('Sentiments').dot(df.columns[1:]).reset_index(name = 'Category')

用 3 列重塑熊猫数据框

3 个答案: