Question

我使用了以下代码：

import pandas as pd
pandas_bigram = pd.DataFrame(bigram_data)
print pandas_bigram

我输出如下

                                0
0                     ashoka -**0
1                 - wikipedia,**1
2               wikipedia, the**2
3                     the free**2
4            free encyclopedia**2
5          encyclopedia ashoka**1
6                  ashoka from**2
7              from wikipedia,**1
8               wikipedia, the**2
9                     the free**2
10           free encyclopedia**2

我的问题是如何拆分此数据框。所以，我将获得两行数据。这里的数据用“**”分隔。

Answer 1

import pandas as pd

df= [" ashoka -**0","- wikipedia,**1","wikipedia, the**2"]
df=pd.DataFrame(df)

print(df)
                   0
0        ashoka -**0
1    - wikipedia,**1
2  wikipedia, the**2

使用split函数：方法split（）返回字符串中所有单词的列表，使用str作为分隔符（如果未指定则拆分所有空格），可选地将拆分数限制为num

df1 = pd.DataFrame(df[0].str.split('*',1).tolist(),
                                       columns = ['0','1'])

print(df1)

                0   1
0        ashoka -  *0
1    - wikipedia,  *1
2  wikipedia, the  *2

如何在python中拆分数据框中的数据？

1 个答案: