Question

我编写了一个使用nltk的FreqDist模块的脚本，然后将其转换为pandas数据帧。代码段如下：

    .......
    import unicodedata
    str2 = unicodedata.normalize('NFKD', str1).encode('ascii','ignore')    
    words = nltk.tokenize.word_tokenize(str2)

    fdist = nltk.FreqDist(words)
    df = pd.DataFrame.from_dict(fdist, orient='index').reset_index()
    df = df.rename(columns={'index':'query_word', 0:'count'})
    df2 = df.sort_values(['count'], ascending=[False])

现在，我正在尝试使用plotly绘制它，我的代码片段如下所示：

import plotly.plotly as py
import plotly.graph_objs as go

data = [go.Bar(x= df.query_word, y= df.count)]
py.iplot(data, filename='basic-bar')

当我运行这部分时，我收到以下错误：

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-15-87d0c9af254b> in <module>()
----> 1 py.iplot(data, filename='basic-bar')

/usr/local/lib/python2.7/dist-packages/plotly/plotly/plotly.pyc in iplot(figure_or_data, **plot_options)
    150     if 'auto_open' not in plot_options:
    151         plot_options['auto_open'] = False
--> 152     url = plot(figure_or_data, **plot_options)
    153 
    154     if isinstance(figure_or_data, dict):

/usr/local/lib/python2.7/dist-packages/plotly/plotly/plotly.pyc in plot(figure_or_data, validate, **plot_options)
    239 
    240     plot_options = _plot_option_logic(plot_options)
--> 241     res = _send_to_plotly(figure, **plot_options)
    242 
    243     if res['error'] == '':

/usr/local/lib/python2.7/dist-packages/plotly/plotly/plotly.pyc in _send_to_plotly(figure, **plot_options)
   1407     fig = tools._replace_newline(figure)  # does not mutate figure
   1408     data = json.dumps(fig['data'] if 'data' in fig else [],
-> 1409                       cls=utils.PlotlyJSONEncoder)
   1410     credentials = get_credentials()
   1411     validate_credentials(credentials)

/usr/lib/python2.7/json/__init__.pyc in dumps(obj, skipkeys, ensure_ascii, check_circular, allow_nan, cls, indent, separators, encoding, default, sort_keys, **kw)
    249         check_circular=check_circular, allow_nan=allow_nan, indent=indent,
    250         separators=separators, encoding=encoding, default=default,
--> 251         sort_keys=sort_keys, **kw).encode(obj)
    252 
    253 

/usr/local/lib/python2.7/dist-packages/plotly/utils.pyc in encode(self, o)
    144 
    145         # this will raise errors in a normal-expected way
--> 146         encoded_o = super(PlotlyJSONEncoder, self).encode(o)
    147 
    148         # now:

/usr/lib/python2.7/json/encoder.pyc in encode(self, o)
    205         # exceptions aren't as detailed.  The list call should be roughly
    206         # equivalent to the PySequence_Fast that ''.join() would do.
--> 207         chunks = self.iterencode(o, _one_shot=True)
    208         if not isinstance(chunks, (list, tuple)):
    209             chunks = list(chunks)

/usr/lib/python2.7/json/encoder.pyc in iterencode(self, o, _one_shot)
    268                 self.key_separator, self.item_separator, self.sort_keys,
    269                 self.skipkeys, _one_shot)
--> 270         return _iterencode(o, 0)
    271 
    272 def _make_iterencode(markers, _default, _encoder, _indent, _floatstr,

/usr/local/lib/python2.7/dist-packages/plotly/utils.pyc in default(self, obj)
    211             except NotEncodable:
    212                 pass
--> 213         return json.JSONEncoder.default(self, obj)
    214 
    215     @staticmethod

/usr/lib/python2.7/json/encoder.pyc in default(self, o)
    182 
    183         """
--> 184         raise TypeError(repr(o) + " is not JSON serializable")
    185 
    186     def encode(self, o):

TypeError: <bound method DataFrame.count of                     query_word  count
0                          1,2      1
1                         four      1
2                       prefix      1
..                      ......      ..
..                      ......      ..
3                    francesco      1

据我所知，其他SF问题的主题是“不是json serializable ”，而且从错误信息来看，这是编码的问题吗？而不是数据类型。

因为，当我print type(df2.query_word)时，<class 'pandas.core.series.Series'>。那么如何制作一系列可序列化？由于回溯未显示任何编码错误，例如here或here。

简单的转身是什么？发表这个问题的主要内容是要了解这是数据框架，数据，ipython还是情节问题。

Pandas Dataframe以图形方式显示TypeError

0 个答案: