应用错误收集

我正在尝试运行此代码：

MaptoChunk = basketchunks.mapPartitions(lambda x:Apriori(x,SupportThreshold))
for x in MaptoChunk.collect():
    print x

我有一个名为Apriori的功能，如下所示：

def Apriori(baskets,Support):

Itemsets=baskets.map(lambda x:x[1]) 
......(To count frequent itesets)
return(freqsets)  #this is an rdd

但是当我运行代码时，它给了我一个错误：

PicklingError: Could not serialize object: Exception: It appears that you 
are attempting to reference SparkContext from a broadcast variable, action, 
or transformation. SparkContext can only be used on the driver, not in code 
that it run on workers. For more information, see SPARK-5063.

有人能告诉我如何保存每个分区的地图值：我的代码有什么问题

执行mapPartition后无法收集Rdd：Pickling Error

0 个答案: