How do I create a ClassTag to build a Spark broadcast variable in Java?

Time: 2018-10-06 11:50:03

Tags: java apache-spark

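I need to create a broadcast variable inside my Java function. It should broadcast a list of strings:

Broadcast<List<String>> broadcastSp = sc.broadcast(T value, ClassTag<T> evidence);

The T value is "my_list" (a List<String>), but the problem is how to create the second argument, ClassTag<T> evidence.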

2 Answers:

Answer 0: (score: 1)

That should not be necessary. When using Java, you should not use org.apache.spark.SparkContext, which is designed for Scala. Use org.apache.spark.api.java.JavaSparkContext instead.
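If only a Scala SparkContext is available (for example, one handed over by other code), the two usual ways out can be sketched as below. This is a minimal sketch, assuming Spark and the Scala library are on the classpath; the class name `ClassTagFromJava` and the helper names `viaWrapper` and `viaClassTag` are hypothetical. Because of type erasure, the tag is built from the raw `List.class`.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.SparkContext;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.broadcast.Broadcast;

import scala.reflect.ClassTag;
import scala.reflect.ClassTag$;

public class ClassTagFromJava {

    // Option 1: wrap the Scala context; the Java wrapper supplies the ClassTag itself.
    static Broadcast<List<String>> viaWrapper(SparkContext scalaSc, List<String> myList) {
        return new JavaSparkContext(scalaSc).broadcast(myList);
    }

    // Option 2: build the ClassTag by hand from the erased runtime class.
    static Broadcast<List<String>> viaClassTag(SparkContext scalaSc, List<String> myList) {
        ClassTag<List<String>> tag = ClassTag$.MODULE$.apply(List.class);
        return scalaSc.broadcast(myList, tag);
    }

    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("classtag").setMaster("local[*]");
        SparkContext scalaSc = new SparkContext(conf);
        try {
            // Sample data, standing in for "my_list" from the question.
            List<String> myList = new ArrayList<>(Arrays.asList("a", "b", "c"));
            System.out.println(viaWrapper(scalaSc, myList).value());
            System.out.println(viaClassTag(scalaSc, myList).value());
        } finally {
            scalaSc.stop();
        }
    }
}
```

Option 1 is usually the simpler choice, since it keeps all Scala-specific types out of the Java code.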

Quoting the official documentation:

import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.SparkConf;

SparkConf conf = new SparkConf().setAppName("broadcast").setMaster("local[*]");
JavaSparkContext sc = new JavaSparkContext(conf);

Its broadcast method does not require a ClassTag:

Broadcast<int[]> broadcastVar = sc.broadcast(new int[] {1, 2, 3});
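For the list of strings in the question, the same call can be sketched as follows. This is a minimal sketch, assuming Spark is on the classpath and a local master is acceptable; the class name `BroadcastListExample`, the helper `countAllowed`, and the sample data are hypothetical and only illustrate reading the broadcast value inside a transformation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.broadcast.Broadcast;

public class BroadcastListExample {

    // Broadcasts `allowed` and counts how many elements of `words` occur in it.
    static long countAllowed(JavaSparkContext sc, List<String> allowed, List<String> words) {
        // Copy into an ArrayList so the broadcast value is a plain serializable list.
        Broadcast<List<String>> broadcastSp = sc.broadcast(new ArrayList<>(allowed));
        JavaRDD<String> rdd = sc.parallelize(words);
        // Executors read the shared list through value().
        return rdd.filter(w -> broadcastSp.value().contains(w)).count();
    }

    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("broadcast").setMaster("local[*]");
        try (JavaSparkContext sc = new JavaSparkContext(conf)) {
            List<String> myList = Arrays.asList("a", "b");
            List<String> words = Arrays.asList("a", "c", "b", "a");
            System.out.println(countAllowed(sc, myList, words));
        }
    }
}
```

The only Spark-specific step is `sc.broadcast(value)`; the element type `List<String>` is inferred from the argument, so no type evidence has to be passed.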

Answer 1: (score: 0)

Here is my function:

public static JavaSparkContext sc;
public static Broadcast<List<String>> broadcastP;

public static void main(String[] args) throws Exception {
    sc = new JavaSparkContext("local", "test");
    // ...
}

private static List<String> fFtItemSets(JavaRDD<String> base_initiale) throws Exception {
    List<String> kMinusOneSets;
    List<String> k_sets;

    int i = 1;
    k_sets = remplir_ksets_intiale(base_initiale);
    System.out.println(k_sets);
    List<String> k_sets1 = k_sets;
    int NB = k_sets.size();
    while (NB > 1) {
        kMinusOneSets = k_sets;
        k_sets = Jointure(kMinusOneSets);

        k_sets = Elagage(k_sets, k_sets1, i);
        System.out.println(k_sets);

        // JavaSparkContext.broadcast takes only the value; no ClassTag argument is needed.
        Broadcast<List<String>> broadcastSp = sc.broadcast(k_sets);

        System.out.println(broadcastSp.value());

        k_sets = tr_freq(k_sets, broadcastSp, base_initiale, i);
        k_sets1 = k_sets;
        // Caution: NB only increases here, so the loop condition never becomes false;
        // NB = k_sets.size() may be what was intended.
        NB++;
        i++;
    }
    return k_sets;
}