我们正在从AMPS [CAMP UP AMPS]服务器订阅实时数据作为Apache flink的来源。关于如何像kafka一样连接它们的任何想法。
答案 0 :(得分:1)
目前,Apache Flink并未为AMPS提供任何现成的连接器,您可以看到here。但是,它确实提供了可扩展的“源/接收器”界面,可用于点击任何自定义源/接收器。
您可以通过扩展RichSourceFunction
并将其传递给flink documentation中提到的addSource
方法来创建自己的AMPS源连接器。请参阅crankuptheamps提供的Java Client API,以连接到源主题并订阅消息。
import org.apache.flink.configuration.Configuration;
import org.apache.flink.streaming.api.functions.source.RichSourceFunction;
import com.crankuptheamps.client.Client;
import com.crankuptheamps.client.Message;
public class AMPSSource extends RichSourceFunction<String> {
private static final long serialVersionUID = -8708182052610791593L;
private String name, topic, connectionString;
private Client client;
public AMPSSource(String name, String connectionString, String topic) {
this.name = name;
this.topic = topic;
this.connectionString = connectionString;
}
@Override
public void open(Configuration parameters) throws Exception {
// We create a Client, then connect() and logon()
client = new Client(this.name);
client.connect(this.connectionString);
client.logon();
}
public void run(SourceContext<String> sourceContext) throws Exception {
/*
* Here, we iterate over messages in the MessageStream returned by
* subscribe method
*/
for (Message message : client.subscribe(this.topic)) {
sourceContext.collect(message.getData());
}
}
@Override
public void close() throws Exception {
try {
cancel();
} finally {
super.close();
}
}
public void cancel() {
client.close();
}
}
可以用作处理器中的源,如下所示,
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
public class StreamProcessor {
public static void main(String[] args) throws Exception {
final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
DataStream<String> ampsStream = env
.addSource(new AMPSSource("flink-consumer", "tcp://127.0.0.1:9007/amps/json", "test-topic"));
ampsStream.print();
env.execute();
}
}
注意:RichSourceFunction实现的并行度为1。要启用并行执行,用户定义的源应实现org.apache.flink.streaming.api.functions.source.ParallelSourceFunction
或扩展org.apache.flink.streaming.api.functions.source.RichParallelSourceFunction