Apache Storm是否允许处理HDFS上存储的大量文件?
知道我的目标是获得实时响应(秒或毫秒),
或apache风暴仅用于流式处理!!
谢谢
答案 0 :(得分:1)
Storm is only for streaming (as opposed to batch processing), but if I'm understanding you correctly, you want to read files from HDFS and process them?
The storm-hdfs module has a spout (topology data source). It might do what you want.
https://github.com/apache/storm/tree/master/external/storm-hdfs#hdfs-spout