Does Apache Storm allow processing a large number of files stored on HDFS?

Time: 2019-01-08 08:15:23

Tags: hadoop apache-storm

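Does Apache Storm allow processing a large number of files stored on HDFS?

Note that my goal is to get real-time responses (seconds or milliseconds).

Or is Apache Storm only for stream processing?

Thanks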

1 answer:

Answer 0 (score: 1)

Storm is only for streaming (as opposed to batch processing), but if I'm understanding you correctly, you want to read files from HDFS and process them?

The storm-hdfs module has an HDFS spout (a spout is a topology's data source) that reads files from an HDFS directory and emits their contents as tuples. It might do what you want.

https://github.com/apache/storm/tree/master/external/storm-hdfs#hdfs-spout
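
For reference, here is a rough sketch of how the spout could be wired into a topology, based on the fluent setters shown in that README. Everything specific here is an assumption for illustration: the NameNode URI, the three directory paths, the component names, and the `PrintLineBolt` class are all hypothetical, and older storm-hdfs releases configure the spout through config keys instead of these setters, so check the README for your Storm version. It needs storm-core and storm-hdfs on the classpath and a running cluster, so treat it as a wiring fragment rather than a tested program.

```java
import org.apache.storm.Config;
import org.apache.storm.StormSubmitter;
import org.apache.storm.hdfs.spout.HdfsSpout;
import org.apache.storm.hdfs.spout.TextFileReader;
import org.apache.storm.topology.BasicOutputCollector;
import org.apache.storm.topology.OutputFieldsDeclarer;
import org.apache.storm.topology.TopologyBuilder;
import org.apache.storm.topology.base.BaseBasicBolt;
import org.apache.storm.tuple.Tuple;

public class HdfsLinesTopology {

    // Hypothetical terminal bolt: prints each line emitted by the spout.
    public static class PrintLineBolt extends BaseBasicBolt {
        @Override
        public void execute(Tuple input, BasicOutputCollector collector) {
            // The text reader emits one field per tuple: the line itself.
            System.out.println(input.getString(0));
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            // Emits nothing downstream, so there are no fields to declare.
        }
    }

    public static void main(String[] args) throws Exception {
        // All URIs and paths below are placeholders (assumptions).
        HdfsSpout spout = new HdfsSpout()
                .setReaderType("text")                          // read plain text, line by line
                .withOutputFields(TextFileReader.defaultFields)
                .setHdfsUri("hdfs://namenode:8020")             // placeholder NameNode URI
                .setSourceDir("/data/in")                       // directory the spout reads from
                .setArchiveDir("/data/done")                    // fully processed files are moved here
                .setBadFilesDir("/data/bad");                   // unreadable files are moved here

        TopologyBuilder builder = new TopologyBuilder();
        builder.setSpout("hdfs-spout", spout, 1);
        builder.setBolt("print", new PrintLineBolt(), 1)
               .shuffleGrouping("hdfs-spout");

        StormSubmitter.submitTopology("hdfs-lines", new Config(), builder.createTopology());
    }
}
```

Note that this still treats the files as a stream of lines: the spout feeds records into the topology as it reads them, rather than running a batch job over the whole data set.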