通过使用水槽将本地文件源发送到HDFS接收器

时间:2018-06-27 02:35:16

标签: hadoop flume flume-ng

我正在使用水槽将本地文件发送到HDFS接收器,以下是我的配置文件:

# Name the components on this agent
a1.sources = r1
a1.sinks = k1
a1.channels = c1

# Describe/configure the source
a1.sources.r1.type = spooldir
a1.sources.r1.spoolDir = /usr/download/test_data/
a1.sources.r1.basenameHeader = true
a1.sources.r1.basenameHeaderKey = fileName

# Describe the sink
a1.sinks.k1.type = hdfs
a1.sinks.k1.hdfs.path = hdfs://172.16.10.5/user/admin/Data/
a1.sinks.k1.hdfs.filePrefix = %{fileName}
a1.sinks.k1.hdfs.idleTimeout=60

# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 10000
a1.channels.c1.transactionCapacity = 5000

# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

然后我使用用户'flume'执行该conf文件。

time bin/flume-ng agent -c conf -f conf/hdfs_sink.conf -n a1 -Dflume.root.logger=INFO,console

但是它表明我找不到本地文件,权限被拒绝

Could not find file: /usr/download/test_data/sale_record0501.txt
java.io.FileNotFoundException: /usr/download/test_data/.flumespool/.flumespool-main.meta (Permission denied)

如何解决这个问题?

1 个答案:

答案 0 :(得分:0)

您的flume用户在假脱机目录下可能没有权限。假脱机目录位于/usr,可能需要root权限才能访问该路径。

首先以sudo su成为根用户,然后执行或将执行命令替换为

sudo bin/flume-ng agent -c conf -f conf/hdfs_sink.conf -n a1 -Dflume.root.logger=INFO,console

另一方面,您可以通过{p>

flume