尝试使用apache beam
时收到此错误java.lang.NoClassDefFoundError: 组织/阿帕奇/波束/流道/数据流/ util的/ DoFnInfo
尝试运行时
PCollection<ReadableFile> activities = p.apply(
"ReadBatchFiles", FileIO.match().filepattern("<google cloud storage bucket taken out>"))
.apply(FileIO.readMatches());
的pom.xml:
<dependencies>
<dependency>
<groupId>com.google.cloud.dataflow</groupId>
<artifactId>google-cloud-dataflow-java-sdk-all</artifactId>
<version>2.1.0</version>
</dependency>
<dependency>
<groupId>org.apache.beam</groupId>
<artifactId>beam-sdks-java-core</artifactId>
<version>2.2.0</version>
</dependency>
<dependency>
<groupId>org.apache.beam</groupId>
<artifactId>beam-sdks-java-io-google-cloud-platform</artifactId>
<version>2.2.0</version>
</dependency>
<dependency>
<groupId>org.apache.beam</groupId>
<artifactId>beam-runners-direct-java</artifactId>
<version>2.2.0</version>
<scope>test</scope>
</dependency>
<dependency>
<groupId>org.apache.beam</groupId>
<artifactId>beam-runners-google-cloud-dataflow-java</artifactId>
<version>2.2.0</version>
</dependency>
<dependency>
看起来这是移动到:link1
可能在:link2
知道如何将其链接到所需位置吗?
感谢
答案 0 :(得分:0)
Google使DataflowJavaSDK与Beam 2.2.0 only two days ago兼容,但尚未发布。您必须等待SDK的maven版本2.2.0或自己编译源代码依赖。