【问题标题】:KafkaIO Connector/Apache Beam Transform "go" SDK Available?KafkaIO 连接器/Apache Beam 转换“go”SDK 可用吗?
【发布时间】:2020-01-25 01:13:54
【问题描述】:

我正在使用 Apache Beam “go” SDK 构建数据摄取管道。

我的管道是使用来自 Kafka 队列的数据并将数据持久化到 Google Cloud Bigtable(和/或另一个 Kafka 主题)。

到目前为止,我还没有找到用“go”编写的 Kafka IO 连接器(也称为 Apache I/O 转换)(但是我找到了 java 版本)。

这里是支持的 Apache Beam 内置 I/O 转换的链接: https://beam.apache.org/documentation/io/built-in/

我正在寻找以下 Java 代码的“go”等价物:

    pipeline.apply("kafka_deserialization", KafkaIO.<String, String>read()
        .withBootstrapServers(KAFKA_BROKER)
        .withTopic(KAFKA_TOPIC)
        .withConsumerConfigUpdates(CONSUMER_CONFIG)
        .withKeyDeserializer(StringDeserializer.class)
        .withValueDeserializer(StringDeserializer.class))

您有关于 KafkaIO 连接器“go”SDK/库的可用性的任何信息吗?

【问题讨论】:

  • 那里有很多。最后我检查了一下,这个包裹了优秀的 librdkafka,所以应该相当不错confluent。它是由编写 C/C++ kafka 库 BTW 的同一个人开始的。 PS:这个问题很可能会被关闭,因为你要求的是场外资源(查看帮助部分,了解为什么这被认为是题外话)
  • @Elias 不过这不适合 Beam
  • 感谢 @EliasVanOotegem 的 cmets。我已经更详细地更新了我的帖子。任何有关此主题的信息将不胜感激。谢谢。
  • Beam 是开源的...Have you searched Github?

标签: go apache-kafka apache-beam apache-beam-io


【解决方案1】:

@cricket_007 如果您也好奇,我收到了来自 Apache Beam 团队的 Robert Burke (rebo@google.com) 的以下更新:

There presently isn't a Kafka transform for Go. 

The Go SDK is still experimental, largely due to scalable IO support, which is why the Go SDK isn't represented in the built-in io page.

There's presently no way for an SDK user to write a Streaming source in the Go SDK, since there's no mechanism for a DoFn to "self terminate" bundles, such as to allow for scalability and windowing from streaming sources. 

However, SplittableDoFns are on their way, and will eventually be the solution for writing these.

At present, the Beam Go SDK IOs haven't been tested and vetted for production use. Until the initial SplittableDoFn support is added to the Go SDK, Batch transforms cannot split, and can't scale beyond a single worker thread. This batch version should land in the next few months, and the streaming version land a few months after that, after which a Kafka IO can be developed. 

I wish I had better news for you, but I can say progress is being made.

Robert Burke

【讨论】:

  • 这个对话有链接吗?我想看看有没有这方面的更新。
猜你喜欢
  • 2023-03-11
  • 1970-01-01
  • 2023-02-03
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 2019-09-26
  • 1970-01-01
  • 1970-01-01
相关资源
最近更新 更多