MongoDB Kafka Sink connector does not process the RenameByRegex post-processor

Time: 2020-04-13 15:21:00

Tags: mongodb apache-kafka apache-kafka-connect mongodb-kafka-connector

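I need to listen for events from a Kafka topic and sink them into a collection in MongoDB. The message contains a nested object with an id property, as shown in the example below.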

{
    "testId": 1,
    "foo": "bar",
    "foos": [{ "id":"aaaaqqqq-rrrrr" }]
}

I am trying to rename the nested id to _id using a regular expression:

{
    "connector.class": "com.mongodb.kafka.connect.MongoSinkConnector",
    "topics": "test",
    "connection.uri": "mongodb://mongo:27017",
    "database": "test_db",
    "collection": "test",
    "key.converter": "org.apache.kafka.connect.storage.StringConverter",
    "value.converter": "org.apache.kafka.connect.json.JsonConverter",
    "value.converter.schemas.enable": "false",
    "document.id.strategy": "com.mongodb.kafka.connect.sink.processor.id.strategy.PartialValueStrategy",
    "value.projection.list": "testId",
    "value.projection.type": "whitelist",
    "post.processor.chain": "com.mongodb.kafka.connect.sink.processor.DocumentIdAdder, com.mongodb.kafka.connect.sink.processor.field.renaming.RenameByRegex",
    "field.renamer.regexp": "[{\"regexp\":\"\b(id)\b\", \"pattern\":\"\b(id)\b\",\"replace\":\"_id\"}]"
}

The result of the config/validate request is a 500 Internal Server Error with the following message:

{
    "error_code": 500,
    "message": null
}

What am I missing, or what is wrong?

1 Answer:

Answer 0 (score: 0)

I think all you need is a Kafka Connect Single Message Transform (SMT), more precisely ReplaceField:

Filter or rename fields within a Struct or Map.


The following renames the id field to _id:

"transforms": "RenameField",
"transforms.RenameField.type": "org.apache.kafka.connect.transforms.ReplaceField$Value",
"transforms.RenameField.renames": "id:_id"

In your case, you may also want to Flatten foos before applying the transformation above:

"transforms": "flatten",
"transforms.flatten.type": "org.apache.kafka.connect.transforms.Flatten$Value",
"transforms.flatten.delimiter": "."

Finally, apply the field-renaming transformation:

"transforms": "RenameField",
"transforms.RenameField.type": "org.apache.kafka.connect.transforms.ReplaceField$Value",
"transforms.RenameField.renames": "foos.id:foos._id"