【发布时间】:2020-05-02 08:09:48
【问题描述】:
我已经创建了想要将它们连接在一起的 KStreams。两个流的输出如下:
流 1:
2 {"CODE":"AAAA96","STATUS":"SUBMITTED","ID":2}
流 2:
26 {"DESCRIPTION":"blah blah blah","QUANTITY":1,"ID_CUSTOMER_ORDER":"GR0100926","ID":26}
我想创建这两个Streams的joined stream(内连接),所以我创建了以下KStream:
KStream<String, String> s_joined = s_order
.join(s_order_item, (left,right) -> left + right,
JoinWindows.of(Duration.ofSeconds(30)))
.mapValues(value -> {
String[] arrOfstr = value.split("(?<=})");
JSONObject jl = new JSONObject(arrOfstr[0]);
JSONObject jr = new JSONObject(arrOfstr[1]);
JSONObject json = new JSONObject();
Iterator<String> keys = jl.keys();
while(keys.hasNext()) {
String key = keys.next();
json.put(key, jl.get(key));
}
keys = jr.keys();
while(keys.hasNext()) {
String key = keys.next();
json.put(key, jr.get(key));
}
return json.toString();
});
在这个 KStream 中,我只使用了一个连接,我正在更改输出消息的格式,仅此而已。
通过一个例子,我将解释我想要做什么:
以下消息在窗口内发布:
流 1
9 {"CODE":"AAAA98","STATUS":"CANCELED","ID":"9"}
流 2
9 {"DESCRIPTION":"blah blah blah","QUANTITY":3,"ID_CUSTOMER_ORDER":"GR0100121","ID":"9"}
9 {"DESCRIPTION":"blah blah blah","QUANTITY":0,"ID_CUSTOMER_ORDER":"GR0100480","ID":"9"}
9 {"DESCRIPTION":"blah blah blah","QUANTITY":1,"ID_CUSTOMER_ORDER":"GR0100606","ID":"9"}
9 {"DESCRIPTION":"blah blah blah","QUANTITY":7,"ID_CUSTOMER_ORDER":"GR0100339","ID":"9"}
9 {"DESCRIPTION":"blah blah blah","QUANTITY":6,"ID_CUSTOMER_ORDER":"GR0100911","ID":"9"}
加入流
发布的内容
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":3,"ID_CUSTOMER_ORDER":"GR0100121","ID":"9"}
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":0,"ID_CUSTOMER_ORDER":"GR0100480","ID":"9"}
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":1,"ID_CUSTOMER_ORDER":"GR0100606","ID":"9"}
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":7,"ID_CUSTOMER_ORDER":"GR0100339","ID":"9"}
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":6,"ID_CUSTOMER_ORDER":"GR0100911","ID":"9"}
我想要发布的内容
9 {"CODE":"AAAA98","STATUS":"CANCELED","DESCRIPTION":"blah blah blah","QUANTITY":6,"ID_CUSTOMER_ORDER":"GR0100911","ID":"9"}
最后,我只想发布窗口内的最新消息,而不是全部。这可能吗?
【问题讨论】:
标签: join apache-kafka apache-kafka-streams