Discussion about this post

User's avatar
Neural Foundry's avatar

The multi primary Kafka setup here is facinating. Im surprised the standard Flink connectors treat an unavailable cluster as a fatal error, that seems like a major oversight for high avaliability systems. The custom union of Kafka streams approach makes total sense as a workaround, tho I wonder how much complexity that adds to debugging when things go wrong. Also interesting that PyFlink still has these gaps around async IO and streaming joins, seems like those would be pretty critical for AI workloads. Did they end up having to write a lot of Java wrappers to get around that?

Expand full comment
Xian's avatar

Except for ChatGPT, you’re the only one who’s helped me understand so many things that were completely beyond my scope.

Expand full comment

No posts