Linkedin open sources Brooklin, a distributed service for streaming data in near real-time at scale, currently powering over 2 trillion messages per day at Linkedin. Data streaming is truly driving the way for real-time machine learning usecases. This is also a very interesting project, primarily as it doesn't aim to replace OSS projects like Kafka, instead it sits on a higher level providing a primary solution for streaming across various stores and messaging systems (Kafka, Azure Events Hub, Kinesis, etc). In this post, they showcase how Brooklyn can be used as a streaming bridge across these heterogeneous messaging services, as well as mirroring kafka functionality, and beyond.
|