Real-time data processing just got more options: LinkedIn releases Apache Samza 1.0 streaming framework
Samza is now at near-parity with other Apache open-source streaming frameworks such as Flink and Spark. The key features in Samza 1.0 are SQL and a higher level API, adopting Apache Beam. What does this mean for this space, and how do you choose?
The ability to ingest and process large volumes of data in real time is something interesting to more and more organizations. This is an area seeing rapid growth, as the use cases can translate to direct business benefits.
We have been following this space for a while now, and the release of Apache Samza 1.0 is a good opportunity to revisit it and see how this changes things, if at all.
Apache Samza was developed at LinkedIn in 2013. Samza became a top-level Apache project in 2014, and now, it is used by over 3,000 applications in production at LinkedIn. The use cases include detecting anomalies, combating fraud, monitoring performance, notifications, real-time analytics, and many more.