A standard for storing big data? Apache Spark creators release open-source Delta Lake

From data lakes to data swamps and back again. Data reliability, as in transactional support, is one of the pain-points keeping organizations from getting the most out of their data lakes. Delta Lake is here to address this.
Read More →Google Cloud gives open-source data vendors a break. Will that save open source?

Google will give open-source data vendors that offer their software on Google Cloud a share of the proceeds. It's a good move, and a good thing. But there's more than meets the eye here.
Read More →Streamlio, an open-core streaming data fabric for the cloud era

Apache Kafka replacement and beyond. This is open-core Streamlio's claim to fame, and today's announcement of a managed cloud service brings it one step closer to reality.
Read More →Open source AI chips making Green Waves: Bringing energy efficiency to IoT architecture

What if machine learning applications on the edge were possible, pushing the limits of size and energy efficiency? GreenWaves is doing this, based on an open-source parallel ultra low power microprocessor architecture. Though it's early days, implications for IoT architecture and energy efficiency could be dramatic.
Read More →Confluent shows open source, paradigm shifts, cloud, and commercial success can all co-exist

Confluent just became a unicorn. We discuss why, what happens from now on, and how this is significant for the entire data ecosystem and the world at large with CEO Jay Kreps
Read More →Start the reskilling revolution without me: Future of Work trends and soft data on soft skills

As work is being redefined by automation as well as changing economy and social norms, how can technology help us stay abreast, discover, hone and document our skills, and match labor demand and supply?
Read More →Beyond experts: Jobs, tasks, and skills for a data driven Future of Work

Understanding and keeping track of the relationship between jobs, tasks and skills is essential for the future of work. If current efforts fall short, how can we do this going forward?
Read More →Alibaba Blinks: Building an open source, data-driven cloud empire in real-time

Acquiring data Artisans, the vendor leading development of open source Apache Flink framework for real-time data processing, is the latest move from Alibaba. Where does this fit in Alibaba's strategy to grow its cloud?
Read More →Real-time data processing just got more options: LinkedIn releases Apache Samza 1.0 streaming framework

Samza is now at near-parity with other Apache open-source streaming frameworks such as Flink and Spark. The key features in Samza 1.0 are SQL and a higher level API, adopting Apache Beam. What does this mean for this space, and how do you choose?
Read More →Just another Cyber Monday: Amazing Amazon and the best deal ever

When you get something at 80% off on Amazon, who do you think wins — you or Amazon? If you think that’s a strange question, you ain’t seen nothing yet. Maybe it’s time we re:Invent some things. But, how can possibly getting a huge discount be bad? It’s not, if you actually need what you’re buying, and […]
Read More →