Want to boost customer engagement? Invest in data integration, metadata, data governance, says Informatica

What does a Data Hub Reference Architecture have to do with customer engagement? A lot, according to Informatica, which wants to complement Adobe, Microsoft, and SAP in their Open Data Initiative. The big question, however, is whether this has legs.

Real-time data processing just got more options: LinkedIn releases Apache Samza 1.0 streaming framework

Samza is now at near-parity with other Apache open-source streaming frameworks such as Flink and Spark. The key features in Samza 1.0 are SQL and a higher-level API, adopting Apache Beam. What does this mean for this space, and how do you choose?

ScyllaDB achieves Cassandra feature parity, adds HTAP, cloud, and Kubernetes support

ScyllaDB, the open-source drop-in replacement for Apache Cassandra, is growing up. Version 3.0 closes the gap in terms of features, and has a few extras to add on top of superior performance over Cassandra.

MemSQL 6.7 brings free tier and performance enhancement: Transactions for nothing, and faster queries for free

MemSQL is not the first database to offer a free tier. But this one comes with full functionality to support real-world use cases, while also improving performance for typical data warehousing queries by a factor of 100.

Knowledge graphs beyond the hype: Getting knowledge in and out of graphs and databases

What exactly are knowledge graphs, and what's with all the hype about them? Learning to tell hype from reality, defining different types of graphs, and picking the right tools and database for your use case are essential if you want to be like the Airbnbs, Amazons, Googles, and LinkedIns of the world.

Google can now search for datasets. First research, then the world?

Did you ever need data on a topic you wanted to research, and had a hard time finding it? Wish you could just Google it? Well, now you can do that.

The web as a database: The biggest knowledge graph ever

Imagine you could get the entire web in a database, and structure it. Then you would be able to get answers to complex questions in seconds by querying, rather than searching. This is what Diffbot promises.

Data-driven disaster relief: Measuring the impact of emergency response

With natural disasters increasing in frequency and intensity, the role of NGOs in disaster relief is growing as well. A key requirement for all NGOs is transparency, and applying data-driven techniques may help.

MemSQL 6.5: NewSQL with autonomous workload optimization, improved data ingestion and query execution speed

MemSQL wants to be the world's best database. Leading that race is a tall order, but the new version seems to improve on an already strong offering.

Moving fast without breaking data: Governance for managing risk in machine learning and beyond

How do you resolve the tension between the need to build and deploy accurate machine learning models fast, and the need to understand how those models work, what data they touch, and what the implications are? Immuta says data governance is the answer.