Semantic data lake architecture in healthcare and beyond

Data lakes can be a great asset, but they need an array of elements to work properly. We take a look at how it works for Montefiore Health System and discuss the role of semantics and graph databases in the data lake architecture.
Read More →Supercharging your image: Machine learning for photography applications

Advanced capabilities for image retrieval and processing are relatively new and powered to a large extent by advances in machine learning technology. We present a brief history of this space, and share the story of how Shutterstock has embraced this technology and what it does for them.
Read More →In-memory computing: Where fast data meets big data

The evolution of memory technology means we may be about to witness the next wave in computing and storage paradigms. If Hadoop disrupted by making it easy to utilize pooled commodity hardware with spare compute and storage, could the next disruption come from doing the same for spare compute and memory?
Read More →Will the real Elon Musk please stand up? Autonomous bots and synthesized speech in the public domain

The ability to create virtual clones that appear to think and talk like the real thing is very much real, as it has been done for Elon Musk and Barack Obama. We discuss techniques and potential with the people behind them.
Read More →CatBoost Machine Learning framework from Yandex boosts the range of AI

This is the year artificial intelligence (AI) was made great again. AI is all about machine learning, and machine learning is all about deep learning (DL), according to the hype. For connaisseurs like Yandex, there's more to AI than deep learning. CatBoost, the open source framework Yandex just released, aims to expand the range of what is possible in AI and what Yandex can do.
Read More →How machine learning is taking on online retail fraud

Fraud is one of the biggest causes of lost revenue for online retailers. Fraugster and Riskified, two startups that operate in this space, share their insights and methods for safeguarding online retail.
Read More →Shipping to data: The case of Amazon Prime Day

Did you ever wonder how retail hallmarks like Amazon Prime Day and global shipping are connected? Freightos has the data to answer such questions, as it happens to be on a mission to disrupt the freight industry.
Read More →Alibaba: Building a retail ecosystem on data science, machine learning, and cloud

What does it take to compete in a global arena in which retail and cloud are increasingly intertwined? Domain-specific data science and machine learning for the masses, according to Alibaba.
Read More →NBA analytics: Going data pro

For the NBA, like every other sports league, awards are important. They can generate attention, spur debate, make money, and involve fans, players, and experts, among others. Is there data science and analytics behind them — can there or should there be? We picked the NBA Most Improved Player award as an example to analyze some aspects of data-driven culture.
Read More →Kafka: The story so far

Hard problems at scale, the future of application development, and building an open source business. If any of that is of interest, or if you want to know about Kafka, real-time data, and streaming APIs in the cloud and beyond, Jay Kreps has some thoughts to share.
Read More →