Indexing Hadoop: If it’s so simple, how come not everyone’s doing it?

Is Hadoop really the best thing since sliced bread? You’d probably get this idea, if you have been talking to any of the (proliferating) Hadoop advocates /  vendors. Hadoop and its expanding ecosystem are being touted as the ideal solution to any organization’s data management needs – and admittedly, for good reason. Hadoop offers a […]

Read More →

Twitter doesn’t like me, but it’s nothing personal: stock plunges, engagement, and walled gardens

I’m a very late joiner on Twitter. Even though as an analyst being active on Twitter is something that has come to be part and parcel of the job, i used to joke about my “no Twitter by Design” strategy. I had my reasons, some having to do with me, some with Twitter itself. Now […]

Read More →

A small step for Impala, a big step for SQL-on-Hadoop. More to come, hopefully.

Recently Cloudera published the results of a benchmark performed internally, comparing its own SQL-on-Hadoop implementation (Impala) against a carefully selected competition composed of Hive and an undisclosed RDBMS and showing that Impala outperforms both. As Gigaom’s Derrick Harris was quick to point out, beating Hive is not something to write home about as Hive is […]

Read More →