Indexing Hadoop: If it’s so simple, how come not everyone’s doing it?

Is Hadoop really the best thing since sliced bread? You’d probably get this idea, if you have been talking to any of the (proliferating) Hadoop advocates / vendors. Hadoop and its expanding ecosystem are being touted as the ideal solution to any organization’s data management needs – and admittedly, for good reason. Hadoop offers a […]
Read More →Data modeling for APIs. Part 3: SOAP and XML

In the third part of this series of posts we take a look at some of the details of data modeling for APIs using an XML – SOAP approach. SOAP has been around for a while now, as it was designed as an object-access protocol in 1998 by Dave Winer, Don Box, Bob Atkinson, and […]
Read More →Twitter doesn’t like me, but it’s nothing personal: stock plunges, engagement, and walled gardens

I’m a very late joiner on Twitter. Even though as an analyst being active on Twitter is something that has come to be part and parcel of the job, i used to joke about my “no Twitter by Design” strategy. I had my reasons, some having to do with me, some with Twitter itself. Now […]
Read More →Mastering the business side of SaaS and IaaS

You probably know Gigaom and its expertise on, among other things, cloud. You probably know David Linthicum and Janakiram MSV as renowned cloud experts, and you may know myself from work we’ve done with Gigaom and Interworks as an up and coming vendor. Given that, you are most likely to be interested in our webinar […]
Read More →Data Modeling for APIs. Part 2: REST and JSON

In the second part of this series of posts we start looking into the implications of the choice between a SOAP and a REST approach to implementing APIs from a data modeling perspective. For most people a SOAP API is associated with an XML data model, while a REST API is associated with a JSON […]
Read More →A small step for Impala, a big step for SQL-on-Hadoop. More to come, hopefully.

Recently Cloudera published the results of a benchmark performed internally, comparing its own SQL-on-Hadoop implementation (Impala) against a carefully selected competition composed of Hive and an undisclosed RDBMS and showing that Impala outperforms both. As Gigaom’s Derrick Harris was quick to point out, beating Hive is not something to write home about as Hive is […]
Read More →Data Modeling for APIs. Part 1: setting the stage

Lately we’ve been engaged in the design of a data model for a project aiming to deliver an API for analytics in the domain of energy. As there is an ongoing debate in the consortium wrt to the type of API that will be implemented (RESTful vs Web Services), we have been asked to provide […]
Read More →Predictions, Predictions: It’s BI’s time of year in the cloud

As the year comes to an end, making predictions about the year(s) to come is one of everyone’s favorite pastimes and/or guilty pleasures. Even though i may be somewhat late to join, i got a little bit of inspiration from some of the people, companies and topics that i have worked with in 2013. So […]
Read More →Agile business intelligence: reshaping the landscape

With all the intense activity going on lately, i realized i did not actually make a note of something quite significant: the report that was recently published via GigaOM and Linked Data Orchestration on the new landscape in Business Intelligence. The report is titled “Agile business intelligence: reshaping the landscape” and highlights the crucial factors […]
Read More →Cloudstack Collaboration Conference EU 2013

Last week i had the pleasure of attending the European Cloudstack Collaboration Conference in Amsterdam, as i was kindly invited by Citrix Cloud Advocate, Reuven Cohen. I am most certainly glad i could make it, not only because it was a well-organized event overall (great venue, almost everything functioning smoothly – except for the mix-up […]
Read More →