Cloudera, Hortonworks, and how they are progressing in their Hadoop security agenda

Cloudera, Hortonworks, and how they are progressing in their Hadoop security agenda

Back in July 2014, Gigaom published a research note on Hadoop security written by me. As there seems to be rapid progress in this domain, in the last few days there have been a couple of related developments that call for an update on the picture drawn in that report. As was noted in the […]

Read More →

The best approach to managing Hadoop

The best approach to managing Hadoop

This week Linked Data Orchestration published another report on Big Data. The report is titled “The best approach to managing Hadoop” and is brought to you in collaboration with Gigaom and Zettaset. The report examines the 3 choices available for managing Hadoop and discusses the pros and cons for each one. Organizations leveraging on-premises Hadoop […]

Read More →

WSO2: Open Source Enterprise Application Integration in the Cloud

WSO2: Open Source Enterprise Application Integration in the Cloud

WSO2 is a notable Enterprise Application Integration & Cloud vendor. They are notable not just because of what they do, but also because of how they do it. WSO2 offers an integrated platform featuring an Enterprise Service Bus and solutions that span Identity, Governance, Business Process Management, API Management, Big Data Analytics and Cloud. And […]

Read More →

Hadoop World vendor announcements and impact on Accessibility, Scalability and Security

Hadoop World vendor announcements and impact on Accessibility, Scalability and Security

This week the world’s biggest event for all things Hadoop takes place, the Strata and Hadoop World conference. Vendors announce and showcase new releases and features in their offerings, and Gigaom covered the extensive array of news. Let’s try to decipher them and see their impact in terms of Hadoop distributions Accessibility, Scalability and Security. […]

Read More →

Data Modeling for APIs. Part 5: Modeling vs. Meta-Modeling

Data Modeling for APIs. Part 5: Modeling vs. Meta-Modeling

Recently i was involved the creation of a data model for a project in the Energy domain. As this was an international, multi-partner project with many stakeholders and respective components, a dillema emerged for debate: to model, or to meta-model? We use this occassion as an example to mention the pros and cons for each […]

Read More →

RDF on Hadoop and Schema on Read vs. Schema on Write

RDF on Hadoop and Schema on Read vs. Schema on Write

One of the challenges for any Big Data solution is dealing with scale, and RDF stores are no exception: going for billions of RDF triples (the equivalent of rows in the SQL world) is not trivial. Hadoop on the other hand is great at scaling out on commodity hardware, which is a feature every MPP […]

Read More →

SPARQL City and Benchmarks

SPARQL City and Benchmarks

We have written in the past about SPARQL, Hadoop and benchmarks. In this post, we take a look at a company that combines all of these subjects, SPARQL City, on the occasion of the results they released after subjecting their product, SPARQLVerse, to the SP2 benchmark. This year’s NoSQLNow conference was colocated with SemTechBiz, providing […]

Read More →

Structure Data and the Competitive Landscape

Structure Data and the Competitive Landscape

Structure Data 2014 was another excellent Gigaom event. To anyone who has ever been to one, this is no surprise. Here’s my pick of the most interesting sessions and related news. Intel and its Hadoop distribution strategy Strange as it may sound, Intel has its own Hadoop distribution. Or at least it did up to […]

Read More →

Unlikely PaaS alliances, strange offerings, and variable gauges

Unlikely PaaS alliances, strange offerings, and variable gauges

Recently we’ve seen developments in PaaS offerings that may strike some as odd or surprising: first, RedHat offers Microsoft .Net and SQL Server as cloud services, and then Microsoft offers Oracle’s flagship database, WebLogic Server middleware and Java on its Azure platform as “license-included virtual machine images” in the Windows Azure Image Gallery. What’s this […]

Read More →

On APIs, JSON, Linked Data, attitude and opportunities

On APIs, JSON, Linked Data, attitude and opportunities

I’ve been meaning to revisit some of the things i’ve been writing about and getting feedback on lately – APIs, the JSON vs. XML “non” debate and Linked Data. My focus was going to be on JSON-LD as the low-hanging fruit of Linked Data, and this week some news came out that gave me the […]

Read More →