Category: Hadoop

Scaling Hadoop – Myth vs. Reality

People use Hadoop for storing, processing, and analyzing ever-increasing volumes of data. The question is no longer whether Hadoop can scale to meet the demand. The question now is at what point the operational cost of scaling Hadoop exceeds the value realized from the data analysis. How much of the Hadoop scale-out story is reality vs. hype?

Read on so you can separate the myth from the reality.

Designing for scalability with the Dell | Hadoop solution

Over the last few months I’ve been having conversations with a lot of Hadoop users and developers. I’m glad to see that everyone wants to run Hadoop in production. Most of the practitioners also realize that, although Hadoop can scale, there are no clear guidelines that describe how to scale Hadoop up or out from a very small to a very large deployment.

OSCON 2011 Session: Hadoop – Enterprise Data Warehouse Integration

Last Monday, July 25th, I gave a presentation at OSCON about the integration between Hadoop and the Enterprise Data Warehouse (EDW).

The session was well attended. The dialog and the exchange of opinions were very good. The conversations continued after the session, which, to me, shows that Big Data management is top of mind for many practitioners.

I very much enjoyed the dialog. I also learned a few things! Thanks, O’Reilly, for another successful OSCON!