| By Paul Miller | Article Rating: |
|
| July 23, 2012 08:57 AM EDT | Reads: |
1,050 |
My latest piece of work for GigaOM Pro just went live. Scaling Hadoop clusters: the role of cluster management is available to GigaOM Pro subscribers, and was underwritten by StackIQ.
Thanks to everyone who took the time to speak with me during the preparation of this report.
As the blurb describes,
From Facebook to Johns Hopkins University, organizations are coping with the challenge of processing unprecedented volumes of data. It is possible to manually build, run and maintain a large cluster and to use it to run applications such as Hadoop. However, many of the processes involved are repetitive, time-consuming and error-prone. So IT managers (and companies like IBM and Dell) are increasingly turning to cluster-management solutions capable of automating a wide range of tasks associated with cluster creation, management and maintenance.
This report provides an introduction to Hadoop and then turns to more-complicated matters like ensuring efficient infrastructure and exploring the role of cluster management. Also included is an analysis of different cluster-management tools from Rocks to Apachi Ambari and how to integrate them with Hadoop.
Compulsory picture of an elephant as it’s a Hadoop story provided by Flickr user Brian Snelson.
Related articles

Read the original blog entry...
Published July 23, 2012 Reads 1,050
Copyright © 2012 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Paul Miller
Paul Miller works at the interface between the worlds of Cloud Computing and the Semantic Web, providing the insights that enable you to exploit the next wave as we approach the World Wide Database. He blogs at www.cloudofdata.com.
- Not Quite Ready to Live in the Cloud
- Cloud Database Company Xeround, and a Tale of Evolving Business Models
- Discussing Virtual Machine Interoperability with the Open Data Center Alliance
- Visualisation – the key that unlocks data’s value?
- OpenStack Summit – thoughts from Portland
- Survey lifts covers on Cloud Promiscuity: good thing, bad thing, or who cares?
- Doing the DataBeat
- To Dublin, in search of evidence
- Getting it right with data attribution
- Find the data, aggregate the data, make the data useful
- Seeking Simplicity’s Sweet Spot
- Unpicking the multi-cloud at GigaOM Structure
- Not Quite Ready to Live in the Cloud
- Cloud Database Company Xeround, and a Tale of Evolving Business Models
- Discussing Virtual Machine Interoperability with the Open Data Center Alliance
- Visualisation – the key that unlocks data’s value?
- OpenStack Summit – thoughts from Portland
- Survey lifts covers on Cloud Promiscuity: good thing, bad thing, or who cares?
- Doing the DataBeat
- To Dublin, in search of evidence
- Getting it right with data attribution
- Is Infochimps running from the Data Market business?
- Find the data, aggregate the data, make the data useful
- Seeking Simplicity’s Sweet Spot
- Cloud Computing Is Far More Than Just Cutting Enterprise IT Costs
- Security and the Cloud
- David Eaves Talks About Vancouver’s Open Data Initiative
- Talking to Simon Wardley About Ubuntu and Cloud Computing
- Juan Carlos Soto Reaffirms Sun Microsystems’ Commitment to the Cloud
- If Government is a Platform, What Are People Building?
- Keep Your Executive Assistant Happy if Moving to the Cloud
- Tungle Goes a Long Way Toward Reducing the Pain of Scheduling Meetings
- Discussing Cisco’s Unified Computing System with Wendy Mars
- Eucalyptus Project Closes $5.5 Million Series A with Benchmark
- Licensing of Linked Data
- Hewlett Packard: A Tale of Many Clouds

















