Why big data means a big year for Hadoop
- 27 January, 2012 10:20
- Comments
You can't have a conversation in today's business technology world without touching on the topic of big data.
Simply put, it's about data sets so large-in volume, velocity and variety-that they're impossible to manage with conventional database tools. In 2011, our global output of data was estimated at 1.8 zettabytes (each zettabyte equals 1 billion terabytes). Even more staggering is the widely quoted estimate that 90 percent of the data in the world was created within the past two years.
Behind this explosive growth in data, of course, is the world of unstructured data. At last year's HP Discover Conference, Mike Lynch, executive vice president of information management and CEO of Autonomy, talked about the huge spike in the generation of unstructured data. He said the IT world is moving away from structured, machine-friendly information (managed in rows and columns) and toward the more human-friendly, unstructured data that originates from sources as varied as e-mail and social media and that includes not just words and numbers but also video, audio and images.
Given the rise of big data, I'm sure you're hearing the buzz around Apache Hadoop, the software framework that supports data-intensive distributed applications under a free license. It enables applications to work with thousands of nodes and petabytes (a thousand terabytes) of data. It certainly looks like the Holy Grail for organizing unstructured data, so it's no wonder everyone is jumping on this bandwagon. A quick Web search will show you that in just the past few months, companies including EMC, Microsoft, IBM, Oracle, Informatica, HP, Dell and Cloudera (to name a few) have adopted this software framework.
What I find even more notable is that companies such as Yahoo, Amazon, comScore and AOL have turned to Hadoop to both scale their businesses and lower storage costs.
According to some recent research from Infineta Systems, a WAN optimization startup, traditional data storage runs $5 per gigabyte, but storing the same data costs about 25 cents per gigabyte using Hadoop.
That's one number any CEO will remember.
So get ready for Hadoopalooza 2012. I'd love to hear what you're doing to tackle big data storage, so please drop me a line anytime.
Michael Friedenberg is the president and CEO of CIO magazine's parent company, IDG Enterprise. Email him at mfriedenberg@cio.com.
Read more about data management in CIO's Data Management Drilldown.
Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.
- Bookmark this page
- Share this article
- Got more on this story? Email CIO
- Follow CIO on twitter
- Transforming Your Business by Transforming Your Processes
- Seven Ways Business Activity Monitoring (BAM) Makes Your Supply Chain More Efficient
- IDC MarketScape: Worldwide Business Process Platforms 2011 Vendor Analysis
- Security Threat Report 2012
- HP VirtualSystem VS1 for VMware - Virtualised environments made faster and easier
-
How to implement next-generation storage infrastructure for Big Data
-
Pfizer's Future Depends on IT Transformation
-
Pfizer's Future Depends on IT Transformation
-
Pfizer's Future Depends on IT Transformation
-
Apple aims iPads at High Schools
-
Consolidated Storage for Virtualised Server Environments
This research brief is based on a recent Tech Target survey with more than 200 storage administrators and IT professionals in mid-sized and enterprise-class companies, and focuses on how these decision-makers view the storage-related challenges that result from server virtualisation. See the results. -
Unified Communications Strategy Guide
Articles include: How to ensure a successful UC project; Five reasons to set up unified communications; Unified communications: Is your network ready?; How to get the most from unified communications. Read this Computerworld Strategy Guide. -
Providing effective endpoint management at the lowest total cost
Endpoints, otherwise known as servers, workstations, laptops, mobile devices, and virtually any other network-connected device, are critical components that enable business to be transacted. Properly implemented, endpoint management ensures continuous compliance with IT policies, regardless of where the machines are located and what type of network they are connected to.
-
Windows Home Server
-
Master Visually Microsoft Office 2003, 2nd Edition
-
Agile Documentation - a Pattern Guide to Producing Lightweight Documents for Software Projects
-
Pocket Pcs! I Didn't Know You Could Do That... (Includes CD-ROM)
-
Photoshop Elements 4
-
Mindmanager for Dummies
-
Microsoft PowerPoint 2002 Step By Step Courseware
-
Access 2000 Programming for Dummies
-
Macromedia Dreamweaver 8 Visual Encyclopedia








Comments
Post new comment