Pentaho open sources 'big data' integration tools under Apache 2.0
- 31 January, 2012 01:05
- Comments
BI vendor Pentaho is open sourcing a number of tools related to "big data" in the 4.3 release of its Kettle data-integration platform and has moved the project overall to the Apache 2.0 license, the company announced Monday.
While Kettle had always been available in a community edition at no charge, the tools being open sourced were previously only available in the company's commercialized edition. They include integrations for Hadoop's file system and MapReduce as well as connectors to NoSQL databases such as Cassandra and MongoDB.
Those technologies are some of the most popular tools associated with the analysis of "big data," an industry buzzword referring to the ever-larger amounts of unstructured information being generated by websites, sensors and other sources, along with transactional data from enterprise applications.
The big data components will still be offered as part of a commercial package, Pentaho Business Analytics Enterprise Edition, which bundles in tech support maintenance and additional functionality, said Doug Moran, company co-founder and big data product manager.
Kettle's big data features provide visual tools that can greatly boost developer productivity by cutting down on the amount of code they need to write to work with MapReduce, NoSQL data stores and other technologies, according to Pentaho. They also deliver a "super-easy on-ramp" to Pentaho's BI suite, the company added.
The move to the Apache 2.0 license from LPGL makes sense, since major big data projects like Hadoop already use it, said Forrester Research analyst James Kobielus. Therefore, life should get simpler from a licensing perspective for developers running big data projects that incorporate multiple technologies.
Kettle competes with the likes of Talend Integration Suite, another open-source offering that has also added support for big data platforms, including Hadoop.
Chris Kanaracus covers enterprise software and general technology breaking news for The IDG News Service. Chris's e-mail address is Chris_Kanaracus@idg.com
Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.
- Bookmark this page
- Share this article
- Got more on this story? Email CIO
- Follow CIO on twitter
-
Face Time - Interview with John Brennan and Robert DiStefano
-
How to implement next-generation storage infrastructure for Big Data
-
Pfizer's Future Depends on IT Transformation
-
Pfizer's Future Depends on IT Transformation
-
Pfizer's Future Depends on IT Transformation
-
Get the Whole Picture Why Most Organizations Miss User Response Monitoring—and What to Do About It
You can be armed with vast amounts of performance metrics, but if you don’t know what users are actually experiencing, you don’t have the real performance picture. While this measure is critical, it is one many organizations fail to consistently capture. This guide looks at the challenges of user response monitoring, and it shows how you can overcome these challenges and start to get a real handle on your infrastructure performance and how it impacts your users’ experience. -
Enterprise Buyers Guide for Application Development Software
New software delivery models, leaner and faster development methodologies, emerging mobile apps and the impact of open source are all key trends changing the way software will be procured in the future. To help organisations understand this changing landscape and to provide a framework for procurement Computerworld has created an enterprise buyers guide which includes the top technology trends in applications, programming, architectures and methodologies. It profiles the software vendors to watch, addresses the security concerns caused by Web 2.0 and examines the impact of Open Source Software (OSS). -
ALM Buyers Guide: A Practical Guide to Choosing the Right Agile Tools for your Team
This buyer's guide describes the key criteria for application lifecycle management (ALM) solutions for today's high-performance teams. It includes key considerations for enhancing your single- or multi-vendor ALM environment.
-
Blackberry Curve for Dummies®
-
Mastering AutoCAD and AutoCAD LT
-
Remoting Patterns - Foundations of Enterprise, Internet and Realtime Distributed Object Middleware
-
WileyPlus Stand-alone to Accompany Java Concepts 6/E for Java 7 and 8 International Student Version
-
Mobile Vpn
-
Writing Scientific Programs Under the Os/2 Presentation Manager
-
Comptia Linux+ Study Guide (Exam Xk0-003)
-
Mr. Spreadsheet's Excel 2007 Library
-
Unicode








Comments
Post new comment