Critical.
Authoritative.
Strategic.
Subscribe to CIO Magazine »

Informatica adds support for 'big data,' Hadoop

A Hadoop connector is part of Informatica 9.1

Informatica is joining the growing ranks of vendors moving to support Hadoop, the open-source framework for large-scale or "big data" processing, the company announced Monday.

The 9.1 version of Informatica's platform features a connector to the Hadoop file system (HDFS), allowing customers to move data in and out of Hadoop clusters.

While the Hadoop project has its roots in Web companies, having been led by Yahoo, enterprises are quickly warming up to it as well, said James Markarian, Informatica executive vice president and CTO.

The problem is that corporate IT shops may not have the right sort of in-house expertise, Markarian said.

"It's early days for Hadoop, but as it starts to get more mainstream, the kind of developers that can make use of it is really changing," he said. "Your average guy in IT, they don't know MapReduce, they don't know Hive, they don't know Pig," he said, referring to a range of Hadoop tools. "That's where we come in. You don't need to learn anything else. Take your Informatica skills and we'll bring you to Hadoop."

Informatica 9.1 also targets the growing number of information types that get lumped under the "big data" header. Along with "near-universal" connections to transactional databases like IBM DB2 and Oracle, as well as analytic-focused data platforms such as Netezza and Teradata, the new release can also pull in data from social sites like Facebook, Twitter and LinkedIn.

Other aspects of Informatica's platform, such as MDM (master data management), data quality and self-service tools, are also getting a range of updates as part of the 9.1 release.

The social media and Hadoop connectors are sold separately from the core platform, according to Markarian. Pricing was not immediately available.

It is indeed early days for Hadoop, according to Forrester Research analyst James Kobielus .

While Informatica now has the ability to load and retrieve data from Hadoop clusters, that's not necessarily different from what a number of data warehousing vendors already have, and it's likely that other data integration vendors will follow Informatica's move, he said.

Overall, effective use of Hadoop is not about one tool, according to Kobielus. Early adopters should look to standardize their activities on a core stack of technologies, which hasn't emerged yet, he said. So far, it seems like the only common element in Hadoop projects is the use of MapReduce for the modeling layer, he said.

"I would like to see Informatica and other data integration vendors offer rich IDEs (integrated development environments) for [Hadoop]," Kobielus said. "I strongly expect they will do that."

Chris Kanaracus covers enterprise software and general technology breaking news for The IDG News Service. Chris's e-mail address is Chris_Kanaracus@idg.com

Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.

More about: Facebook, Forrester Research, IBM, IBM Australia, IDG, Informatica, James Kobielus, Netezza, Oracle, Teradata, Yahoo
References show all

Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
Users posting comments agree to the CIO comments policy.
Login or register to link comments to your user profile, or you may also post a comment without being logged in.
Related Coverage
Related Whitepapers
Latest Stories
Community Comments
Tags: informatica, Apache Software Foundation, software, applications, data warehousing, business intelligence, open source, middleware, data integration
Latest Blog Posts
Whitepapers
  • The State of Data Security
    Recognize how your data can become vulnerable, including the latest issues stemming from unprotected data on mobile devices and social media sites. Understand the compliance issues involved, and identify data protection strategies you can use to keep your company’s information both safe and compliant.
    Learn more »
  • How progressive companies are using social technologies
    Social networks and collaborative technologies are now commonplace in many workplaces. Having first been used “on the quiet” by highly-networked employees, in increasing numbers they are now being proactively used by businesses keen to connect more effectively with their internal and external audiences. Web collaboration is now viewed as critical to company success and as having multiple benefits and applications to the business. Read on.
    Learn more »
  • Case Study: Keeping information on the move: Clearswift protects Maman, the logistics experts
    Time is money. Every minute a consignment is held up in transit costs money and causes problems. Web and email are mission critical business tools that enable Maman, and their customers, to efficiently collaborate with partners across the globe. Spam, and other web based threats can result in delays that ultimately lead to missed deadlines - keeping the lines of communication open is therefore a key priority for Maman. Read on.
    Learn more »
All whitepapers
rhs_login_lockGet exclusive access to Invitation only events CIO, reports & analysis.
Recent comments