Critical.
Authoritative.
Strategic.
Subscribe to CIO Magazine »

Sybase IQ 15.4 features 'Big Data' theme, support for Hadoop, MapReduce

Release 15.4 of the analytic database hops on the 'Big Data' train

Sybase is hoping its IQ analytic database can make its mark in the burgeoning "Big Data" market with an array of new features, including native integration with the open-source MapReduce and Hadoop programming frameworks for large-scale data processing.

The 15.4 release of IQ, scheduled for no later than Dec. 1, will come just several months after version 15.3, which added massively parallel-processing features to the platform. But work on 15.4 actually began about a year-and-a-half ago, because Sybase set up a few different teams to work on separate upcoming editions, said Dan Lahl, senior director of product marketing. "This is not a three-month hack job."

"Big Data" is an industry buzz-phrase that refers generally to the huge amounts of information created by websites, sensors and other sources outside of traditional enterprise applications. With Tuesday's announcement, Sybase joins the wave of database, data warehousing and analytics vendors that have been moving quickly to develop premium features and services around Hadoop and MapReduce.

New features in the 15.4 release include a native MapReduce programming interface that uses standard SQL; a Hadoop integration that provides various ways to tie together data from Sybase and Hadoop; a Java interface and additional extensions for existing C++ interfaces for running in-database algorithms; support for PMML (Predictive Model Markup Language) via a partnership with Zementis; and a data mining and statistics library from Fuzzy Logix for use in conjunction with MapReduce.

Sybase is also now offering an Express edition of IQ, which can be used indefinitely, but for development purposes only and with a 5GB database size limit.

One might think that Sybase IQ lies in an increasingly uncertain position, despite the rash of recent improvements, given SAP's unrelenting hype of late around HANA, its homegrown in-memory computing engine that so far has been targeted primarily at analytic workloads.

But the two databases can play complementary roles, said Joydeep Das, director of analytics product management.

HANA's blazing performance makes it a good fit for the real-time analysis of operational data from SAP applications, he said. As for IQ, "if you have very large enterprise data warehouse with growing numbers of users and data, Sybase IQ is a very good fit because of the scalability and speed."

Companies could also analyze data in HANA and use Sybase as "near-line" storage for that information once they're finished, he added.

Overall, a customer's database environment could be compared to a wedding cake, he added. In this scenario, Hadoop would sit at the widest bottom layer, crunching large data pools to find the "needles in the haystack," or actual information of value. Those needles would then be fed into IQ for further analysis, and potentially to an even faster system like HANA, he said.

"We're saying there's a place for both, and let customers use them where they make sense."

One expert took a measured view of Sybase's announcement.

"There's a lot of goodness here, but it's just not enough. There's nothing terribly differentiating about any of this," said Forrester Research analyst James Kobielus. Other vendors such as EMC's Greenplum division are further along the trail, according to Kobielus.

That said, Sybase will undoubtedly build on these initial features later on, he added.

Missing pieces include Sybase's own distribution of Hadoop, advanced MapReduce modeling tools or support for Hadoop sub-projects like the Pig programming framework, he said.

A Hadoop distribution would give Sybase "greater control over the full stack in terms of performance."

Overall, Hadoop and MapReduce don't necessarily address anything you can't also do now with a high-end, massively parallel data warehousing system, but "it's on a roll in terms of adoption" with "a vibrant ecosystem of vendors," Kobielus said. "It's getting to the point where it's the Linux of Big Data."

Chris Kanaracus covers enterprise software and general technology breaking news for The IDG News Service. Chris's e-mail address is Chris_Kanaracus@idg.com

Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.

More about: EMC, Forrester Research, IDG, James Kobielus, Linux, SAP, Sybase
References show all

Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
Users posting comments agree to the CIO comments policy.
Login or register to link comments to your user profile, or you may also post a comment without being logged in.
Related Coverage
Related Whitepapers
Latest Stories
Community Comments
Tags: applications, business intelligence, databases, data warehousing, EMC, greenplum, IBM, open source, sap, software, sybase
Latest Blog Posts
Whitepapers
  • The State of Data Security
    Recognize how your data can become vulnerable, including the latest issues stemming from unprotected data on mobile devices and social media sites. Understand the compliance issues involved, and identify data protection strategies you can use to keep your company’s information both safe and compliant.
    Learn more »
  • Staying Secure and Preventing Data Leaks in a Cloud-obsessed World
    If your organisation is to benefit from this explosive growth, it needs to be able to exploit all that the cloud has to offer. But at the same time, it is vital to protect your company’s employees, networks, data and reputation from the risks that exist in the cloud.
    Learn more »
  • Providing effective endpoint management at the lowest total cost
    Endpoints, otherwise known as servers, workstations, laptops, mobile devices, and virtually any other network-connected device, are critical components that enable business to be transacted. Properly implemented, endpoint management ensures continuous compliance with IT policies, regardless of where the machines are located and what type of network they are connected to.
    Learn more »
All whitepapers
rhs_login_lockGet exclusive access to Invitation only events CIO, reports & analysis.
Recent comments