When it comes to big data projects, the biggest challenge most organizations face, regardless of size, is staffing, says Peter Guerra, a principal in Booz Allen Hamilton's Strategic Innovation Group.
Organizations struggle to find trained data scientists, or to find the money to retrain staff, Guerra says. To address that need, Booz Allen Hamilton released Explore Data Science last week at O'Reilly Strata Conference + Hadoop World in New York City.
"We have seen the rate of adoption of big data really start to skyrocket in the past couple of years," Guerra says. "It's gone from clients who are wondering whether they should pay attention to this or not, to 'How do I move forward? What's the right team? What's the right set of technology?'"
Build Core Team to Address Big Data Challenges
Many CIOs either have big data pilots going today or will shortly, Guerra says. "As they move to production deployments, we have seen some gaps," he adds. "A lot of CIOs get inundated with messages from vendors: 'If you buy my solution, everything will be fine.' But there's no magic bullet. We see more and more CIOs who are willing to piece together a set of technologies so they can grow and expand over time."
Guerra says building out a big data infrastructure from what has become a bewildering array of open source technologies and tools, as well as solutions from proprietary vendors, is no easy task. Despite representing a major consulting firm, Guerra strongly recommends that organizations establish an internal team with expertise in the big data technologies.
"We have a lot of great people that you can buy by the hour, and we'll come in and give you our expertise, but what we always encourage our clients to ... establish their own team," he says. "Hire it in and focus on having a core set of engineers. What we can then do is come in and share our lessons learned. At the end of the day, the best thing for a lot of our clients is to have that core team that they can then augment with consultants as needed."
[ How-to: Close the Big Data Skills Gap by Training Your IT Staff ]
That's just the beginning, Guerra says once you have the infrastructure in place, you need people who can take that data and transform it into insight. "We haven't seen a lot of training focused on getting the data out. How do you extract insight, not just BI?"
From Statistics to Data Science, With the Help of Gamification
Guerra says the browser-based Explore Data Science training program provides anyone who's taken a high school statistics class with hands-on experience in data science techniques such as the following:
- Distance metrics
- Dimensionality reduction
- Genetic algorithms
The self-paced course offers more than 40 hours of content featuring galactic-themed gamification elements that teach data science principles while guiding students through increasingly advanced, scenario-based challenges.
Students face 32 distinct "missions" in which they code in Python to transform raw information into business insights. When they complete a level, they earn points, awards and badges that they can share on social media. Organizations can also deploy leaderboards that show participants' progress through the missions and their efficiency at resolving them (based on things such as how many hints they had to access to solve the problem). At the end, participants receive a certificate signifying their completion of the course.
"We initially created this training internally," Guerra says. Clients who saw it asked that it be made more broadly available.
[ More: Data Scientist Role Shifting to Focus on Developers ]
"This way of learning complex techniques is one that's well suited for these types of problems," Guerra says. "We really need to narrow the skills gap, and we think this will help build a base of data scientists that can then be deployed into the enterprise."
He adds: "The goal of this is not to make piles of money. It's to educate a workforce. We see the coming growth of analytics, and workers have to be prepared for that."
Explore Data Science is available in both individual and group pricing. Guerra notes that Booz Allen Hamilton is starting to partner with academic institutions on the learning front; he expects the offering to expand over time.