NLA web services traffic rises above 2.5 billion in 2009
- 22 July, 2010 12:18
National Library of Australia assistant director-general of information technology, Mark Corbould
The National Library of Australia (NLA) has more than doubled its web services traffic to reach more than 2.5 billion hits in 2009.
Speaking at the CIO Summit 2010 in Sydney, NLA assistant director of general information technology, Mark Corbould, attributed a lot of the organisation’s success to the use of open source solutions.
In 2006 across all of its online assets including the public catalogue and the Trove search engine, the NLA had half a billion web server requests. This jumped in 2008 to over one billion and last year hit more than 2.5 billion.
“We are now probably the highest ranked cultural institution in Australia in terms of our web presence,” Corbould told the summit attendees.
In April the NLA unveiled its Trove search engine that was built on an open source platform.
The search engine provides access to more than 90 million items about Australians and Australia, sourced from more than 1000 libraries and cultural institutions across the country.
Courbold said for an organisation that costs roughly $70 million a year to run with 450 staff and 10 per cent of its budget going towards IT, there was little chance of securing a commercial software licence to do what the NLA does.
The Trove project’s team of five developers used SOLR 1.4, which internally uses Lucene 2.9, for the main bibliographic search database and the web page archive, and MySQL 5 for managing all data relationships.
“The overpowering characteristic was that it was the only one we could afford,” Courbold said.
The project team also opted for Jetty as a web server, Nginx as the HTTP front-end/reverse proxy, Java Server Pages (JSP) for the newspapers part of the site, and Restlet and FreeMarker for the remaining portions of the service.
Additionally, one of the main steps taken was to use Solid State Disks (SSDs) – four Intel X-M25 160GB drives in each machine – for the Lucene indices to achieve the necessary performance. Trove issues more than 8000 IOPS (input/output operations per second) to the SSDs, which the team says would be expensive to achieve with even the fastest SAN setup.
“A lot of what we do is not well supported by the mainstream market,” Courbold told the CIO summit.
“When you want to harvest the web or when you want to integrate access to varieties of collections we have the market place doesn’t service us well.”
(See the CIO Summit 2010 in pictures)
Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.
Five trends affecting legal CIOs
CIO Roundtable: The changing face of security
Bitcoin malware count soars as cryptocurrency value climbs
Bouncing Back From CIO Unemployment
Union slams latest fibre-to-premise trial in Tasmania
Protection Storage Architecture: The What, Why, and How
Traditional backup architectures lack the flexibility, agility, and scale to meet new data protection challenges and requirements. That’s where a Protection Storage Architecture comes in. This whitepaper details how transformational architecture enables backup teams to solve immediate tactical challenges, while helping to evolve IT teams.
APAC Digital Performance
With some of the highest levels of social media penetration, mobile device ownership, and Internet connectivity in the world, Asian markets are ripe for more innovative and adept interactive engagement. In this study, we look at how marketers in the region express high hopes for digital, but hare held back with limited budgets and a region-wide lack of talent and training. Click for more
The Total Cost of Ownership Benchmarking
This white paper provides business insight after an extensive analysis of Total Cost of Ownership (TCO) associated with IP and legacy TDM (Time Division Multiplexing) telephony systems in 236 different corporate environments worldwide.