The unprecedented growth in the volumes and complexity of business information is ushering in the era of “Big Data”. The phenomenon is not a "thing" but instead a strategic approach that crosses many IT borders and so poses one of the biggest challenges currently facing CIOs from companies of all sizes.
IDC defines the Big Data world as: “A new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis. Big Data is a horizontal cross-section of the digital universe and can include transactional data, warehoused data, metadata, and other data residing in ridiculously large files.”
As the amount of data held in these ridiculously large files continues to explode, the process of analysing large data sets will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, according to research by MGI and McKinsey's Business Technology Office.
“Leaders in every sector will have to grapple with the implications of Big Data, not just a few data-oriented managers. The increasing volume and detail of information captured by enterprises, the rise of multimedia, social media, and the Internet of Things will fuel exponential growth in data for the foreseeable future,” the McKinsey report Big data: The next frontier for innovation, competition, and productivity, predicted.
The scale of this “exponential growth” in data is staggering. The fifth annual study of the digital universe by analyst company IDC revealed that in 2011, the amount of information created and replicated surpassed 1.8 zettabytes (1.8 trillion gigabytes) - growing by a factor of nine in just five years.
“Like our physical universe, the digital universe is something to behold — 1.8 trillion gigabytes in 500 quadrillion "files" — and more than doubling every two years. That's nearly as many bits of information in the digital universe as stars in our physical universe,” said IDC senior vice president John Gantz.
This growth in business data is due to multiple factors, with the main culprits being the falling cost of data storage hardware, the increasing density of storage, increased digitisation of business processes together with the growing prevalence of storage-intensive data such as video and machine-generated log files.
The net result of this data explosion is, according to Andrew Buss, service director at Freeform Dynamics, that many organisations, large and small, are becoming increasingly overwhelmed by storage – by the amount they have to store, by getting it to the systems that matter, and by the huge variety of the types of data being held. He pointed out that, while many companies may not think of it as such, this is very much a “Big Data” problem, whatever the scale.
“Many of you may think that Big Data is yet another hyped-up fad that will pass with time or impact only the largest or most demanding organisations with massive amounts of data. However, the reality is that it is a question of scale and the ability to deal with it in terms of investment capability, existing infrastructure and skills,” Buss said.
“Companies of all sizes are feeling the effects of the data explosion - a mid-size organisation may struggle to deal with a few tens of terabytes of data, while this may be thought of as relatively simple by a global company regularly dealing with petabytes or even exabytes themselves.”
And it is not just the volumes of data that pose a management headache for CIOs – it is the changing nature of the underlying file types. According to Clive Longbottom, service director at analyst company Quocirca, the biggest problem comes from the fact that the type of data that businesses need to manage and interpret is changing.
“When it was rows and columns of figures held in a standard database, life was – relatively – simple. It all came down to the speed of the database and the hardware it was running on. Now, more and more binary large objects reappearing in the databases – and these require different approaches to be able to identify and report on what the content actually is and in identifying patterns and making sense out of what this means to the end user,” Longbottom said.
“Even worse is the fact that less information is making it into standard databases – yes, there’s still an increasing amount of numerical and textual data being created that resides within a database, but this is being outstripped by the amount of information that is being created in a more ad hoc manner with files that lie directly in a filing system.”
Throwing more storage at the problem or improving the performance of existing storage systems will not solve the problem, Freeform Dynamic’s Buss warned. “Merely investing in bigger and faster storage arrays, multi-gigabit networks and lots of flash cache may alleviate the symptoms – but like putting lipstick on a pig – the underlying issues remain. Dealing with the problem of Big Data requires a change in mind-set from trying to manage storage systems to trying to manage information.”
It is apparent that, in the face of this assault, traditional storage and management technologies are simply no longer able to cope. Forrester notes that, at extreme scale, traditional data management and business intelligence (BI) become “impractical”. This results in organisations losing control of their data and failing to harness it effectively to gain insight that could be used to drive greater business performance.
Brian Hopkins, principal analyst at Forrester, explained that Big Data helps firms work with extremes to deliver value from data cost-effectively. However he cautioned that CIOs must understand that the world of Big Data is not business as usual: “In fact, Big Data will disrupt the data management landscape by changing fundamental notions about data governance and IT delivery. Take the time to understand Big Data as well as its implications and begin a balanced approach that considers more than just the technology hype,” Hopkins said.
As firms seek value from digital and analogue sensors, social media, financial systems, emails, surveys, and customer call centres — to name a few — new tidal waves swell on the horizon. All of this data promises to open up new frontiers of business opportunity — if we can figure out how to turn it into insight. Business-as-usual BI and IT delivery approaches, however, will not generate the future results your business expects.”
For companies that can tame this Big Data tiger, the rewards can be substantial. A December 2011 poll of 63 companies with at least 25 terabytes of active business data conducted by Aberdeen Group found that leading companies – the top 50% in terms of data performance – were able to meet their user demand for information more than four out of every five times. Such companies also had a data infrastructure the enabled employees to spend half as much time looking for information, and access almost three times as much business data as followers.
McKinsey predicts that leveraging Big Data successfully will underpin new waves of productivity and growth for companies of all sizes across all key verticals. For example, it estimates that a retailer using Big Data to the full has the potential to increase its operating margin by more than 60%.
In order to begin to address these issues McKinsey advises that multiple factors will have to be addressed. Policies related to privacy, security, intellectual property, and even liability will need to be modified up in a Big Data world. Looking across technology, people and processes, corporates will need to put the right talent and infrastructure in place but also shape workflows and incentive packages to optimise the use of Big Data.
The value of such a holistic approach was echoed by Forrester. In the analyst company’s June 2011 Global Big Data Online Survey, 70% of respondents said that Big Data is or will be a collaborative effort between business and IT. The poll revealed Big Data teams of skilled technicians are working increasingly closely with business executives to discover new uses for more data. In order to succeed in this brave new world of Big Data CIOs must, according to Forrester partner with business peers to identify opportunities and solutions.
“While business executives at some firms immediately get it, we expect the concepts of Big Data to be foreign to most. CIOs who understand the state of the art and are perceived as strategic partners will be well positioned to help executives justify investments and reap new benefits. Conversely, CIOs without a seat at the business strategy table may struggle,” the Forrester survey warned.
Forrester also advises CIOs to turn over part or all of Big Data solutions delivery to business leaders. A recurring theme in its research was the level of business involvement in Big Data solutions. CIOs are also warned to be ready to support rapid growth as commonly firms beginning a Big Data solution as a pilot with a few terabytes of planned growth, were found to often find themselves with more than a petabyte very quickly.
According to Gartner, enterprise architecture (EA) practitioners have a major role in ensuring their organisations maximise the business opportunities posed by Big Data. The analyst group notes that Big Data can makes organisations more productive by enabling people to derive value from the increasingly diverse data sources, and thus to identify previously unseen opportunities.
“Traditional approaches to EA are significantly impacted by Big Data,” said David Newman, research vice president at Gartner. “For the EA practitioner, the balance shifts from a focus on optimisation and standardisation within the organisation, to lightweight approaches that focus on harmonisation and externalisation across the broader enterprise ecosystem. Big Data disrupts traditional information architectures — from a focus on data warehousing (data storage and compression) toward data pooling (flows, links, and information shareability).”
Newman added that careful planning must be undertaken to determine the best tools and techniques for analysing complex datasets. These include skills in statistics, machine learning, natural-language processing and predictive modelling. In addition, practitioners must help teams understand how to use Big Data visualisations techniques, such as tag clouds, clustergrams, history flows, animations and infographics. Teams should use low-cost, open source tools in early pilots to demonstrate the feasibility of Big Data projects.
Instead of “Big Data” Quocirca suggests that CIOs aim for an “unbounded data” strategy – the capability to pull together data and information from a range of disparate sources and to make sense of it in a way that an end user needs.
Quocirca recommends that organisations seek data management solutions that can deal with different data types, including text, image, video and sound – from both within and outside of and organisation’s environment. According to the analyst company, it is essential that a successful approach uses meta data and pointers to minimise data replication and redundancy, and not simply create a new, large data warehouse that will only make the existing problem worse. Such a system should be able to present findings back to business decision makers –not only based on what has already happened - but also incorporate the capacity predict with some degree of certainty what may happen in the future. The system should also be inherently fault tolerant and flexible enough so that more resources can be apply easily to the system as required to cope with future demand.
It is clear that the brave new world of disruptive Big Data has the potential to give CIOs big headaches, but IDC’s Gantz remains upbeat. In fact, he argues that Big Data is putting CIOs on the threshold of a major period of exploration of the digital universe.
“This period of 'space exploration' of the digital universe will not be without its challenges. But for the 'astronauts' involved - CIOs and their staff - it represents a unique, perhaps once-in-a-career opportunity to drive growth for their enterprises,” Gantz said.
“They will need to lead the enterprise in the adoption of new information-taming technologies, best practices for leveraging and extracting value from data, and the creation of new roles and organisational design. Each step will require organisational change, not just a few new computers or more software. The success of many enterprises in the coming years will be determined by how successful CIOs are in driving the required enterprise-wide adjustment to the new realities of the digital universe.”