Sunday, October 25, 2015

What's the Big Data?

We have been hearing a great deal about Big Data in the last few years. What is this hype about? Why the hype? The hype says that with more data we can gather the better answers to business problems. My first exposure to Big Data was at the University of Arizona’s 2014 Symposium on Data Analytics in Healthcare held on October 17, 2014. At the time, my team of healthcare management consultants for the University of Arizona Medical Center, decided to attend this symposium to understand the hype and how Big Data is helping the healthcare sector. I am sure, that we have all read that the healthcare industry is under pressure to reduce the cost of care, while improve patient care. The various technologies such as Electronic Health Records (EHR), social media, cellular applications, etc. are pressuring healthcare to dive into the Big Data treasure hunt in order to find ways to improve patient care and reduce overall costs. The speakers at the symposium discussed various topics, projects, and research related to social media (such as twitter) and cellular applications to monitor and improve the healthcare. Nonetheless to say, the symposium was great exposure to various aspects and use of Big Data that sparked an interest in me.   

So the question is what and how Big is Big Data? Big data is an evolving term that describes a massive volume of data that is difficult to process with traditional software and database techniques. We live in a world where technology is rapidly evolving. In return, we are sitting on a big data bomb, with an expected volume exceeding 35 Zettabytes by 2020. I am sure you are wondering, well how much is Zettabytes (see picture below for a hierarchical explanation of Zettabyte)? Ok, so think about a cup of coffee (1 Gigabyte) and compare it to the Great Wall of China – have the picture? Well that is a visual explanation of a Zettabyte.

As part of the Business Intelligence lecture and readings for Week 1, I was mind-blown about the amount of data being generated in 60 seconds:
·        More than 100,000 tweets
·        More than 400,000 Skype calls
·        More than 80,000 posts on Facebook
·        200,000 e-mails sent         
·        700,000 searches on Google

Can you believe it? Are you as shocked as I was? Isn’t this miraculous!? In 2010, Eric Schmidt from Google expressed “There were 5 exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days.”  

So based on my blog so far, you are tempted to understand big data merely in terms of size but of course, you would be misled. In addition to volume, big data is characterized by volume and its ability to transform into data many aspects of the world that were previously unimaginable, such as GPS signals from cell phones, likes on Facebook, messages and images posted on social networks, readings from sensors, such as vending machines, car seats, jet engines, etc. Las but not least, the speed of data creation is another important aspect of Big Data. Real-time data makes it possible for businesses to understand their position in the market at the exact moment and become more agile than their competitors. In simplest terms, because of big data businesses can now measure, and understand a lot more about how they operate, and translate all that knowledge into improved decision-making and performance.

So, how can we deal with the variety, volume and velocity of Big Data? The answer is Business Intelligence BI). BI is the applications, technologies, tools and techniques that are used to gather and analyze data in order to provide businesses with actionable insights on measuring and managing their performance. Important thing worth noting is that BI is based on data and data needs to be accurate in order to get the benefit of it. There are numerous activities embedded within BI, such as analytics, data warehousing, data collection and processing, data mining, reporting and querying software, digital dashboards. As, Peter Sondergaard from Gartner Research said, “Information is the oil of the 21st century, and analytics is the combustion engine.”
Big data and BI are used in many industries, from manufacturing, to healthcare, from law enforcement to environment, from traffic control to fraud prevention, etc. The exponential growth of data will require need for BI related jobs. Based on a recent report from McKinsey, United States will require about 200,000 data scientists (an individual that can combine analytical, technical, quantitative and business skills) and 1.5 million data savvy managers.

In closing, if we think Big Data is big now, we just have to wait …


References

Cukier, K. N., & Mayer-Schoenberger, V. (2013). The Rise of Big Data. Foreign Affairs.
McAfee, A., & Brynjolfsson, E. (2012). Big Data: The management revolution. Harvard Business Review.
Miller, R. (2014, Aug 10). If You Think Big Data's Big Now, Just Wait. Retrieved from Tech Crunch: http://www.techcrunch.com/2014/10/big-data-bound-to-get-really-really-big-with-the-internet-of-things
Ram, S. (2015). Intro to Big Data and Business Intelligence. Module 1

No comments:

Post a Comment