Big Data is one of the hottest topics trending today. And it is growing day by day. The demand of Data Scientists (Big Data experts) is very high. Mostly people say it requires a Ph.D degree but according to me, it is good but not necessary one. One who wants to enter into the field of Data Science must have knowledge about different fields.
- Strong Mathematics background
- Statistics
- Programming
- Computer knowledge using Excel etc.
Now, what actually is Big Data ….?
Many different people and companies define it differently. IBM defines it as -
“Big data is the data characterized by 3 attributes: volume, variety and velocity.”
Oracle defines it as -
“Big data is the data characterized by 4 key attributes: volume, variety, velocity and
value.”
Google dictionary defines it as -
“extremely large data sets that may be analysed computationally to reveal patterns, trends, and associations, especially relating to human behaviour and interactions.”
But, I will define it differently.
Think 15 years back, when we were using computers. What was Data at that time? It was in bytes or kilobytes mostly. Then, we moved to desktops and data increased to Megabytes and Gigabytes. Then, the World Wide Web got introduced and data increased exponentially, and reached to Terabytes and Petabytes. The following image demonstrates it:
Now today, big companies like Microsoft, Google, eBay, Amazon, and Netflix all have a lot of data about users. The whole data in the world has doubled in just last two years. Today, data is available in Exabytes and Zettabytes. The following image may clear the concept.
This is known as the Big Data that is stored in top big databases in the world. Every company is saving its customers' data on servers for many useful purposes.
Why Big Data?
Now, the biggest question here is why companies rely on Big Data and what are its main advantages.
The answer of this question is straight forward. Companies use this data for many purposes, main purpose being understanding the customer behaviour - their liking, their understandings etc. Moreover, it is used for predictions and more. Following is the description of some giant companies who are using best use of Big Data.
- Google
Google is using Big Data for advertisements. From previous knowledge, they understand the user's likes, shopping behavior, types of ads a user clicks most often, and then they post ads on users' pages that best suit the users' behavior.
- Amazon
Amazon is best known for its online selling and purchasing. If someone buys some book or CD/DVD from Amazon, they save customer's data and then from previous data, they predict the customer's interests followed by recommending products geared to their interests.
- Netflix
Netflix also predicts users' interests and behaviors by previous data and then they recommend movies and create interesting content as per the reorded pattern.
- Airbnb
Airbnb also uses Big Data. They store a large amount of customer data and then, recommend to them the perfect price.
Big.... means Big
From the above discussion, we can conclude that these companies understand customers by their previous data. So, Big means - a large amount of data is needed to perform these useful operations. The above mentioned companies and many more in the world have turned their businesses into multi-million dollar ones by using Big Data Analytics.