De-mystifying Big Data
Data have always played a critical role in business. The digital age has brought with it a quantum increase in the amount of data available to the modern organization. Retail giant Wal-Mart feeds more than 1m transactions an hour into databases estimated at more than 2.5 petabytes. Facebook’s 750m users create an average of 90 pieces of content each month. And an average of 294bn e-mails is sent every day.
What is big data?
Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few. This data is big data.
Big data is not a precise term; rather it’s a characterization of the never-ending accumulation of all kinds of data, most of it unstructured. It describes data sets that are growing exponentially and that are too large, too raw or too unstructured for analysis using relational database techniques. Whether terabytes or petabytes, the precise amount is less the issue than where the data ends up and how it is used. - EMC
Big Data refers to data-sets that grow so large that it is difficult to capture, store, manage, share, analyze and visualize with the typical database software tools – Oracle
Big Data Is About…
• Tapping into diverse data sets
• Finding and monetizing unknown relationships
• Data driven business decisions
Volume: Enterprises are awash with ever-growing data of all types, easily amassing terabytes—even petabytes—of information.
Velocity: Sometimes 2 minutes is too late. For time-sensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value.
Variety: Big data is any type of data - structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together.
Veracity: 1 in 3 business leaders don’t trust the information they use to make decisions. How can you act upon information if you don’t trust it? Establishing trust in big data presents a huge challenge as the variety and number of sources grows.
Prof. Siva Rama Krishnan
School of Information Technology
Vellore Institute of Technology
What is big data?
Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few. This data is big data.
Big data is not a precise term; rather it’s a characterization of the never-ending accumulation of all kinds of data, most of it unstructured. It describes data sets that are growing exponentially and that are too large, too raw or too unstructured for analysis using relational database techniques. Whether terabytes or petabytes, the precise amount is less the issue than where the data ends up and how it is used. - EMC
Big Data refers to data-sets that grow so large that it is difficult to capture, store, manage, share, analyze and visualize with the typical database software tools – Oracle
Big Data Is About…
• Tapping into diverse data sets
• Finding and monetizing unknown relationships
• Data driven business decisions
Volume: Enterprises are awash with ever-growing data of all types, easily amassing terabytes—even petabytes—of information.
Velocity: Sometimes 2 minutes is too late. For time-sensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value.
Variety: Big data is any type of data - structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together.
Veracity: 1 in 3 business leaders don’t trust the information they use to make decisions. How can you act upon information if you don’t trust it? Establishing trust in big data presents a huge challenge as the variety and number of sources grows.
Prof. Siva Rama Krishnan
School of Information Technology
Vellore Institute of Technology