Introduction to Big Data for Beginners

What is Big Data?


Where does Big Data come from?

Data comes from everywhere. For example:
  1. In the last 10-15 minutes on Facebook, millions of links, event invites, friend requests, photos and comments were shared or uploaded
  2. Terabytes of data were generated through Twitter feeds in the last few hours
  3. Consumer product companies and retail organizations monitor social media such as Facebook and Twitter to get an unprecedented view into customer behaviour, preferences and product perception
  4. GPS data from mobile devices
  5. Weblogs, email text and email attachments
  6. Sensors used to gather climate information
  7. Posts to social media sites
  8. Purchase transaction records, and much more

All these together constitute Big Data.

Big Data contains both Structured and Unstructured Data


A large collection of structured and unstructured data that can be captured, stored, aggregated, analyzed and communicated to make better business decisions is called Big Data.

Big Data is Growing Fast


3Vs (volume, variety and velocity) defining Big Data


Volume refers to the amount of data


The amount of available data is growing exponentially. A text file is a few kilobytes, a sound file is a few megabytes, and a full-length movie is a few gigabytes.
More sources of data are being added continuously. It is now common for enterprises to have storage systems holding terabytes or petabytes. As the data grows, the applications and architecture built to support it need to be changed quite often.
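
To get a feel for the jump from kilobytes to petabytes, the units can be compared with a few lines of Python. This is only a rough sketch: it assumes decimal (SI) units, and the 4 GB movie and 5 KB text file are illustrative stand-ins for "a few gigabytes" and "a few kilobytes", not measured values.

```python
# Decimal (SI) storage units expressed in bytes.
UNITS = {"KB": 10**3, "MB": 10**6, "GB": 10**9, "TB": 10**12, "PB": 10**15}


def to_bytes(value, unit):
    """Convert a size such as (4, "GB") to a number of bytes."""
    return value * UNITS[unit]


def how_many_fit(item_size, item_unit, store_size, store_unit):
    """Return how many items of the given size fit in the given storage."""
    return to_bytes(store_size, store_unit) // to_bytes(item_size, item_unit)


# Assumed sizes: a 4 GB movie and a 5 KB text file.
movies_per_tb = how_many_fit(4, "GB", 1, "TB")
movies_per_pb = how_many_fit(4, "GB", 1, "PB")
text_files_per_pb = how_many_fit(5, "KB", 1, "PB")

print(movies_per_tb)      # 250
print(movies_per_pb)      # 250000
print(text_files_per_pb)  # 200000000000
```

Each step up the unit ladder multiplies capacity by a thousand, which is why a storage design that is comfortable at terabyte scale may need rethinking at petabyte scale.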

Velocity refers to the speed of data processing


Data growth and the social media explosion have changed how we look at data. Initially, companies analyzed data using batch processing: take a chunk of data, submit a job to the server and wait for the output. That process works when the incoming data rate is low.

With new sources of data such as social and mobile applications, the batch process breaks down. Today people rely on social media to keep them updated on the latest happenings. On social media, a message (a tweet, a status update, etc.) that is only a few seconds old may already be of no interest to users. They often discard old messages and pay attention to recent updates. Data movement is now almost real time, and the update window has shrunk to fractions of a second.
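
The difference between the two styles can be sketched in Python. This is a toy illustration, not how any particular product works: the `batch_count` function, the `RecentWindow` class, the sample events and the 5-second window are all hypothetical, and the sketch assumes messages arrive in timestamp order.

```python
from collections import deque


def batch_count(messages):
    """Batch style: wait for the whole chunk, then process it in one job."""
    counts = {}
    for _, topic in messages:
        counts[topic] = counts.get(topic, 0) + 1
    return counts


class RecentWindow:
    """Streaming style: handle each message on arrival and keep only recent ones."""

    def __init__(self, max_age_seconds):
        self.max_age = max_age_seconds
        self.items = deque()

    def add(self, timestamp, topic):
        self.items.append((timestamp, topic))
        # Discard messages older than max_age relative to the newest one.
        while self.items and timestamp - self.items[0][0] > self.max_age:
            self.items.popleft()

    def counts(self):
        counts = {}
        for _, topic in self.items:
            counts[topic] = counts.get(topic, 0) + 1
        return counts


# (timestamp in seconds, topic) pairs; made-up sample data.
events = [(0, "sports"), (1, "news"), (2, "news"), (9, "sports"), (10, "music")]

batch_result = batch_count(events)

window = RecentWindow(max_age_seconds=5)
for ts, topic in events:
    window.add(ts, topic)
stream_result = window.counts()

print(batch_result)   # {'sports': 2, 'news': 2, 'music': 1}
print(stream_result)  # {'sports': 1, 'music': 1}
```

The batch result covers everything but is only available once the whole chunk has been collected; the window always reflects the last few seconds, which matches how users pay attention to recent updates and discard old ones.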

Variety refers to the number of types of data


Data has moved beyond Excel tables and databases: it has lost its fixed structure and now comes in hundreds of formats, such as pure text, photo, audio, video, web, GPS data, sensor data, relational databases, documents, SMS, PDF and Flash.
We no longer have control over the input data format. Structure can no longer be imposed, as it was in the past, to keep control over the analysis. As new applications are introduced, new data formats come to life. The real world has data in many different formats, and that is the challenge we need to overcome with Big Data.
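
One common way to cope with variety is to accept each format as it comes and convert it into a shared record shape before analysis. The Python sketch below shows the idea for three formats only. The `load` function, the `PARSERS` table, the format labels and the sample inputs are hypothetical names chosen for illustration, and falling back to raw text for unknown formats is an assumption of this sketch.

```python
import csv
import io
import json


def parse_csv(text):
    """Structured input: each row becomes a dict keyed by the header line."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text))]


def parse_json(text):
    """Semi-structured input: one JSON object or a list of objects."""
    data = json.loads(text)
    return data if isinstance(data, list) else [data]


def parse_text(text):
    """Unstructured input: keep each non-empty line as raw text."""
    return [{"text": line} for line in text.splitlines() if line.strip()]


# Hypothetical format labels mapped to parsers.
PARSERS = {"csv": parse_csv, "json": parse_json, "txt": parse_text}


def load(fmt, text):
    """Pick a parser by format label; unknown formats are kept as raw text."""
    parser = PARSERS.get(fmt, parse_text)
    return parser(text)


csv_records = load("csv", "user,amount\nasha,20\nravi,35\n")
json_records = load("json", '{"user": "asha", "lat": 12.97, "lon": 77.59}')
sms_records = load("sms", "Running late\nSee you at 6")

print(csv_records)
print(json_records)
print(sms_records)
```

Because the structure is applied when the data is read rather than when it is stored, a new format only needs a new entry in the parser table, and input in an unexpected format is still retained instead of being rejected.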




 © 2017 : saphanatutorial.com, All rights reserved.  Privacy Policy