Wednesday, 5 November 2014

Hadoop Benefits - Kapil Sharma

Hadoop Benefits:

Every day approx 3 exabytes (2.5×1018) of data were created globally.

Benefits: If analysed than they reveal patterns, trends, and associations, especially relating to human behaviour and interactions.

Low cost: Hadoop is open-source framework. So its free. 

Computing power: It has distributed computing model which quickly process very large data volumes. The more nodes(computing) you use the more processing power your hadoop setup going to have.

Scalability: Easily increase your hadoop system simply by adding more nodes. Less IT administration is required.

Storage flexibility: Hadoop includes unstructured data like text, images and videos in a single go. Users can store as much data as they want and decide how to use it later period.

Inherent data protection and self-healing capabilities: Data and application processing are protected against hardware failure. If a system(node) goes down, jobs are auto redirected to other systems to make sure the distributed computing does not leads failure. And it auto stores multiple data copies of input data.

What is Hadoop - Kapil Sharma

What is Hadoop?

Hadoop - is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity hardware. Essentially, it accomplishes two tasks: massive data storage and faster processing.

Currently three core components are included with your basic download from the Apache Software Foundation.


HDFS - the Java-based distributed file system that can store all kinds of data without prior organization.

MapReduce – a software programming model for processing large sets of data in parallel.

YARN – a resource management framework for scheduling and handling resource requests from distributed applications.

What is Big Data - Kapil Sharma

BigData: 

Extremely large data sets both structured and un-structured.

Four Vs of big data: volume, velocity, variety, variability and complexity.

Benefits

If analysed than they reveal patterns, trends, and associations, especially relating to human behaviour and interactions.

Every day approx 3 exabytes (2.5×1018) of data were created.

Try this cmd on google search: "?intitle:index.of?mp4 Oracle"