BIGDATA & HADOOP

BIGDATA AND HADOOP


Introduction to Hadoop
o Bigdata overview
o Bigdata Solutions
o History of Hadoop
o Hadoop Introduction
o Hadoop Ecosystem

HDFS
o What is HDFS?
o HDFS Architecture
o HDFS – Read/Write
o Daemons - Name node and Data node
o Data Replication
o HDFS commands
o Rack awareness
o Configuration Files

HDFS
o HDFS commands execution

Map Reduce
o What is MapReduce?
o Basic Flow of MapReduce
o Daemons – JobTracker and TaskTracker
o Combiner
o Partitioner
o Sample MapReduce program (Wordcount)
o Input and Output Formats

Pig
o Why Pig?
o What is Pig?
o Pig execution Modes
o Basic Pig Commands
o Pig operators
o UDF in Pig
o Joins in Pig

Hive
o What is Hive?
o Hive Tables
o Hive Datatypes
o Partitioning in Hive
o Bucketing in Hive
o Joins in Hive
o UDF in Hive