Skip to main content

Posts

Showing posts with the label big data book pdf

Hadoop Training Material

Introduction to BigData, Hadoop :-  Big Data Introduction  Hadoop Introduction  What is Hadoop? Why Hadoop?  Hadoop History?  Different types of Components in Hadoop?  HDFS, MapReduce, PIG, Hive, SQOOP, HBASE, OOZIE, Flume, Zookeeper and so on…  What is the scope of Hadoop? Deep Drive in HDFS (for Storing the Data) :-  Introduction of HDFS  HDFS Design  HDFS role in Hadoop  Features of HDFS  Daemons of Hadoop and its functionality o Name Node o Secondary Name Node o Job Tracker o Data Node o Task Tracker  Anatomy of File Wright  Anatomy of File Read  Network Topology o Nodes o Racks o Data Center  Parallel Copying using DistCp  Basic Configuration for HDFS  Data Organization o Blocks and o Replication  Rack Awareness  Heartbeat Signal  How to Store the Data into HDFS  How to Read the Data from HDFS  Accessing HDFS (Introduction of Basic UNIX commands)  CLI commands MapReduce using Java (Processing the Data):-  The