Wednesday, 5 March 2014

Big data - Hadoop training Course Outline

This course has been designed to cover all type of audiences spanning from Architect, Administrator to developer. 

In case of any question regarding duration/fees/schedule , do call me @ 9840014739


Module 1
Big data Getting Started
What is Big Data?
What is Big Data Analytics ?
What  is Apache Hadoop ?
History of Hadoop
Understanding distributed file systems and Hadoop
Hadoop eco system components
Hadoop use cases
Ubuntu Installation
JDK Installation
Module 2
Hadoop Distributed File system

Eclipse Installation
Overview of HDFS
Communication Protocols
Rack Awareness
Hadoop cluster Topology
Setting up SSH for Hadoop Cluster
Running Hadoop –
          Pseudo-distributed mode
Linux basic commands
HDFS file commands
Reading and writing to HDFS programmatically
Module 3
MapReduce Framework

Java Basics
Anatomy of a MapReduce Program
Writables
InputFormat
OutputFormat
Streaming API
Inherent failure handling
Reading and writing
Module 4
Advanced MapReduce  Programming
Input splits, Record Reader, Mapper, Partition & Shuffle, Reduce, OutputFormat
Writing MapReduce program
Streaming in Hadoop
Counters
Performance Tuning
Joins
Sorting
Module  5
Apache Hadoop Administration   
Best Practices for Hadoop setup and infrastructure

Hadoop cluster Installation preparation
   Ø Cluster network design
   Ø  Installation of Linux operating system
   Ø  Configuring SSH
   Ø  Walkthrough on Rack topology and set up

Managing Hadoop cluster
   Ø  HDFS cluster management
   Ø  Secondary Name node configuration
   Ø  Task Tracker management
   Ø  Configuring the HDFS quota
   Ø  Configuring Fair Scheduler      
   Ø  Upgrading Hadoop     
   Ø  Deploying and managing Hadoop
         clusters with Ambari

Monitoring Hadoop cluster
   Ø  Monitoring Hadoop cluster with
        Ganglia
   Ø  Monitoring Hadoop cluster with 
             Ambari
   Ø  Monitoring Hadoop cluster with Nagia

Hadoop Cluster Performance Tuning
   Ø  Benchmarking and profiling
   Ø  Using compression for input and 
             output
   Ø  Configuring optimal map and reduce
        slots  for the TT
   Ø  Fine tuning Job Tracker config
   Ø  Fine tuning Task Tracker config
   Ø  Tuning Shuffle, merge and sort
             parameters

Security Implementation
              Kerberos security mplementation      
Workflow Scheduler
              Capacity Scheduler
               Fair Scheduler   

dfsadmin & mradmin commands

Administration of Hcatalog and Hive

Backup and Recovery
Scenario based exercises
-          Data node failure & Recovery
-          Name Node Failure & Recovery
-          JT & TT failure  & Recovery
-          Removing data nodes
-          Adding Data nodes


Module 6
Pig and Pig Latin
Installation and configuration
Running Pig Lating through grunt
Writing programs
-          Filter , Load & Store functions
Writing user defined functions

Working with Scripts
Lab Exercises
Module 7
HBase and ZooKeeper
NoSQL Vs SQL
Cap  Theorem
Architecture
Installation
Configuration
Java API
MR integration
Performance Tuning
Lab Exercises
Module 8
Hive
Features of Hive
Architecture
Installation and configuration
HiveQL

Lab Exercises
Module 9
Other Hadoop eco system components
Overview of Ambari, Oozie ,Mahout
Installing & configuring Sqoop, mysql-server
Installing & configuring flume

Lab Exercises


http://big-data-training-in-chennai.blogspot.in/

20 comments:

  1. This comment has been removed by a blog administrator.

    ReplyDelete
  2. This comment has been removed by a blog administrator.

    ReplyDelete
  3. This comment has been removed by a blog administrator.

    ReplyDelete
  4. This comment has been removed by a blog administrator.

    ReplyDelete
  5. This comment has been removed by a blog administrator.

    ReplyDelete
  6. This comment has been removed by a blog administrator.

    ReplyDelete
  7. This comment has been removed by the author.

    ReplyDelete
  8. This comment has been removed by the author.

    ReplyDelete
  9. Informative post indeed, I’ve being in and out reading posts regularly and I see alot of engaging people sharing things and majority of the shared information is very valuable and so, here’s my fine read.
    click here chat
    click here to check results
    click here to check marks
    click here to change your password
    click here to claim your ecard

    ReplyDelete
  10. Nice article i was really impressed by seeing this article about big data, it was very interesting and it is very useful for me.Thanks for sharing this wonderful content.its very useful to us.I gained many unknown information, the way you have clearly explained is really fantastic.
    DevOps Training in Chennai

    DevOps Online Training in Chennai

    DevOps Training in Bangalore

    DevOps Training in Hyderabad

    DevOps Training in Coimbatore

    DevOps Training

    DevOps Online Training

    ReplyDelete
  11. Why Did Google Choose Kotlin To Be An Official Supported Language in Android. ... Kotlin is an object-oriented language, statically typed and fully interoperable with Java. Kotlin offers some amazing features that support the growth in its adoption by the developers over Java.thanks!!

    Android Training in Chennai

    Android Online Training in Chennai

    Android Training in Bangalore

    Android Training in Hyderabad

    Android Training in Coimbatore

    Android Training

    Android Online Training


    ReplyDelete
  12. Wow! Fantastic article man! Much obliged to you, However I am experiencing issues with your RSS. I don't comprehend the motivation behind why I can't go along with it. Is there any other person having a similar RSS issues? Anyone who realizes the appropriate response will you generously react? Much appreciated!!tech updates

    ReplyDelete
  13. Aw, this was a decent post. Taking the time and genuine exertion to create a brilliant article… yet what would i be able to say… I tarry a ton and never figure out how to complete anything. tech updates

    ReplyDelete