Hadoop Administration: An easy way to become a Hadoop Admin

Hadoop Administration: Online Training for Beginners to Professionals

Ratings: 3.82 / 5.00




Description

Module 0: Giveaways

· Linux / UNIX Course

· 100 Solved Queries on Day-to-Day Hadoop Administration Activities.

· Guidelines to create an AWS account.


Module 1: Introduction to Hadoop Administration

· Understanding Big Data

· Common big data domain scenarios

· Analyze Limitations of Traditional Solutions

· Roles and Responsibilities

· Case Studies


Module 2: Hadoop Architecture and MapReduce

· Introduction to Hadoop

· Hadoop Architecture

· Difference between Hadoop 1.x, Hadoop 2.x and Hadoop 3.x

· Hadoop 1.x Ecosystem tools and Core System

· Hadoop 2.x Ecosystem tools and Core System

· HDFS File System

o Introduction to NameNode, DataNode and Secondary NameNode

o Anatomy of Write and Read

o Replication Pipeline

· YARN Framework

o Role and function of YARN in Hadoop

o MapReduce Theory

§ Cluster Testing Using MapReduce Code in a YARN Environment


Module 3: Cluster Planning

· Types of Racks

· General Principles of Selecting CPU, Memory and Hardware

· Understand Hardware Considerations

· Machine Requirements per Daemon

· Learn Best Practices for Selecting Hardware

· Know the Network Considerations


Module 4: Hadoop Cluster Administration, Backup, Recovery and Maintenance

· SafeMode

· Decommissioning, Commissioning and Re-Commissioning of Nodes

· Trash Functionality

· DistCp

· Rack Awareness

· HDFS / Hadoop Balancer


Module 5: Managing Resources and Scheduling

· Scheduler: Explanation and demo

o Capacity Scheduler


Module 6: HDFS Federation and High Availability

· Understand the YARN framework

· Understand Federation

· Understand High Availability

· High Availability Implementation Using Quorum Journal Manager


Module 7: Cloudera Setup and Performance Tuning

· Cloudera Distribution of Hadoop (CDH)

· Cloudera Features

· Cloudera Manager Editions

· Cloudera Manager Web UI

· CDH Installation


Module 8: Security

· Basics of Hadoop Platform Security

· Securing the Platform

· Understand Kerberos

· Configuring Kerberos on Cloudera Hadoop Cluster using LDAP authentication

What You Will Learn!

  • Create a single-node Hadoop cluster on VMware.
  • Create a multi-node Hadoop cluster on the AWS platform and learn how to submit a job to it.
  • Learn to plan a Hadoop cluster.
  • Learn to commission, decommission and recommission machines.
  • Learn to back up a cluster using the DistCp command, and to recover and maintain a Hadoop cluster.
  • Learn how to enable the Capacity Scheduler on a Hadoop cluster.
  • Enable NameNode High Availability on a Hadoop cluster.
  • Learn to install Hadoop using Cloudera Manager and perform other administrative activities.
  • Enable Kerberos security on a Cloudera Hadoop cluster using an LDAP connection to Active Directory.
  • Learn how to monitor a Hadoop cluster.

Who Should Attend!

  • Linux / Unix administrators, data analysts and database administrators who are curious about Hadoop administration and how it relates to their work.
  • Hadoop developers and Java developers who want to become Hadoop administrators.
  • Software engineers and programmers who want to understand the administration of the larger Hadoop ecosystem.