Hadoop Administration: An easy way to become a Hadoop Admin
Hadoop Administration: Online Training for Beginners to Professionals
Description
Module 0: Giveaways
· Linux / UNIX Course
· 100 Solved Queries on Day-to-Day Hadoop Administration Activities.
· Guidelines to create an AWS account.
Module 1: Introduction to Hadoop Administration
· Understanding Big Data
· Common big data domain scenarios
· Analyze Limitations of Traditional Solutions
· Roles and Responsibilities
· Case Studies
Module 2: Hadoop Architecture and MapReduce
· Introduction to Hadoop
· Hadoop Architecture
· Difference between Hadoop 1.x, Hadoop 2.x and Hadoop 3.x
· Hadoop 1.x Ecosystem tools and Core System
· Hadoop 2.x Ecosystem tools and Core System
· HDFS File System
o Introduction to NameNode, DataNode and Secondary NameNode
o Anatomy of Write and Read
o Replication Pipeline
· YARN Framework
o Role and function of YARN in Hadoop
o MapReduce Theory
§ Cluster testing using MapReduce Code in YARN Environment
Module 3: Cluster Planning
· Types of Rack
· General Principles of Selecting CPU, Memory and Hardware
· Understand Hardware Considerations
· Machine Requirements per Daemon
· Learn Best Practices for Selecting Hardware
· Know the Network Considerations
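As a rough illustration of the kind of sizing arithmetic that cluster planning involves, here is a minimal sketch in Python. It is not taken from the course material: the function name, the 25% headroom for temporary and intermediate data, and the 48 TB of usable disk per DataNode are all assumptions chosen for the example. Only the replication factor of 3 reflects the HDFS default.

```python
import math


def estimate_datanodes(data_tb, replication=3, overhead_fraction=0.25,
                       usable_tb_per_node=48.0):
    """Estimate raw HDFS storage and DataNode count for a planned data volume.

    All parameters except the default replication factor are illustrative
    assumptions, not recommendations.
    """
    if not 0 <= overhead_fraction < 1:
        raise ValueError("overhead_fraction must be in [0, 1)")
    if usable_tb_per_node <= 0:
        raise ValueError("usable_tb_per_node must be positive")

    # Every block is stored `replication` times; the remaining headroom is
    # reserved for temporary and intermediate data (assumed fraction).
    raw_tb = data_tb * replication / (1 - overhead_fraction)

    # Round up: a partially filled node is still a whole machine.
    nodes = math.ceil(raw_tb / usable_tb_per_node)
    return raw_tb, nodes


if __name__ == "__main__":
    raw, nodes = estimate_datanodes(100.0)
    print(f"Raw storage needed: {raw:.1f} TB across {nodes} DataNodes")
```

With these assumed inputs, 100 TB of data needs 400 TB of raw storage. Real planning also has to account for CPU, memory, network and expected growth, which the topics above cover.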
Module 4: Hadoop Cluster Administration, Backup, Recovery and Maintenance
· SafeMode
· Decommissioning, Commissioning and Re-Commissioning of Nodes
· Trash Functionality
· DistCp
· Rack Awareness
· HDFS / Hadoop Balancer
Module 5: Managing Resources and Scheduling
· Scheduler: Explanation and demo
o Capacity Scheduler
Module 6: HDFS Federation and High Availability
· Understand the YARN framework
· Understand HDFS Federation
· Understand High Availability
· High Availability Implementation Using Quorum Journal Manager
Module 7: Cloudera Setup and Performance Tuning
· Cloudera Distribution Hadoop
· Cloudera Features
· Cloudera Manager Editions
· Cloudera Manager Web UI
· CDH Installation
Module 8: Security
· Basics of Hadoop Platform Security
· Securing the Platform
· Understand Kerberos
· Configuring Kerberos on Cloudera Hadoop Cluster using LDAP authentication
What You Will Learn!
- Create a single-node Hadoop cluster on VMware.
- Create a multi-node Hadoop cluster on the AWS platform and learn how to submit jobs to a Hadoop cluster.
- Learn to plan a Hadoop cluster.
- Learn to commission, decommission and recommission machines.
- Learn to back up a cluster using the DistCp command, and to recover and maintain a Hadoop cluster.
- Learn how to enable the Capacity Scheduler in a Hadoop cluster.
- Enable NameNode High Availability on a Hadoop cluster.
- Learn to install Hadoop using Cloudera Manager and perform other administrative activities.
- Enable Kerberos security on a Cloudera Hadoop cluster using an LDAP connection with Active Directory.
- Learn how to monitor a Hadoop cluster.
Who Should Attend!
- Linux / UNIX administrators, data analysts and database administrators who are curious about Hadoop administration and how it relates to their work.
- Hadoop developers and Java developers who want to become Hadoop administrators.
- Software engineers and programmers who want to understand the administration of the larger Hadoop ecosystem.