Project Description

Este curso foi projetado para administradores que gerenciarão o Hortonworks Data Platform (HDP) 2.3 com o Ambari. Abrange a instalação, a configuração e outras tarefas típicas de manutenção de cluster.

Público-Alvo

Somente treinamentos para empresas (In-Company). Administradores de TI e operadores responsáveis ​​por instalar, configurar e suportar uma implantação do HDP 2.3 em um ambiente Linux usando o Ambari.

Requisitos

Somente treinamentos para empresas (In-Company). Os participantes devem estar familiarizados com os ambientes Hadoop e Linux.

Conteúdo Programático

Este é o conteúdo que será abordado durante o curso. [inglês]

1. AN INTRODUCTION TO BIG DATA, HADOOP AND THE HORTONWORKS DATA PLATFORM

OBJECTIVES

  • Big Data, Hadoop and the Hortonworks Data Platform
  • List Hadoop Cluster Management Choices
  • Describe Apache Ambari
  • Describe the Hadoop Distributed File System (HDFS)
  • Use Ambari Files View

LABS

  • Managing Ambari Users and Groups
  • Managing Hadoop Services
  • Using HDFS Storage
  • Using WebHDFS
  • Using HDFS Access Control Lists

2. MANAGING HDFS STORAGE, YARN RESOURCE MANAGEMENT

OBJECTIVES

  • Describe HDFS Architecture and Operation
  • Manage HDFS using Command-line Tools
  • Enable and Manage HDFS quotas
  • Describe YARN Resource Management
  • Summarize YARN Architecture and Operation
  • Identify and use YARN Management Options

LABS

  • Managing HDFS Storage
  • Managing HDFS Quotas
  • Configuring and Manage YARN
  • Non-Ambari YARN Management

3. YARN APPLICATIONS, MANAGING CLUSTER NODES AND RACK AWARENESS

OBJECTIVES

  • Understand the Basics of Running Simple YARN Applications
  • Identify Reason to Add, Replace and Delete Worker Nodes
  • Configure and Run the HDFS Balancer
  • Decommission and Re-commission a Worker Node
  • Configure and Manager YARN Queues
  • Summarize the Purpose and Benefits of Rack Awareness

LABS

  • Running Sample YARN Applications
  • Add, Decommission and Re-commission a Worker Node
  • Configure Users and Groups
  • Configure YARN Resources Queues
  • User Group and Resource Management
  • Configuring Rack Awareness

4. HDFS AND YARN HIGH AVAILABILITY, MONITORING CLUSTERS, INSTALLING HDP

OBJECTIVES

  • Summarize the Purpose of NameNode HA
  • Configure NameNode HA using Ambari
  • Summarize the Purpose and Operation of Ambari Metrics
  • Summarize Hadoop Backup Considerations
  • Identify Hadoop Cluster Deployment Options
  • Perform an Interactive HDP Installation using Apache Ambari

LABS

  • Configuring NameNode HA
  • Configuring Resource Manager HA
  • Configuring Ambari Alerts
  • Managing HDFS Snapshots
  • Using DistCp
  • Installing HDP