Cloudera

CA-CATAH

Administrator Training for Apache Hadoop

Price:
United Arab Emirates:
USD 3,195.00 excl. VAT

Duration: 4 Days

Who Should Attend

  • This course is best suited to systems administrators and IT managers who have basic Linux experience.
Dates
Date                     Country  Location  Language
04-11-2018 - 07-11-2018  AE       Dubai     English
04-11-2018 - 07-11-2018  SA       Riyadh    English
 
Prerequisites

Prior knowledge of Apache Hadoop is not required.

Associated Certification(s):

Upon completion of the course, attendees are encouraged to continue their study and register for the CCA Administrator exam. Certification is a great differentiator. It helps establish you as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

Course Overview

Cloudera University’s four-day administrator training course for Apache Hadoop provides participants with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster. From installation and configuration through load balancing and tuning, Cloudera’s training course is the best preparation for the real-world challenges faced by Hadoop administrators.

Course Objectives

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

  • Cloudera Manager features that make managing your cluster easier, such as aggregated logging, configuration management, resource management, reports, alerts, and service management 
  • The internals of YARN, MapReduce, Spark and HDFS
  • Determining the correct hardware and infrastructure for your cluster 
  • Proper cluster configuration and deployment to integrate with the data center 
  • How to load data into the cluster from dynamically generated files using Flume and from relational databases (RDBMS) using Sqoop
  • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster
  • Best practices for preparing and maintaining Apache Hadoop in production
  • Troubleshooting, diagnosing, tuning, and solving Hadoop issues
Course Outline
  • Introduction
  • The Case for Apache Hadoop 
  • Hadoop Cluster Installation 
  • The Hadoop Distributed File System (HDFS)
  • MapReduce and Spark on YARN
  • Hadoop Configuration and Daemon Logs 
  • Getting Data into HDFS
  • Planning Your Hadoop Cluster 
  • Installing and Configuring Hive, Impala, and Pig 
  • Hadoop Clients including Hue 
  • Advanced Cluster Configuration 
  • Hadoop Security 
  • Managing Resources
  • Cluster Maintenance
  • Cluster Monitoring and Troubleshooting
Further information

If you would like to know more about this course, please contact us.