4
GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s, Linux admins and Software engineers responsible for managing and maintaining Hadoop clusters. This is designed to provide knowledge to become a successful Hadoop Administrator. This course covers Hadoop architecture and its components, Managing, Maintaining, Monitoring and Troubleshooting a Hadoop Cluster. The focus of this course is to give the participants hands on experience, so there would be multiple assignments, quizzes and a project. COURSE OBJECTIVES Upon successful completion of this course, participants should be able to: f Describe the fundamental concepts of using Big Data f Identify where Hadoop fits into Big Data f Hadoop Architecture and HDFS f Gain insight on YARN and MapReduce f Installing and Configuring Apache Ecosystem Tools f Configuration and Performance Tuning f Learn about Hadoop Cluster f Manage, Maintain, Monitor and Troubleshoot a Hadoop Cluster COURSE INCLUSION ONE YEAR ACCESS Participants will have access to GreyCampus learn platform for a period of one year, This includes access to the Course PPTs, Reading material, Quizzes, Assignments, Project, and Class videos DEDICATED SUPPORT Participants will get the Technical and Nontechnical support through email within 1 business day. Participants can send their queries at [email protected] or they can call the toll free no: 1800 102 0723. FACT SHEET HADOOP ADMINISTRATOR TRAINING & CERTIFICATION BECOME A CERTIFIED HADOOP ADMINISTRATOR © www.greycampus.com

BECOME A CERTIFIED HADOOP ADMINISTRATOR DATA & HAD… · GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: BECOME A CERTIFIED HADOOP ADMINISTRATOR DATA & HAD… · GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s,

GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s, Linux admins and Software engineers responsible for managing and maintaining Hadoop clusters. This is designed to provide knowledge to become a successful Hadoop Administrator. This course covers Hadoop architecture and its components, Managing, Maintaining, Monitoring and Troubleshooting a Hadoop Cluster. The focus of this course is to give the participants hands on experience, so there would be multiple assignments, quizzes and a project.

COURSE OBJECTIVESUpon successful completion of this course, participants should be able to:

f Describe the fundamental concepts of using Big Data

f Identify where Hadoop fits into Big Data

f Hadoop Architecture and HDFS

f Gain insight on YARN and MapReduce

f Installing and Configuring Apache Ecosystem Tools

f Configuration and Performance Tuning

f Learn about Hadoop Cluster

f Manage, Maintain, Monitor and Troubleshoot a Hadoop Cluster

COURSE INCLUSIONONE YEAR ACCESSParticipants will have access to GreyCampus learn platform for a period of one year, This includes access to the Course PPTs, Reading material, Quizzes, Assignments, Project, and Class videos

DEDICATED SUPPORTParticipants will get the Technical and Nontechnical support through email within 1 business day. Participants can send their queries at [email protected] or they can call the toll free no: 1800 102 0723.

FACT SHEET

HADOOP ADMINISTRATOR TRAINING & CERTIFICATION

BECOME A CERTIFIED HADOOP ADMINISTRATOR

© www.greycampus.com

Page 2: BECOME A CERTIFIED HADOOP ADMINISTRATOR DATA & HAD… · GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s,

VIRTUAL MACHINEParticipants will be provided instructions to set up their Virtual Machine before the course starts.

HANDS ON PROJECTAt the end of the course participants submit a project which covers all the key aspects of the course. This allows them to implement techniques they learnt in the course.

COURSE CERTIFICATIONUpon completing 30 hrs of training participants will be provided a Project which they have to submit within 15 days. A successful completion of the project would make the participants eligible for the GreyCampus certificate.

30 PDUS30 PDUs will be sent to PMI credential holders within 2 business day upon request.

SYSTEM REQUIRMENTS

• Min. 2 MBPS Internet Connectivity

• Multimedia PC with Speakers/Headphones and Microphone

• Windows 7 (or newer) / Mac OS 10.7 (Lion) or newer

• Power backup (preferred) through the duration of the Live Online classes

• Power backup: for both Internet Router and PC

2

© www.greycampus.com

Page 3: BECOME A CERTIFIED HADOOP ADMINISTRATOR DATA & HAD… · GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s,

COURSE AGENDA 3

© www.greycampus.com

MODULE 1: UNDERSTANDING BIG DATA AND HADOOP

• Big Data

• Limitations and Solutions of existing Data

Analytics Architecture

• Hadoop

• Hadoop Features

• Hadoop Ecosystem

• Hadoop 2.x core components

• Hadoop Storage: HDFS

• Hadoop Processing: MapReduce Framework

• Anatomy of File Write and Read

• Rack Awareness.

MODULE 2: HADOOP ARCHITECTURE AND HDFS

MODULE 3: YARN AND MAPREDUCE

• Hadoop 2.x Cluster Architecture - Federation and

High Availability

• A Typical Production Hadoop Cluster

• Hadoop Cluster Modes

• Common Hadoop Shell Commands

• Installation of Hadoop on Single Node/Multi

Cluster env

• Hadoop 2.x Configuration Files, Password-Less SSH

• MapReduce Job Execution

• Data Loading Techniques: Hadoop Copy Commands

• FLUME

• SQOOP

• Node roles

• Data Processing

• Network configuration

MODULE 4: LOAD DATA AND RUN APPLICATIONS

• Hive

• Pig

• Mahout

• HBase

• Hcatalog/Hive

• Hbase Administration

• Data Loading Techniques: Hadoop Copy Commands

• FLUME

• SQOOP

• What Is MapReduce?

• Basic MapReduce Concepts

• YARN Cluster Architecture

• Resource Allocation

• Failure Recovery

• Using the YARN Web UI

• MapReduce Version 1

MODULE 5: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS

MODULE 6: ADVANCED CLUSTER CONFIGURATION

MODULE 7: HADOOP SECURITY

MODULE 8: MANAGING AND SCHEDULING JOBS

MODULE 9: CONFIGURATION AND PERFORMANCE TUNING

• OS

• JVM and Hadoop configuration parameters tuning

MODULE 10: INSTALLING AND CONFIGURING APACHE ECOSYSTEM TOOLS

• Checking HDFS Status

• Copying Data between Clusters

• Adding and Removing Cluster Nodes

• Rebalancing the Cluster

• Cluster Upgrading

• General System Monitoring

• Monitoring Hadoop Clusters

• Common Troubleshooting Hadoop Clusters

• Common Misconfigurations

• Checking Logs and Log File Locations

• Managing Running Jobs

• Scheduling Hadoop Jobs

• Configuring the FairSchedulers

• Why Hadoop Security Is Important

• Hadoop’s Security System Concepts

• What Kerberos Is and How it Works

• Securing a Hadoop Cluster with Kerberos

• Advanced Configuration Parameters

• Configuring Hadoop Ports

• Explicitly Including and Excluding Hosts

• Configuring HDFS for Rack Awareness

• Configuring HDFS High Availability

Page 4: BECOME A CERTIFIED HADOOP ADMINISTRATOR DATA & HAD… · GreyCampus provides Instructor Led Classes on Hadoop Administration. The course is intended for System Administrators, DBA’s,

© www.greycampus.com

TRAINED OVER 15,000PROFESSIONALS

REACH ACROSS50+ COUNTRIES

EXAM PASS RATE OFOVER 97 %

COURSES ACCREDITED BY LEADING GLOBAL BODIES

ABOUT GREYCAMPUS

GreyCampus is a leading provider of on-demand training that address the unique learning needs of professionals, delivered as online self-learning, live online training or in-person classroom training. Our aim is to provide quality training enabling professionals to achieve their certification and career enhancement goals. We offer training for certifications in areas of Big Data & Hadoop, Project Management, IT Service Management, Quality Management, Python Programming, Agile Training Coaching & Certification and Workplace Tools.

DISCLAIMER

“PMI®”, “PMBOK®”, “PMP®” “CAPM®” and “PMI-ACP®” are registered marks of the Project Management Institute, Inc.

The Swirl logo™ is a trade mark of AXELOS Limited.ITIL® is a registered trade mark of AXELOS Limited.PRINCE2® is a Registered Trade Mark of AXELOS Limited.IASSC® is a registered mark of International Association for Six Sigma Certification.

ACCREDITATIONS & ASSOCIATIONS

Provider ID : 3871