Big Data and Hadoop Solutions Architect online training is designed to make sure that you transform into a Hadoop Solutions Architect Expert. You will be able to gain expertise of core skill sets, including designing, deploying, maintaining, and securing Hadoop clusters, to lead the enterprise-wide administration of the Hadoop infrastructure of Big Data Ecosystem.
Preview
By the end of this training, you will learn,
1.DW architecture
2.Mapreduce features
3.Haoop features and configuration
4.Introduction to Hive,Pig,Hbase and VMware
Course Contents
Day 1
DW Definition
DW Architecture
Operation Data Bases(ODB)
Data Modeling 3NF
and Dimensional Modeling
OLTP and OLAP
ETL concepts
Top Down/Bottom up approaches
Bill Inmon approach – advantages and disadvantages
Ralph Kimbell approach – advantages
and disadvantages
Star and Snowflake schemas
Dimension modeling design considerations
Normalization techniques with live examples
Data Mart project examples
Customer, Products, Geo dimensional concepts
Hierarchy structures
Master Data Management systems
Information Management systems
NO SQL – BIG Data
ACID Model
CAP Model
Day 2
HADOOP Architecture
HDFS Architecture
HDFS Features
Intro Name node & Data Node
File storage & Replication
Build HADOOP Cluster–EC2/AWS
Hadoop Configuration
Hadoop MapReduce Features
MapReduce Job recovery
MapReduce Job Check
Cluster Rebalancing
Secondary Name Node features
Practice Hadoop Commands
Introduction to Hive
Installation of Hive
Hive SQLsHive internal and external tables
Hive Partitions
Introduction to SQOOP
Installation of Sqoop
Sqoop practice with Hadoop and HBase
Day 3
Introduction to Pig
Installation of Pig
Pig Relations, Bags, Tuples, Fields
Pig- expressions
Pig- Schemas
Pig- Join and Split Optimization
Pig- JSON
Introduction to HBASE
Architecture
Install Hbase
Region Servers , Master
Hbase with Hive
Hbase with Sqoop
Hbase with PIG
Installation on VM Ware
Installation of CDH4
Training Hours
Time: 12:00 NOON GMT | 07:00AM EST | 4:00AM PST | 6:00AM CST | 5:00AM MST | 5:30PM IST | 01:00PM GMT+1
Audience
1.Project Managers
2.Systems administrators and IT managers
3.IT administrators and operators
4.IT Systems Engineer
5.Data Management Professionals
6.Cloud Systems Administrator
7.Data Engineer