UNIT 1:
Challenges of Conventional Systems
Data - Analytic Processes and Tools
Modern Data Analytic Tools
Intelligent data analysis
Statistical Concepts: Sampling Distributions - Re-Sampling
Statistical Concepts: Sampling Distributions - Re-Sampling
UNIT 2:
History of Hadoop- The Hadoop Distributed File System
History of Hadoop- The Hadoop Distributed File System
History of Hadoop- The Hadoop Distributed File System
Developing a Map Reduce Application-How Map Reduce Works
Job Scheduling-Shuffle and Sort – Task execution
Developing a Map Reduce Application-How Map Reduce Works
Map Reduce Types and Formats- Map Reduce Features
Analyzing the Data with Hadoop Scaling Out- Hadoop Streaming
History of Hadoop- The Hadoop Distributed File System
UNIT 3:
Setting up a Hadoop Cluster
Cluster specification - Cluster Setup and Installation
HDFS - Monitoring-Maintenance
UNIT 4:
Data processing operators in Pig
Applications on Big Data Using Pig and Hive
Data processing operators in Pig
Fundamentals of HBase and ZooKeeper
Visualizations - Visual data analysis techniques
Interaction techniques; Systems and applications
Applications on Big Data Using Pig and Hive
Fundamentals of HBase and ZooKeeper
UNIT 5:
Introduction to R: Overview of R
Functions and packages in R
use R for doing statistical analysis and graphics
working with dataset in R
Adoption of R in Industry : Oralce R, Revolution Analytics
Adoption of R in Industry : Oralce R, Revolution Analytics