Evolution of Big Data - Best Practices for Big data Analytics - Big Data Characteristics - Validating - The promotion of the values of Big Data - Big Data Use Cases - Characteristics of Big Data Applications - Perception & Quantification of Value.
Introduction to Data Analytics, Visualization & Data Exploration, Basic & Intermediate analysis, Linear & Logistic Regression, Decision Tree
Introduction to HADOOP - HADOOP Scalability - Vertical Scalability, Horizontal Scalability - Anatomy of HADOOP.
Installation & Execution of Hadoop - Stand-alone mode, Pseudo Distributed mode and Fully Distributed mode - File Management - Create a Directory in HDFS - List the contents, upload & download a file in HDFS - Copy, move & remove a file in HDFS.
Weather Report POC, Map Reduce Program - Regression Modelling using R - Implementation of classification models of using R - Implementation of clustering models using R - Execution Big Data using R Programming
Reference Book:
1. Philipp K. Janert, "Data Analysis with open Source Tools", O' Reilley, 2010 2. EMC education Services, "Data Science & Big data Analytics : Discovering, Analyzing, Visualizing & Presenting Data", Wiley Publishers, 2015
Text Book:
1. David Livingstone, "Big Data Analytics: From Strategic Planning to Enterprise INtegration with Tools, Techniques, NoSQL & Graph", 2013.(Unit - I, II, III, IV, V) 2. Andrew Gelman & Jennifer Hill, "Data Analysis using Regression and multilevel / Hierarchical Models", Cambridge University Press,2007 (Unit - I, II, III, IV, V)