Introduction to BigData – Challenges of Conventional Systems - Intelligent data analysis – Data - Analytic Processes and Tools - Modern Data Analytic Tools - Statistical Concepts: Sampling Distributions - Re-Sampling - Statistical Inference - Prediction Error
History of Hadoop- The Hadoop Distributed File System – Components of Hadoop- Analyzing the Data with Hadoop- Scaling Out- Hadoop Streaming- Design of HDFS-Java interfaces to HDFS Basics- Developing a Map Reduce Application-How Map Reduce Works-Anatomy of a Map Reduce Job run-Failures-Job Scheduling-Shuffle and Sort – Task execution - Map Reduce Types and Formats- Map Reduce Features
Setting up a Hadoop Cluster - Cluster specification - Cluster Setup and Installation - Hadoop Configuration-Security in Hadoop - Administering Hadoop – HDFS - Monitoring-Maintenance-Hadoop benchmarks- Hadoop in the cloud
Applications on Big Data Using Pig and Hive – Data processing operators in Pig – Hive services – HiveQL – Querying Data in Hive - fundamentals of HBase and ZooKeeper - Visualizations - Visual data analysis techniques, interaction techniques; Systems and applications
Introduction to R: Overview of R; functions and packages in R; working with dataset in R; use R for doing statistical analysis and graphics; R commands . Adoption of R in Industry : Oralce R, Revolution Analytics
Reference Book:
1. A.Ohri, “R for Business Analytics”, Second edition, Springer, 2012 2. Chris Eaton, Dirk DeRoos, Tom Deutsch, George Lapis, Paul Zikopoulos, “Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data”, McGrawHill Publishing, 2012 (UNIT III-IV) 3. Bill Franks, “Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics”, John Wiley & sons, 2012. (UNIT III-IV) 4. Prabhanjan NarayanacharTattar, “R Statistical Application Development byExample Beginner's Guide”, PACKT, 2013 (UNIT V)
Text Book:
1. Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, Second Edition, Elsevier, Reprinted 2008. 2. Tom White, “ Hadoop: The Definitive Guide” Third Edition, O’reilly Media, 2012