Subject Details
Dept     : MCA
Sem      : 3
Regul    : R2019
Faculty : Yuvarani E
phone  : NIL
E-mail  : learnlearnn@gmail.com
1.031K
Page views
55
Files
4
Videos
0
R.Links

Icon
Syllabus

UNIT
1
INTRODUCTION TO BIG DATA

Introduction to BigData – Challenges of Conventional Systems - Intelligent data analysis – Data - Analytic Processes and Tools - Modern Data Analytic Tools - Statistical Concepts: Sampling Distributions - Re-Sampling - Statistical Inference - Prediction Error

UNIT
2
HADOOP

History of Hadoop- The Hadoop Distributed File System – Components of Hadoop- Analyzing the Data with Hadoop- Scaling Out- Hadoop Streaming- Design of HDFS-Java interfaces to HDFS Basics- Developing a Map Reduce Application-How Map Reduce Works-Anatomy of a Map Reduce Job run-Failures-Job Scheduling-Shuffle and Sort – Task execution - Map Reduce Types and Formats- Map Reduce Features

UNIT
3
HADOOP ENVIRONMENT

Setting up a Hadoop Cluster - Cluster specification - Cluster Setup and Installation - Hadoop Configuration-Security in Hadoop - Administering Hadoop – HDFS - Monitoring-Maintenance-Hadoop benchmarks- Hadoop in the cloud

UNIT
4
FRAMEWORKS

Applications on Big Data Using Pig and Hive – Data processing operators in Pig – Hive services – HiveQL – Querying Data in Hive - fundamentals of HBase and ZooKeeper - Visualizations - Visual data analysis techniques, interaction techniques; Systems and applications

UNIT
5
R PROGRAMMING

Introduction to R: Overview of R; functions and packages in R; working with dataset in R; use R for doing statistical analysis and graphics; R commands . Adoption of R in Industry : Oralce R, Revolution Analytics

Reference Book:

1. A.Ohri, “R for Business Analytics”, Second edition, Springer, 2012 2. Chris Eaton, Dirk DeRoos, Tom Deutsch, George Lapis, Paul Zikopoulos, “Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data”, McGrawHill Publishing, 2012 (UNIT III-IV) 3. Bill Franks, “Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics”, John Wiley & sons, 2012. (UNIT III-IV) 4. Prabhanjan NarayanacharTattar, “R Statistical Application Development byExample Beginner's Guide”, PACKT, 2013 (UNIT V)

Text Book:

1. Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, Second Edition, Elsevier, Reprinted 2008. 2. Tom White, “ Hadoop: The Definitive Guide” Third Edition, O’reilly Media, 2012

 

Print    Download