Subject Details
Dept       : CSE
Sem        : 7
Regulation : R 2017
Faculty    : Mr. Karthikeyan. K
Phone      : NIL
E-mail     : sns.cse.karthik@gmail.com

Syllabus

UNIT 1: INTRODUCTION TO BIG DATA

Introduction to Big Data Platform - Challenges of conventional systems - Web data - Evolution of analytic scalability, analytic processes and tools - Analysis vs reporting - Modern data analytic tools - Statistical concepts: sampling distributions, resampling, statistical inference, prediction error.

UNIT 2: DATA ANALYSIS

Regression modeling - Multivariate analysis - Bayesian modeling, inference and Bayesian networks - Support vector and kernel methods - Analysis of time series: linear systems analysis, nonlinear dynamics - Rule induction - Neural networks: learning and generalization, competitive learning, principal component analysis and neural networks - Fuzzy logic: extracting fuzzy models from data, fuzzy decision trees - Stochastic search methods.

UNIT 3: MINING DATA STREAMS

Introduction to Stream Concepts - Stream data model and architecture - Stream computing - Sampling data in a stream - Filtering streams - Counting distinct elements in a stream - Estimating moments - Counting ones in a window - Decaying window - Real-time Analytics Platform (RTAP) applications - Case studies: real-time sentiment analysis, stock market predictions.

UNIT 4: FREQUENT ITEMSETS AND CLUSTERING

Mining frequent itemsets - Market basket model - Apriori algorithm - Handling large data sets in main memory - Limited-pass algorithms - Counting frequent itemsets in a stream - Clustering techniques - Hierarchical - K-means - Clustering high-dimensional data - CLIQUE and PROCLUS - Frequent pattern based clustering methods - Clustering in non-Euclidean space - Clustering for streams and parallelism.

UNIT 5: FRAMEWORKS AND VISUALIZATION

MapReduce - Hadoop, Hive, MapR - Sharding - NoSQL databases - S3 - Hadoop Distributed File System (HDFS) - Visualizations - Visual data analysis techniques, interaction techniques; Systems and applications:

Reference Books:

1. Bill Franks, Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics, John Wiley & Sons, 2012.
2. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons, 2007.
3. Pete Warden, Big Data Glossary, O’Reilly, 2011.
4. Jiawei Han, Micheline Kamber, Data Mining Concepts and Techniques, Second Edition, Elsevier, Reprinted

Text Books:

1. Michael Berthold, David J. Hand, Intelligent Data Analysis, Springer, 2007.
2. Anand Rajaraman and Jeffrey David Ullman, Mining of Massive Datasets, Cambridge University Press, 2012.

 
