Subject Details
Dept     : CSE
Sem      : 6
Regul    : 19
Faculty : Vijayalakshmi
phone  : NIL
E-mail  : vlakshmi.n.cse@snsct.org
358
Page views
28
Files
1
Videos
1
R.Links

Icon
Syllabus

UNIT
1
INTRODUCTION

Data Science Process: Roles and stages in a data science project, working with files and databases, Exploring and managing data; Big Data- Types, Characteristics, Tools and Applications; Data Analytics- Types, Tools and Applications; Data and Relations: Data set - Data Scales - Set and Matrix Representations - Relations - Similarity Measures - Dissimilarity Measures - Sequence Relations – Sampling and Quantization.

UNIT
2
PREPROCESSING AND VISUALIZATION

Data preprocessing : Error Types - Error Handling - Filtering - Data Transformation - Data Merging; Data visualization: Diagrams - Principal Component Analysis - Multidimensional Scaling - Auto Associator - Histograms - Spectral Analysis.

UNIT
3
CORRELATION AND REGRESSION

Correlation: Linear Correlation - Causality - Chi-Square Tests; Regression: Linear Regression - Robust Regression - RBF Networks - Cross Validation - Feature Selection

UNIT
4
CLASSIFICATION

Classification: Classification Criteria - Naive Bayes‘ Classifier -Rule Based Classification – Classification by Back Propagation - Support Vector Machine - Decision Trees - Lazy Learners – Model Evaluation and Selection-Techniques to improve Classification Accuracy.

UNIT
5
CLUSTERING

Clustering: Cluster Partitions - Sequential - Prototype-Based - Fuzzy - Relational - Cluster Tendency Assessment - Cluster Validity - Self Organizing Maps; Case Study: Advertising on the Web.

Reference Book:

Dean J, “Big Data, Data Mining and Machine learning”, Wiley publications, 2014. Provost F and Fawcett T, “Data Science for Business”, O‘Reilly Media Inc, 2013. Janert PK, “Data Analysis with Open Source Tools”, O‘Reilly Media Inc, 2011.

Text Book:

Runkler TA, “Data Analytics: Models and algorithms for intelligent data analysis”, Springer, Third Edition 2020.

 

Print    Download