Subject Details
Dept     : AIDS
Sem      : 5
Regul    : 2019
Faculty : ashokkumar
phone  : NIL
E-mail  : ashok.r.it@snsce.ac.in
834
Page views
42
Files
0
Videos
5
R.Links

Icon
Syllabus

UNIT
1
INTRODUCTION TO DATA SCIENCE AND BIG DATA

Data Science – Fundamentals and Components – Data Scientist – Terminologies Used in Big Data Environments – Types of Digital Data – Classification of Digital Data – Introduction to Big Data – Characteristics of Data – Evolution of Big Data – Big Data Analytics – Classification of Analytics – Top Challenges Facing Big Data – Importance of Big Data Analytics – Data Analytics Tools.

UNIT
2
DATA SCIENCE USING PYTHON

Introduction to Essential Data Science Packages: Numpy: Numpy Data types, Scipy, Jupyter, Statsmodels and Pandas Package – Scikit learn, R programming . Programs : Numpy - Creation of Arrays, Indexing and Slicing Operations, Copy and View Scipy – Manipulation of mathematical functions using special package, Pandas – Creation of Series, Creation of DataFrame

UNIT
3
DATA ANALYTICAL FRAMEWORKS

Introducing Hadoop – Hadoop Overview – RDBMS versus Hadoop – HDFS (Hadoop Distributed File System): Components and Block Replication – Processing Data with Hadoop – Introduction to MapReduce – Features of MapReduce, YARN, HBASE

UNIT
4
DATA PREPARATON AND VISUALIZATION

Data Munging: Introduction to Data Munging, Data Pipeline and Machine Learning in Python – Data Visualization Using Matplotlib – Interactive Visualization with Advanced Data Learning Representation in Python Program - Creation of various plots using pyplo

UNIT
5
BIG DATA DATABASES

Introduction to NoSQL: Types of NoSQL Databases-Key-value store, Document store, Column family, Graph store, CAP theorem – CAP Theorem NoSQL databases, MongoDB: RDBMS Vs MongoDB – Mongo DB Database Model – Data Types, Sharding –Types of sharding, Introduction to Hive – Hive Architecture – Hive Query Language (HQL).

Reference Book:

1.Alberto Boschetti, Luca Massaron, “Python Data Science Essentials”, Packt Publications, 2nd Edition, 2016. 2. Yuxi (Hayden) Liu, “Python Machine Learning”, Packt Publication, 2017. 3.VDT Editorial Services, Big Data, Black Book, Dream Tech Press, 2015.

Text Book:

1. Seema Acharya, Subhashini Chellapan, “Big Data and Analytics”, Wiley, 2015. 2. Frank Pane, “Hands On Data Science and Python Machine Learning”, Packt Publishers, 2017.

 

Print    Download