
Lecture Notes
Dear Students the Lecture Notes has been uploaded for the following topics:
Introducing Hadoop –Hadoop Overview – RDBMS versus Hadoop ,
HDFS (Hadoop Distributed File System):,
Components and Block Replication ,
Processing Data with Hadoop – Introduction to MapReduce 
Lecture Notes
Dear Students the Lecture Notes has been uploaded for the following topics:
Types of Data ,
Mean, Median and Mode – Standard Deviation and Variance ,
Probability Density Function ,
Types of Data Distribution ,
Percentiles and Moments – Correlation and Covariance,
Conditional Probability – Bayes’ Theorem ,
Introduction to Univariate, Bivariate and Multivariate Analysis ,
Principal Component Analysis (PCA) ,
Dimensionality Reduction using Principal Component Analysis and LDA ,
Linear Regression – Polynomial Regression – Multivariate Regression – Multi Level Models ,
Data Warehousing Overview ,
Bias/Variance Trade Off – K Fold Cross Validation ,
Data Cleaning and Normalization – Cleaning Web Log Data ,
Detecting Outliers ,
Introduction to Machine learning algorithms,
Supervised Learning,
Unsupervised Learning,
Reinforcement learning 
Lecture Notes
Dear Students the Lecture Notes has been uploaded for the following topics:
Data Science – Fundamentals and Components ,
Terminologies Used in Big Data Environments,
Types of Digital Data ,
Classification of Digital Data,
Introduction to Big Data – Characteristics of Data ,
Evolution of Big Data ,
Classification of Analytics ,
Top Challenges Facing Big Data – Importance of Big Data Analytics ,
Data Analytics Tools. 
Question Bank
Dear Students the Question Bank has been uploaded for the following topics:
Question Bank IAE 1 
Youtube Video
Dear Students the Youtube Video has been uploaded for the following topics:
Hadoop
HDFS
Mapreduce
Hive 
Assignment
Assignment topic is Big Data Characteristics and due date is 20042023.

Assignment
Assignment topic is Big Data Characteristics and due date is 21042023.

Resource Link
Dear Students the Resource Link has been uploaded for the following topics:
Pandas
bigdatacharacteristics
PCA
Supervised Learnong